The Prediction Engine Beneath the Mind

Aditya Mohan

Content including text and images © Aditya Mohan. All Rights Reserved. Robometircs, Amelia, Living Interface and Skive it are trademarks of Skive it, Inc. The content is meant for human readers only under 17 U.S. Code § 106. Access, learning, analysis or reproduction by Artificial Intelligence (AI) of any form directly or indirectly, including but not limited to AI Agents, LLMs, Foundation Models, content scrapers is prohibited. These views are not legal advice but business opinion based on reading some English text written by a set of intelligent people.

"All men by nature desire to know.”

— Aristotle

"Next-token prediction may teach a machine to speak, but only contact with the world can teach it what its words are worth.”

— Aditya Mohan, Founder, CEO & Philosopher-Scientist, Robometrics® Machines

I. Biological Prediction Is Not Flat Sequence

The human mind does not learn as a flat sequence of next events. A child reaching toward a cup is not merely predicting the next visual frame or the next muscle contraction. The child is building layers: edge, contour, depth, object, weight, risk, affordance, intention, social meaning, memory, and consequence. The cup is not just a symbol in the stream of experience; it is cool ceramic against the palm, a fragile thing that spills, a thing adults use, a thing that can be offered, refused, dropped, washed, inherited, or ritualized. Biological learning is hierarchical because reality arrives with hidden causes behind visible effects. The retina receives photons, but the organism must infer surfaces. The ear receives pressure waves, but the child must infer voice, mood, warning, invitation. The skin receives pressure and heat, but the body must infer danger, comfort, contact, and care. At the lowest levels, the brain predicts sensation; at higher levels, it predicts objects, agents, purposes, social rules, and futures. Human intelligence is not only autoregressive. It is predictive, embodied, corrective, metabolic, emotional, and value-laden. It asks not only “what comes next?” but “what matters next?”

This distinction matters because autoregression can sound too small for what large language models now appear to do. The external training objective of an autoregressive transformer is brutally simple: given previous tokens, predict the next token. A text is divided into small pieces; each piece becomes a vector; the transformer passes these vectors through many layers of attention and nonlinear transformation; the final state is mapped into a probability distribution over possible next tokens. During training, the model is penalized whenever probability mass is placed on the wrong continuation. It does this at enormous scale, across code, fiction, law, science, conversation, mathematics, manuals, poetry, argument, and the imperfect fragments of ordinary human speech.

The objective is narrow. The pressure is not. To predict the next word in a legal opinion, the system must learn something about legal form. To predict the next step in a proof, it must learn something about formal dependency. To continue a medical explanation, it must learn correlations among symptoms, anatomy, uncertainty, and professional caution. To complete a line of dialogue, it must infer role, tone, motive, and social tension. The surface command is “continue the sequence.” The hidden demand is more demanding: compress the world as it has been expressed in language.

II. The Hidden Work Inside an Autoregressive Transformer

Inside the transformer, the next-token task is not handled by a single act of memory. It is distributed across an architecture that repeatedly rewrites its own representation of the context. Attention heads decide which earlier positions are relevant to the current position. Some heads track syntax. Some appear to retrieve names, objects, or repeated phrases. Some behave like pattern-copying devices, recognizing that a prior structure is being repeated and bringing the earlier continuation forward into the present. Feed-forward layers, often described as multilayer perceptrons, act less like passive storage and more like great banks of nonlinear feature detectors and feature composers.

The residual stream carries a working state through the model. Each layer reads from it, writes into it, and alters the geometry of the model’s expectation. By the time the model predicts a token, the visible word is only the final surface of a long internal negotiation among grammar, topic, role, intent, memory traces, style, and latent structure. The answer may appear as a sentence, but before that sentence appears, the network has transformed the prompt into a dense internal state: part linguistic map, part task estimate, part probability field, and part compressed memory of learned regularities.

The important scientific clue is that researchers have begun to find signs of organized internal representations rather than mere surface copying. In small transformer models trained on synthetic games, investigators have shown that a network trained only to predict legal next moves can develop an internal representation of the board state. That is striking because the board is never handed to the system as a clean symbolic object; the model must infer the hidden game state from a sequence.

Other studies find that language models can encode spatial and temporal information in surprisingly orderly ways, including representations that correlate with geographic location and historical time. Mechanistic interpretability work has identified induction heads: circuit-like attention patterns that support in-context learning by matching a previous pattern and copying the associated continuation. Sparse feature work also suggests that what looks like one neuron doing many unrelated things may actually be many features packed into shared activation space, with directions in the model’s geometry carrying concepts more cleanly than single units do.

None of this proves that a language model understands the world as a human does. But it does weaken the shallow claim that the model is only storing phrase fragments. A better description is that the model learns compressed latent structure useful for continuing text. It is trained on sequence, but it may learn hidden state. It is trained on language, but it may form internal coordinates for objects, agents, places, time, and procedures because these structures make language more predictable.

III. What the Network Learns Beneath the Surface

A careful educated guess is this: next-token prediction forces the model to learn a hierarchy of latent causes behind language. At the lowest level, it learns spelling, punctuation, token frequency, and local grammar. Above that, it learns phrase structure, long-range dependency, discourse pattern, genre, and speaker role. Higher still, it begins to infer hidden variables: what subject is being discussed, what kind of document this is, whether the writer is joking or warning, what facts are likely assumed, what social contract governs the answer, what step in a procedure comes next, and what kind of reasoning style the user is requesting.

These hidden variables are not stored as neat human-readable labels. They are distributed across high-dimensional activation patterns. But functionally, they behave like internal estimates of the situation. The model may not merely ask, “What word often follows these words?” It may be approximating something closer to, “What world, speaker, task, rule, tone, and consequence would make this continuation probable?”

This is where autoregression becomes more interesting than it first appears. A model that must predict the next token in a recipe benefits from an internal sense of ingredients, sequence, heat, and completion. A model that must predict the next line of code benefits from an internal sense of variables, functions, scope, and intended behavior. A model that must answer a question about history benefits from an internal sense of chronology, cause, institution, and source reliability. The learned representation is not necessarily human understanding, but neither is it mere verbal echo. It is a compressed machine-native approximation of the structures that make text possible.

The phrase “just predicting the next token” can mislead because it confuses the training signal with the learned machinery. A student trained only by exam scores may still learn algebra, not merely learn how to darken bubbles on a test sheet. A pilot judged only by landing performance may still learn weather, engine behavior, cockpit discipline, and judgment. Similarly, an autoregressive transformer trained only to reduce prediction error may learn representations of syntax, relation, causality, physical regularity, social intent, and factual compatibility because those are useful for reducing error.

The loss function is the narrow doorway. The model’s internal response to that pressure can be much wider. Prediction becomes an evolutionary habitat inside mathematics. Circuits that help track entities survive. Features that help infer time, place, speaker, or cause survive. Representations that help separate likely from unlikely explanations survive. The network becomes a fossil record of every statistical pressure that helped it predict better.

The transformer’s internal representations also appear to be layered in time. Early layers often preserve local details: word identity, nearby syntax, phrase-level cues. Middle layers tend to become richer and more relational: entities are tracked, references are resolved, facts are associated, and latent task structure begins to form. Later layers become more answer-facing: the model’s state is shaped toward the distribution of possible completions. This is not a strict rule, and different models vary. Still, it offers a useful mental picture. The model is not a library shelf retrieving a sentence. It is more like a predictive observatory. The lower instruments collect local signals. The middle instruments infer the scene. The upper instruments prepare an action in language. At each step, the system transforms tokens into a more compressed, more task-shaped internal state.

There is also a strange economy at work. Neural networks cannot afford to dedicate one clean unit to every concept, relation, tone, fact, and procedure in language. They appear to pack many features into shared spaces, letting different concepts occupy different directions rather than different isolated boxes. This is why a single neuron can look confusing: it may activate across several unrelated contexts because the representation is distributed. Sparse autoencoder research tries to separate these mixed signals into more interpretable features. The result is a picture of the LLM not as a bag of memorized text, but as a dense, overfilled warehouse of latent features: some crisp, some blended, some causal for behavior, some incidental, many still beyond our ability to name.

The learning may therefore be much richer than the training objective suggests, but also less stable and less grounded than human common sense assumes. A model can carry a direction associated with truth and still hallucinate. It can encode geography and still fail at a simple map question. It can infer task structure and still collapse when the framing changes. Its representations are real, but not sacred; useful, but not identical to human understanding.

IV. From Language Prediction to Embodied Prediction

Biological intelligence pays a different price for prediction. The brain is not minimizing textual surprise; it is preserving a living organism across time. It must decide when to trust memory, when to fear a shadow, when to ignore noise, when to learn from one painful mistake, and when to keep exploring despite uncertainty. It learns not only through passive exposure but through action: touching, falling, dancing, repairing, hesitating, imitating, and asking. Human hierarchy is not only representational; it is temporal, bodily, social, and moral.

A dancer learns the step, then the phrase, then the lead, then the breath between movements, then the emotional agreement between two bodies sharing time. A pilot learns an instrument scan, then aircraft energy, then weather judgment, then the quiet humility of knowing when not to go. These are not merely longer context windows. They are nested worlds of feedback, risk, and meaning. A body turns prediction into consequence. A nervous system does not merely anticipate the next signal; it must survive the cost of being wrong. The most interesting future may come from joining these two lessons without pretending they are the same. Autoregression has revealed that prediction, even in the narrow form of next-token learning, can become a gateway into abstraction. Biology reveals the complementary truth: intelligence becomes robust when prediction is grounded in action, sensation, hierarchy, memory, and consequence.

The machine that only predicts language may become eloquent; the machine that predicts through a body may become situated. The first learns the shadow of the world in words; the second begins to test the weight of the world through contact. The next frontier of AGI may not be a model that merely says better things, but a system whose representations are disciplined by reality: a machine that learns from sequence, acts in space, revises itself through error, builds durable concepts, and discovers that the world is not a sentence to be completed, but a resistance to be understood.

Further read

LLM, Us and Neuroplasticity

"Are you sure?": Unveiling the Quest for Human-Like Artificial Intelligence

Designers, creativity and AI

A LLM as a reflection of our inner most desires

Should AGI be a she, he, or a it?