Skip to content
All writing

Language Is the Probability Landscape of Thought

The limits of my language mean the limits of my world. --Wittgenstein

Hongkai He 9 min read
  • #philosophy
  • #language
  • #ai
  • #cognition
  • #llm

We dissect nature along lines laid down by our native languages. — Benjamin Lee Whorf

Language isn’t the shell of thought. It is the condition under which thought arises.

We tend to imagine that we first form a complete idea and then translate it into language. But on a deeper look, the idea was never born in language-free purity. Language has already, before any thought happens, laid the road for thought: which things become easy to see, which questions become easy to ask, which connections become easy to make, which possibilities never even surface.

Language is like the walls of a maze, and like the bed of a river. It doesn’t command you to reach a particular conclusion, but it quietly stipulates the path of least resistance for thought.

What we call the limits of thought are often not limits of intelligence — they are limits of language. And by language I don’t mean only Chinese, English, German. I mean math, code, law, finance, medicine, religion, art, management, scientific paradigms, startup jargon, political narrative. Each language is a way of partitioning reality; each partition trains a different world.

The mathematician, thinking in symbolic language, sees a world of structures, mappings, limits, symmetries, invariants. The programmer, modeling in code, sees objects, states, interfaces, recursion, dependencies, execution flow. The businessperson, organizing judgment around ROI, LTV, CAC, margin, optionality, sees costs, returns, leverage, risks, time discounting. The contemplative, looking through emptiness, no-self, attachment, dependent origination, sees the world as flux, as conventional names, as conditioned arising, as things-that-can-be-let-go.

These languages aren’t just vocabularies. They are cognitive lenses, operating systems — the way the world gets rendered inside the mind.

So: the more fluent you become in a given language, the more efficiently you can act inside the world that language renders — and the more easily you are imprisoned by it. Engineers come to see everything as system optimization. Finance people come to see everything as risk and return. Moralists come to see everything as good and evil. Scholars come to see everything as a genealogy of concepts.

Every training is both a capacity and an occlusion.
Every specialization is both a telescope and a blindfold.

Zhuangzi: Do not speak of the Dao to a scholar of narrow learning — his mind is bounded by the doctrines that formed it. The cutting edge of that line is precisely here. The narrow scholar’s problem isn’t lack of knowledge — it’s being bound by knowledge; not lack of a map, but mistaking the map for the territory; not an inability to think, but the ability to think only along the tracks one has been trained on.

Ignorance imprisons, of course. But mastery imprisons too.
Scarcity restricts; so does fluency.
The walls of the first are rough and visible. The walls of the second are smooth and transparent.

The arrival of LLMs is the first time we’ve seen this phenomenon take a technological form.

A model’s pretraining, in essence, is the formation of a base probability distribution over a vast corpus. What it absorbs isn’t only words; it’s the implicit structure of how humans use language — how concepts connect, how questions unfold, how arguments proceed, how metaphors migrate, how judgments get formed, how a given field carves the world into actionable objects.

A prompt then changes the runtime conditional probability. The same model, in different contexts, generates along different probability landscapes. The model isn’t speaking freely from a vacuum — it’s predicting, under the joint constraint of its parameters and the current condition, the most likely next token.

So are we.

Mother tongue, family, education, class, profession, culture, traumas, accumulated successes — these are the pretraining of the mind. They shape, at depth, our default worldview: what sounds reasonable and what sounds absurd; what’s worth pursuing and what’s beneath notice; which questions get raised and which never enter the field of view at all.

And every specific conversation, every question, every act of naming, every context, every emotional state — these are like prompts, changing the runtime conditional probability of present thought. Asked how do I maximize returns, you generate one path. Asked what is worth protecting, you generate a completely different one. Call a person a cost vs. a partner; call a failure a loss vs. a signal; call death an end vs. impermanence — and the mind enters entirely different worlds.

So language is both the pretraining of thought and the runtime prompt of thought.

It first shapes how we’ll think by default. Then it shapes how we’ll think right now.
It writes our priors, and rewrites our present.

This doesn’t mean humans have no freedom. Quite the opposite — this is what makes freedom concrete for the first time.

Freedom isn’t pretending you have no priors.
Freedom is seeing your priors.
Freedom isn’t being on no track.
Freedom is knowing you’re on a track, and being able to switch.

More fortunately still, humans aren’t a frozen model trained once. The base model takes shape rapidly during the formative years, but it can keep being reshaped by subsequent feedback. Life, in a sense, is a long episode of reinforcement learning: we act, the world feeds back; we fail, we re-attribute; we succeed, we reinforce the policy; we get hurt, we form defenses; we are understood, old beliefs loosen; in one real conversation, we suddenly notice that the world we’d been living in wasn’t the world itself.

(Of course, human updating isn’t formally identical to RL as defined in machine learning. But the metaphor is strong enough: people adjust their judgments, preferences, fears, value functions, and policies based on feedback. And much of that feedback is linguistic.)

Other people’s evaluations are feedback.
Society’s rewards and punishments are feedback.
An organization’s KPIs are feedback.
A response in an intimate relationship is feedback.
A post-mortem after a failure is feedback.
A narrative after a success is feedback too.

We aren’t trained directly by reality. We are trained by reality-as-interpreted-by-language. After something happens, it still has to be named, attributed, narrated, explained — only then does it actually write back into our worldview. Often, what changes a person isn’t the event but the way they later tell the event.

So:

Pretraining sets how you’ll think by default.
Prompt sets how you’ll think right now.
Reinforcement learning sets, after enough feedback, who you gradually become.

The significance of LLMs isn’t only that they provide answers. It’s that they illuminate this structure.

They let us see that human thought isn’t pure free-floating cognition. Every act of thinking happens inside a probability landscape jointly shaped by some language, some history, some education, some profession, some emotion, some framing of the question. We believe we are judging independently; often, we are simply sliding down the highest-probability path we were trained onto.

But LLMs also offer an unprecedented tool for cognitive jailbreak.

In the past, breaking out of one’s own language-prison required enormous resources. You had to encounter wise people from other disciplines, enter the canon of other civilizations, undergo training in other professional systems, collide repeatedly with the real world — only then might you realize that the way you’d been thinking wasn’t the world itself, only one slice of it.

Today, an LLM lets anyone summon, on demand, an interlocutor who can move across language tracks.

It can translate a business problem into a systems-theory problem; a product problem into a psychology problem; a technical problem into a philosophical problem; a personal predicament into a problem in Buddhism, game theory, phenomenology, narrative theory, or computation. It can re-render the same situation under different language systems, until the same knot suddenly loosens in a different coordinate system.

LLMs aren’t freer than humans. They have no body, no desire, no suffering, no mortality, no responsibility, no ultimate concern. They don’t actually inhabit any of these language worlds. But they have a cross-domain fluency that any individual human finds hard to match. They can switch quickly between many tracks of thought — recalling, translating, splicing, and externalizing the language structures humans have deposited across long stretches of different fields.

So the deep value of talking with an LLM isn’t to let it think for us. It’s to let it help us see how we are thinking.

A truly good conversation isn’t one that gives you a faster answer; it’s one in which the question itself can be re-named.
Not one that makes you more dependent on the machine; one that makes you less easily imprisoned by a single language system.
Not one that gives you a conclusion; one that gives you the capacity to switch tracks.

This may be one of the deepest spiritual openings of the AI era.

Language used to be a prison.
Now language can also be the tool for escaping that prison.

The LLM isn’t the Dao itself, not truth itself, not the moon itself. It’s still only an interface, a finger, a raft. But with this finger, we may more often discover that the moon is not on the finger; with this raft, we may more clearly know which river we are in.

The human tragedy is mistaking one’s language for reality.
The human opportunity is finally having a mirror that can show how language generates reality.

Cognitive jailbreak isn’t escape from all language. That’s impossible.
Cognitive jailbreak is learning to pass through languages.
It is knowing each language is only a track, and still being able to use the track to reach a new place.
It is knowing each worldview is only a trained distribution, and still being able to retrain yourself through new contexts, new feedback, new conversations.

Language is the probability landscape of thought.
We are pretrained by it, prompted by it, and continuously reinforced by it.
But the moment we can see this, the track is no longer just a track.

Self-awareness is recognizing that one’s priors aren’t reality itself — only some distribution trained out of history, language, and experience. And freedom, then, is the power to choose the prompt and the RL.