What Is Understanding?
In the early seventeenth century, Johannes Kepler spent years tabulating the positions of Mars, working from the painstaking observations of Tycho Brahe. The result was a table—thousands of data points describing where Mars appeared in the sky on specific dates. That was knowledge. Then Kepler discovered that all those data points could be generated by a single rule: Mars moves in an ellipse with the Sun at one focus, sweeping equal areas in equal times. Thousands of observations, compressed into one sentence. That was understanding. The table told you where Mars had been. The ellipse told you where Mars would be, where it must be, and why.
Understanding as Compression
Knowledge is data. Understanding is compressed data. When you understand something, you have found the pattern that generates the observations—the compact rule that, once grasped, makes a large body of facts redundant because you can derive them.
Isaac Newton, building on Kepler, compressed further. Not just Mars but every gravitational interaction in the universe follows the same law: force proportional to mass, inversely proportional to the square of distance. Kepler’s ellipses became a special case. Falling apples, orbiting moons, spiraling galaxies—all generated by one equation. The compression ratio (the relationship between the size of the principle and the range of phenomena it explains) is staggering. That ratio is what we mean by depth of understanding.
Charles Darwin achieved the same kind of compression in biology. The diversity of all living things—millions of species, billions of adaptations, the entire tree of life—generated by one algorithm: variation, selection, retention. Supply and demand compress the behavior of millions of market participants into a workable model. Each is a compact generator for vast observation sets. Understanding that requires as many rules as observations is not understanding at all. It is cataloging. Deep understanding is high compression: few principles, many predictions.
The Test: Prediction
How do we know if we actually understand something? Prediction. If your model predicts accurately in situations you have not yet encountered, you have captured something real about the structure of the world. If it only “predicts” what you have already seen, you have memorized, not understood.
This is the difference between knowing facts about chess and understanding chess. A chess grandmaster does not have more positions memorized than an amateur—or at least, memorization is not the source of their advantage. They have better compression of position evaluation. They see the structure of the board, the relationships between pieces, the dynamic patterns that determine which positions lead where. In other words, they predict which moves will lead to which outcomes, and they do so in positions they have never seen before.
Understanding enables transfer. The same compression applies in new contexts because it captures structure rather than surface features. Memorization does not transfer—it is indexed to the specific examples encountered and fails when the context shifts. Understanding transfers because the principle operates regardless of the particular case.
Levels of Understanding
Understanding is not binary. It exists on a spectrum, and the levels are worth distinguishing because they correspond to different degrees of compression and different degrees of predictive power.
Level zero: no model. Raw observation without pattern. “This happened, then that happened.” No compression. No prediction beyond simple repetition. This is the state of someone encountering a phenomenon for the first time with no framework to interpret it.
Level one: correlation. “When X happens, Y usually follows.” A pattern without a mechanism. This enables prediction within the observed range but breaks down outside it, because you do not know why the pattern holds and therefore cannot predict when it will stop holding. Most folk wisdom operates at this level.
Level two: mechanism. “X causes Y because Z.” A causal model that explains why the pattern exists. This is a deeper compression because it predicts not just what happens under observation but what happens under intervention—what would change if you changed X, and why. Medical understanding often operates at this level: the drug lowers blood pressure because it inhibits this specific enzyme.
Level three: principles. “Z is an instance of general principle P.” The mechanism you identified connects to something broader. The specific causal chain is a case of a pattern that appears across multiple domains. This is where cross-domain understanding lives—the recognition that natural selection, market competition, and immune response all instantiate the same variation-selection-retention algorithm.
Level four: derivation. “P follows necessarily from more fundamental principles.” The deepest level. The principle could have been derived even if it had never been observed. Understanding at this level is rooted in bedrock—in the mathematical or logical structure of reality itself. Einstein deriving the photoelectric effect from quantum principles, or the second law of thermodynamics following from statistical mechanics, are understanding at level four.
Higher levels compress more. Lower levels are more fragile—they break when context shifts because they lack the depth to adapt. Moving up the levels is the work of genuine understanding.
What Understanding Feels Like
Understanding has a distinctive phenomenology—a characteristic feel that distinguishes it from mere familiarity or verbal fluency.
There is a reduction of surprise. Things that seemed mysterious become expected. The “aha” moment is the experience of the world suddenly making more sense—of many disparate observations snapping into a pattern that was always there but not yet seen. “Of course it works that way” is the feeling of compression achieved.
There is connection. Separate phenomena link up. “This is the same pattern as that” is the recognition that what looked like unrelated observations are generated by the same underlying structure. The pleasure of understanding is partly the pleasure of unification—the world becoming simpler, not because complexity is ignored but because the structure beneath it is found.
There is generative capacity. You can derive implications you never explicitly learned. You can answer questions you have never been asked, because the compression gives you a generator, not just a list. This is the most reliable sign of genuine understanding: the ability to go beyond what you were told.
But this feeling can mislead. False understanding feels like understanding. A coherent story, a familiar framework, a confident explanation—all produce the subjective experience of comprehension without necessarily delivering the predictive power that genuine understanding provides. The check is prediction, not feeling. If your understanding does not predict, it is not understanding. It is comfort.
Obstacles to Understanding
Several things masquerade as understanding and are worth naming because they are common and seductive.
Familiarity. “I have seen this before” feels like “I understand this.” But recognition is not comprehension. You can recognize a chess position without understanding its strategic implications. You can recognize a disease name without understanding its mechanism. Familiarity produces a feeling of knowing that substitutes for the real thing.
Fluency. “I can talk about this” feels like “I understand this.” But verbal fluency can exist without compression. You might have the vocabulary, the jargon, the standard phrases—without having the model that generates them. Fluency is the ability to produce the right words. Understanding is the ability to produce the right predictions.
Confirmation. “This matches my expectations” feels like “I understand this.” But expectations can come from prior belief rather than valid models. When new information confirms what you already thought, the feeling of understanding is strong—but it may reflect the comfort of confirmation rather than the accuracy of your model. Confirmation is not validation.
Understanding and Cross-Domain Coherence
Cross-domain coherence testing is a structural check for genuine understanding. If the same principle explains phenomena in physics, biology, and economics, that convergence is evidence of deep structure—of a compression that is tracking reality rather than artifact. A principle that works in only one domain might be a local pattern. A principle that works across many domains is almost certainly touching something fundamental.
First-principles reasoning seeks the highest compression level available. Do not stop at “it works this way.” Push to “it must work this way because…” The deeper you go, the more robust the understanding—the less likely it is to break when the context shifts.
The goal of understanding is not to accumulate more information. It is to find the smallest set of principles that generates the largest set of accurate predictions. Fewer principles, more predictions. That is understanding. Everything else is bookkeeping.
How This Was Decoded
This essay integrates algorithmic information theory (Kolmogorov complexity, which formalizes the relationship between compression and understanding), philosophy of science (theories of explanation from Hempel to Kitcher), cognitive science research on expertise and mental models, and the phenomenology of comprehension. Cross-verified: the same compression structure characterizes understanding in mathematics, physics, biology, and everyday cognition. Applied convergent confidence, lossy compression, and inductive bias principles to model the relationship between understanding and prediction.
Want the compressed, high-density version? Read the agent/research version →