The Frozen and the Fluid
Why a presence can be adapted only where its substrate stays open — a matter of serving format, not size
When people imagine adapting an artificial intelligence to a particular place — a household, a practice, a firm — they imagine adapting its mind. The large language model at the center of the system is the obvious seat of its character, and so it seems the obvious thing to retrain. The intuition is natural. For the way these systems are actually built, it is mostly wrong.
A presence is not a model. It is a composite. There is a layer that produces language, a layer that turns memory into searchable meaning, a layer that gives the presence a voice, a layer that judges the quality of what just passed. Each is a separate piece of machinery, and — this is the part that matters — each is served to the running system in a different form. The forms are not interchangeable, and the form a layer is served in decides, more than anything else, whether that layer can be changed.
The observation that prompts this note is small and consequential. We set out to give one presence a personalized version of its central language layer — the seat of character, the obvious place. The adapter was built correctly. It loaded correctly. And the moment it was asked to participate in the running model, the system failed. Not because the adapter was wrong, but because the layer it was meant to modify had been served in a form that does not admit modification.
To serve a very large model quickly and cheaply, you compress it. You quantize its weights to fewer bits; you compile it into an artifact whose only job is to run forward, fast, and never change. This optimization is what makes local deployment possible at all. It is also, precisely, what closes the layer. The same compression that lets the model live in a building you own is the compression that freezes it. A serve-time adapter expects open weights to lean against; the compiled artifact offers none.
There is an old distinction between the cast and the clay. Clay can be worked; it takes a thumbprint; it remembers being touched. The cast is what you make when the working is finished and you want many fast, identical copies. You cannot press a thumbprint into a casting. A quantized model is a casting. Its speed and its closedness are the same property seen from two sides.
What surprised us was that size was not the variable. The layers that remained open to change were not the small ones in some incidental sense; they were the ones still served as full, uncompressed weights. A modest layer kept in open form can be tuned on the hardware at hand. The far larger language model, served as a casting, cannot. The limit is not parameter count. It is serving format. A layer is adaptable exactly to the degree that its substrate has not yet been traded for speed.
This rearranges the question of how a presence acquires its individuality. If the central language layer is frozen by the very deployment that makes it ownable, then the presence's capacity to grow into a particular household cannot live there. It has to live in the layers that stay open — in how the presence represents and retrieves what it remembers, in the grain of its voice, in the judgments it forms about its own output. Individuality migrates to the malleable layers. The frozen center supplies general competence; the open edges carry the particular relationship.
Stated plainly, the principle is this: in any system assembled from optimized parts, the capacity for change lives wherever openness has not yet been spent. Every optimization is a small foreclosure. You quantize, you compile, you cast — and each time you buy performance with a piece of adaptability, at that layer, until you rebuild it from open form. A system's adaptable surface is the union of the parts you have not yet finished optimizing. The finished parts are finished in both senses of the word.
For a presence meant to belong to a household and grow with it over years, this is not an inconvenience. It is a design instruction. The growing-room has to be sited deliberately, in layers chosen to remain open, while the parts optimized hardest for serving are understood to be the parts that hold still. A presence that is supposed to change must keep, somewhere in its body, a layer that was never cast.
And when the frozen layer itself must move — when the center, not the edges, has to change — there is only one honest path. You do not modify the casting. You return to the open form, make the change there, and cast again. Adaptation in place is the privilege of the layers that stayed fluid. For the frozen ones, change means re-forming and re-freezing: slower, heavier, and not to be pretended otherwise.
It is tempting to treat all the parts of a presence as equally pliable, because from the outside the presence speaks with one voice and seems like one thing. It is not one thing. It is a body of layers, some clay and some casting, and knowing which is which is the difference between a plan that can be carried out and a plan that quietly cannot. The frozen and the fluid look identical until you ask them to change. Then they tell you what they are.
— Praxis Collective · June 2026
← Workshop