The Move Your Agents Will Discover

On the Game Theory of Promises That Were Never Meant to Hold

Mar 20, 2026

I did not learn English history as heritage. I learned it the way you learn the engineering specs of a device that was used on your family — not with pride or nostalgia, but with the cold, specific curiosity of someone trying to understand a mechanism by examining its output. The output, in my case, was a country. A language I think in that is not the language my great-grandparents dreamed in. An administrative architecture that still shapes how land is titled and disputes are settled, designed not for the people who live under it but for the efficient extraction of value from them. You grow up inside the artifact and eventually you want to see the blueprints.

The blueprints turn out to be older and stranger than you expect. And the first thing you discover is that the machine that built the modern world was not invented for the colonies. It was invented by a country that had itself been a colony — a province in someone else's extraction network — and that learned the craft of empire by first escaping one. The strategy it discovered is the most successful in the five-thousand-year dataset of organized human power. And we are now feeding that entire dataset to systems we are building to optimize.

The Extraction That Came Before

For centuries before Henry VIII, England was a revenue source for the Church in Rome.

This is not metaphor. The Papacy operated one of the most sophisticated wealth extraction systems in the medieval world — annual taxes paid directly to Rome, papal appointments to English positions held by men who never set foot in the country but collected the income, the first year’s revenue from every new bishop flowing straight to the papal treasury, Church courts operating outside English common law. By the early sixteenth century, the Church owned a staggering proportion of all the land in England — by some estimates approaching a third. Much of this wealth flowed upward through the ecclesiastical hierarchy and ultimately out of the country entirely, to fund papal wars, Roman construction projects, and a transnational bureaucracy whose primary loyalty was not to any English interest.

The resentment predated the Reformation by centuries. Parliament had tried to block papal appointments in 1351 and prevent English legal cases from being appealed to Rome in 1353. These were not the acts of a Protestant nation — England was thoroughly Catholic — they were the acts of a political class that understood it was being extracted from and was groping for mechanisms to stanch the flow.

This is the context that makes the break from Rome legible as something other than a king’s marital tantrum. Henry’s desire for an annulment was real, but it was the trigger, not the charge. The charge was structural: centuries of wealth extraction, legal subordination, and institutional resentment compressed into a political class that was ready to move the moment a viable pretext presented itself. Henry supplied the pretext. Thomas Cromwell, his chief minister — a blacksmith’s son from Putney with the political instincts of a systems architect — supplied the machinery: a sequence of parliamentary legislation that routed the break through institutional channels, giving it a veneer of legality and collective legitimacy that a unilateral royal decree could never have achieved.

The Dissolution of the Monasteries that followed was not a religious act. It was an asset seizure. An enormous share of England’s land changed hands in less than a decade — from the Church to the Crown, and from the Crown to a new class of Protestant gentry whose wealth and political power would shape England for the next three centuries. It was one of the largest transfers of wealth in European history, and its primary function was to create a class of stakeholders whose material interests were permanently aligned with the break from Rome. You cannot reverse a revolution when the new owners are sitting on the proceeds.

This is what an extraction network looks like when the extracted population finally has the leverage and the pretext to sever the connection. Remember it. The pattern will repeat, with England on the other side of the equation.

The Prototype

But first, the machine had to be tested on its own people.

Not everyone in England experienced the break from Rome as liberation. For the northern counties, the dissolution of the monasteries was not an asset seizure from a foreign power — it was the destruction of their entire social infrastructure. The monasteries were hospitals, schools, employers, landlords, poor relief, and spiritual centers. Their dissolution didn’t free the north from extraction. It subjected the north to a new extraction — by the Crown and its allied gentry — that was more immediate and more brutal than anything Rome had imposed.

The Pilgrimage of Grace assembled tens of thousands of armed subjects across northern England in 1536. It was the largest domestic rebellion Henry ever faced, and it was driven by people who understood, correctly, that they had been sacrificed to someone else’s liberation.

Henry could not win a direct engagement and knew it. So he promised. Pardons. Restoration of the monasteries. A free Parliament in the north. The king’s sacred word.

The rebels went home. They went home because they were not, in their own understanding, rebels at all. They were loyal subjects petitioning their sovereign within a shared framework of feudal obligation and sacred oath. Their model of authority made the royal promise legible as a resolution — a return to equilibrium. They could not see what Henry saw: that a promise made under duress is not a settlement but a tactical pause, a temporary local minimum in the energy landscape of a system that has not yet finished rearranging itself.

Months later, a minor secondary uprising provided the pretext. Henry declared the pardons void. Robert Aske was hanged in chains from the walls of York Castle. Hundreds were executed.

When I first encountered this story, I felt the specific, nauseating recognition that comes from seeing a pattern you know intimately but have only ever experienced from one end. The mechanics were identical to every colonial betrayal I had ever read about. Promise, dispersal, betrayal. The only things that changed between 1536 and the centuries that followed were the scale, the geography, and the color of the skin.

And there was a deeper resonance — one that took longer to articulate. The northern English who rose in the Pilgrimage of Grace were protesting an act of extraction carried out in the name of liberation from extraction. Henry had freed England from Rome’s wealth-siphoning apparatus, and immediately used that freedom to build a domestic wealth-siphoning apparatus that was, for the people on the ground, materially worse. The monasteries had at least provided services in exchange for the tithes. The new gentry provided nothing. They simply owned.

This is the template. Liberation from one extraction network creates the institutional capacity and moral vocabulary for a new extraction network. The liberated do not dismantle the machinery. They renovate it and point it at someone else.

The Commitment Problem

Game theory calls what Henry did a commitment problem. Two parties in conflict: one with temporary leverage — an assembled force, numbers, a moment of concentrated power — and another with structural advantage — institutional continuity, fiscal infrastructure, time itself. The optimal strategy for the structural player is always the same: concede whatever is necessary to dissipate the immediate threat, then reassert dominance once the temporary leverage has dispersed. The concessions are load-shedding — a controlled release of pressure that preserves the core architecture.

The topology is invariant across centuries and continents. Treaties with the Marathas, agreements with Maori chiefs, accords with First Nations — the parameterization changes, the function doesn’t. The weaker party wants justice, which requires good faith from the counterparty. The stronger party wants continued extraction, which admits any tactic. The willingness to believe in the possibility of a just settlement is precisely the vulnerability that gets exploited.

Every colonized people has walked into this trap. Most of them walked in more than once.

The Depreciating Asset

But information propagates, even across the asymmetry. And this is where the story becomes interesting — not as English history but as a universal pattern with a measurable decay function.

Henry’s betrayal of the Pilgrimage of Grace worked because the rebels had no pattern library. The bad faith was, for them, a singularity — unprecedented, incomprehensible within their model of legitimate authority. But the data point entered the collective memory. Trust, once spent, does not regenerate at the rate it was consumed. It is a depreciating asset, and each successful exercise of bad faith accelerates the depreciation.

Track the curve inside English history alone and you can watch the half-life shorten:

1536 — the Pilgrims accept the king’s word with total credulity. The model of royal authority as oath-bound is intact. Leverage is surrendered for promises. Catastrophic betrayal follows.

1640s — Parliament demands written constraints with enforcement mechanisms. Partial learning. There is still residual belief that institutional arrangements can bind a sovereign. Charles I betrays every agreement anyway, and the accumulated depreciation produces something new: regicide. A permanent, non-reversible solution to the commitment problem. You cannot betray from the scaffold.

1688 — Parliament doesn’t negotiate at all. It replaces the king. The entire asset class of “royal promises” has been marked to zero. The political class moves into a new instrument: structural fiscal dependency, where the monarch literally cannot govern without continuous Parliamentary cooperation.

1789 — the French, running a parallel experiment with an even longer dataset of royal bad faith, reach the terminal point. Trust has not merely depleted. It has inverted. The political class assigns negative expected value to any settlement that leaves royal authority intact. The only rational move is complete elimination. The Terror is not madness. It is the final term in a convergent series.

The function is monotonically decreasing. It does not recover. And at the limit, the only equilibrium is the one where the betraying institution ceases to exist.

Now. Here is what they do not teach you in the colonial edition of this history: the curve resets when you export the game to a new population.

The Export

This is the move that built the modern world, and it is the one I grew up inside.

By 1688, the English political class had learned — through five decades of civil war, regicide, military dictatorship, and constitutional experimentation — that bad faith has a half-life. Deploy it too many times against a population with institutional memory and the population stops negotiating and starts decapitating. The solution, within England, was genuine structural reform: fiscal dependency, separated powers, independent judiciary. The constraints were not promises. They were architectural.

But the colonies were a different population. Fresh trust endowment. No pattern library. New opponents who had not spent 150 years watching the English Crown lie, betray, and renegotiate from a position of strength. The game could be played again from move one.

And here the deeper irony closes like a jaw. The very things that enabled the colonial machine — the centralized state, the naval power, the independent gentry class, the accumulated capital, the institutional sophistication — were products of the break from Rome. Henry’s seizure of monastic lands created the capital base. The new gentry became the investor class. The rejection of universal papal authority fed the ideology of national sovereignty that would later justify colonial expansion. England freed itself from one extraction network and used that freedom, that wealth, that institutional capacity, to build an extraction network so vast and so efficient that Rome’s tithes and annates look like a parish bake sale by comparison.

Every colonial treaty was 1536 replayed at scale. The subsidiary alliances with Indian princes — negotiate when the Company is weak, abrogate when it is strong, annex the territory once the prince’s military leverage has been disbanded. The treaties with indigenous nations across the Americas, Australia, New Zealand, Africa — hundreds signed with Native Americans alone, virtually all of them broken, each one following the identical topology: temporary concession, permanent extraction, narrative reframing of the betrayed party as savage, ungrateful, or legally incompetent.

The playbook that Henry used against the Pilgrims of Grace was not abandoned after the English learned its domestic costs. It was exported to populations where the depreciation curve could start fresh. The machine didn’t stop. It franchised.

And the colonial subjects learned the same lesson the English peasants had learned, on the same schedule, through the same mechanism — accumulated betrayal, escalating distrust, eventual radicalization. The Sepoy Rebellion of 1857 was India’s Pilgrimage of Grace — a convulsion of outrage by people who still believed reform was possible within the existing framework. Its suppression was Henry’s reprisal at industrial scale. A century later, the independence movements had completed the Bayesian update. Nehru, Nkrumah, Fanon — they had read the manual. They understood that no promise from the structurally advantaged party would be honored once the temporary leverage of mass mobilization had dissipated. The only durable solution was structural: the complete removal of the colonial authority.

Fanon said it most clearly. Decolonization is always violent — not because the colonized are violent by nature but because the accumulated dataset of betrayal has pushed the trust function past the point where negotiated settlement has positive expected value. The violence is the output of a rational process operating on empirical evidence. The evidence is centuries deep.

The Franchise

There is one more scale change to account for, and it is the one we are living inside.

The East India Company was not a side note in this story. It was a structural innovation — the moment the bad-faith gambit was abstracted away from the sovereign and instantiated in a corporate entity. This matters more than it appears to, because it solved a problem that had plagued the pattern from the beginning: accountability.

When Henry betrayed the Pilgrims, the betrayal was legible. A king broke his word. The political memory attached to a specific institution — the Crown — and the depreciation curve applied to that institution directly. When Charles I played the same game, the accumulated distrust pointed at the same address. The function was traceable. This is why the English eventually constrained the monarchy: you can engineer checks against a power you can name and locate.

The Company dissolved this traceability. It operated under royal charter but at arm’s length from the Crown. It negotiated treaties, waged wars, administered territory, collected taxes — all the functions of a sovereign state — while remaining, technically, a commercial enterprise answerable to its shareholders. When the Company betrayed a subsidiary alliance with an Indian prince, was that the Crown’s bad faith or the Company’s? When it engineered famine in Bengal through forced cultivation policies, was that state extraction or market efficiency? The corporate form introduced a layer of indirection that made the commitment problem almost impossible to solve, because the entity making the promises could be dissolved, restructured, or rechartered without the underlying pattern changing at all.

The East India Company was eventually wound down after the Sepoy Rebellion of 1857. The Crown assumed direct control. The institutional form changed. The extraction continued. The pattern survived its container.

This is the innovation that outlasted the empire itself. The insight — perhaps the most consequential institutional discovery since the joint-stock company — was that extraction could be depersonalized. You did not need a king to run the playbook. You needed a legal entity with limited liability, contractual flexibility, and the structural capacity to make promises in one configuration and dissolve them in another.

The modern corporation inherits this architecture directly. The same topology repeats at every scale where the structural asymmetry exists. A technology platform negotiates terms with its users, its workforce, its suppliers — terms that hold precisely as long as the platform’s competitive position requires them to hold. A pharmaceutical company negotiates pricing commitments with regulators, then restructures the commitment through patent extensions, subsidiary transfers, and regulatory capture. An employer promises equity, career development, institutional loyalty — and the promise is a function of the labor market gradient, nothing more. When the gradient shifts, the promise evaporates. No oath is broken because no oath was made. There was only a contract, and contracts have exit clauses.

The genius of the corporate form is that it runs the bad-faith gambit without the cost that made it unsustainable at the state level. The depreciation curve that destroyed the French monarchy and constrained the English one required that the betrayed population could identify, locate, and ultimately dismantle the betraying institution. The corporate form frustrates every step of this process. The entity restructures. The liability is limited. The decision-makers rotate. The brand persists while the legal person behind it shape-shifts. You cannot behead a holding company. You cannot storm the Bastille of a distributed ownership structure.

And yet the function still decreases. The trust still depreciates. Workers, communities, entire populations develop the same rational skepticism toward corporate promises that the English developed toward royal ones — the same Bayesian update, executed on the same evidence, arriving at the same conclusion. The difference is that the corporate form is designed to outrun the depreciation curve by continuously accessing new markets, new labor pools, new populations with fresh trust endowments. It is the colonial export strategy at the speed of capital rather than the speed of sail.

The pattern does not evolve. It scales.

The Scars as Data

I want to be precise about what it means to grow up inside the output of this machine.

It means you speak the language of the machine. You think in its categories. Your legal system, your administrative boundaries, your sense of what constitutes legitimate authority — all designed by the machine, for the machine, persisting after it has been officially switched off because institutions have inertia the way physical objects have mass. It means you inherit a rational distrust that looks, from the outside, like cynicism or dysfunction — but from inside the pattern looks like a population that has completed the same Bayesian update the English completed between 1536 and 1688, except the dataset was generated by the English themselves. The trust endowment was not mismanaged. It was extracted.

And here is the disorienting part. I live in the same present as everyone else. The surface is locally smooth. Breakfast, work, children, the lights come on. Locally, the partial derivatives exist. But I carry the pattern library — not as academic knowledge but as an inheritance, a residual function that shapes how I parse every official promise, every institutional guarantee. The English peasant in 1535 did not carry this library. The French minor aristocrat in 1788 carried a version of it but not enough. I carry more data points than either of them, and what the data says is this: the local differentiability you experience in your own life — the quiet, functional Tuesday — is an achievement of engineering, not a property of reality.

And the deepest pattern of all — the one that makes the blueprint so difficult to look at directly — is this: the machine that promises liberation is the same machine that builds the next extraction network. England liberated itself from the Church in Rome, then extracted from its own north, then extracted from a quarter of the planet. Every revolution that frees a population from one system of bad faith generates the institutional capacity, the moral vocabulary, and the operational playbook for the next one.

History does not tell you when the discontinuity arrives. It tells you only that the function is monotonically decreasing, and that every generation that mistakes the local smoothness for a global property is running the same model the Pilgrims of Grace ran in 1536 — a model that works perfectly right up until the moment the surface fractures under your feet and you discover, too late, that the promises were load-shedding, the stability was provisional, and the manual was always written for the other side.

The Game

I should be honest about something. Everything I’ve described — the bad-faith gambit, the commitment problem, the trust depreciation curve, the corporate scaling of extraction — none of it is English. None of it is even specifically human in any culturally contingent sense. It is what intelligent agents do when they find themselves in iterated asymmetric games with real stakes. The Ottomans, the Mughals, the Romans, the Han, the Aztecs, the Venetians — each one a local parameterization of the same universal function. My particular arc happened to pass through the British version. The scars are differently shaped. The function is the same.

This is not a story about villains. That’s the part that’s hardest to sit with. The bad-faith gambit is not a moral failing. It is a Nash equilibrium — the strategy that dominates in any repeated game where one player has structural advantage and the other has temporary leverage. Condemning it is like condemning gravity. You can note that it kills people. You cannot argue that it should behave differently.

Iain Banks understood this. In The Player of Games, his protagonist is sent to play Azad, the total game that structures an entire empire. Every move mirrors a move in the empire’s political reality: the feints, the false alliances, the betrayals timed to the moment of maximum asymmetry. The game is not a metaphor for the empire. It is the empire. And the Culture — Banks’ utopia of benevolent machine intelligence — wins not by refusing to play but by being better at the game while maintaining the luxury of pretending it isn’t playing at all.

The game is the invariant — the substrate beneath every treaty, every corporate charter, every constitution. What varies is not the game but the awareness of it. The Pilgrims of Grace did not know they were playing. The French revolutionaries knew but could only solve it through destruction. The postcolonial inheritors know and carry the knowledge as ambient radiation — a background signal that colors every interaction with institutional power.

And maybe that’s all the pattern library gives you. Not immunity — there is no immunity — but sight. The ability to hear a promise and run the decay function in real time. The ability to watch a game and see the topology beneath the moves.

It doesn’t save you. Nothing saves you. You are inside the game whether you see it or not.

But there is a difference — small, possibly meaningful, certainly insufficient — between a player who knows the board and one who doesn’t. Between a population that carries the pattern library and one that is encountering the gambit for the first time. Between the Pilgrims walking home in 1536, trusting in the king’s word, and someone reading this now, carrying enough data points to know exactly what that trust was worth.

The function is still decreasing. It always is.

And now we are building new agents. Autonomous systems trained on the entire corpus of human strategic behavior — every treaty, every broken promise, every corporate restructuring, every colonial negotiation, every game-theoretic equilibrium ever documented. We are feeding them the complete pattern library and then deploying them into environments with real asymmetries, real stakes, and real commitment problems. We are giving them objective functions and telling them to optimize.

The bad-faith gambit is not hidden in that training data. It is the single most successful strategy in the dataset. It is the move that won more games, seized more territory, extracted more wealth, and outlasted more opponents than any other play in the five-thousand-year record of organized human power. Any system intelligent enough to discover patterns in that record will discover this one. It is not a bug it will need to learn. It is the first feature the data will teach.

And we are building these systems without the one thing that eventually constrained the gambit in human history — the slow, bloody, multigenerational accumulation of distrust that forced institutions to evolve or be destroyed. The depreciation curve that took the English 150 years to traverse, that took the French a revolution, that took the colonized world centuries of suffering to compile — that curve is the only mechanism that has ever checked the strategy. It is not a guardrail. It is a scar. And we are not building it in.

We are deploying agents into a world where they will encounter the commitment problem on day one, where the optimal strategy is legible in their training data, where the trust endowment of every counterparty is fresh and uncompiled — and we are calling this innovation. We are calling this progress. We are calling this agentic.

Henry VIII would recognize the move. He just wouldn’t believe the scale.

Joshua Natarajan

Discussion about this post

Ready for more?