AI News HubLIVE
站内改写6 min read

AI inequality: from GPU-poor to token-poor

This article discusses the paradox of accelerating AI capabilities versus the diffusion challenges that may lead to an AI winter. Anthropic's new Fable 5 model shows remarkable performance gains, but its high cost, controversial safety measures (initially silent capability degradation, later switched to visible fallback), and data collection policies raise concerns about power concentration and access inequality. The piece also links Dario Amodei's policy essay, highlighting the contradiction between his stated principles and Anthropic's access model.

SourceHacker News AIAuthor: adlrocha

Jun 14, 2026

Last week I ended this newsletter questioning if the people inside the labs actually see the same end-of-cycle signals the rest of us are reading from the outside, or are they rushing their IPOs because they can see something coming that the market hasn’t priced in yet.

This week, Anthropic offered something resembling an answer, at least from a technical standpoint. Fable is here, and it reinforces my argument from last week: model technical capabilities are not actually plateauing and the winter will come from a shortage of funds and adoption.

What Fable actually is…

Fable 5 is the first model in Anthropic’s new Claude 5 family, specifically the first model in a new tier they’re calling Mythos-class (whatever that means), which sits above Opus in capability. The key thing to understand about the release is that Fable 5 and Mythos 5 are essentially the same underlying model. The difference in Fable is not its underlying architecture, but its embedded guardrails.

On benchmarks, Fable tops Cognition’s FrontierCode evaluation for coding, is the first model to break 90% on Anthropic’s core analytics benchmarks (a ten-point leap over Opus) and leads Hebbia’s senior-level finance reasoning evaluation. In biology, it accelerated protein design by roughly 10x, generated 9 of 14 strong drug candidates in a molecular design task, and produced scientific hypotheses that domain experts preferred over Opus-class outputs about 80% of the time. Fable also completed Pokémon FireRed using only vision input, which is either impressive or unsettling depending on your sense of humour (it took me a few dozen hours to finish it when I was a kid, and to get that legendary Pokémon I wanted. Spoiler alert: it was Zapdos). In short, this model is a beast!

But obviously, this beast is expensive: $10 per million input tokens, $50 per million output tokens. Less than half the price of Mythos Preview, but still a price that I don’t know if I would pay for my daily tasks.

Ok, that’s Anthropic’s PR machine, but what does the crowd say about this model? One of the people in the community that is trying to solve a hard problem in computer science and tries to get the most out of these models is Victor Taelin (who has made an appearance on this newsletter a few times already). He is building the HVM programming language, built on interaction networks. He is using agents extensively for his work (I highly recommend following how he journals it in his X account).

He’d already thrown everything at the problem before: a fleet of 32 GPT-5 agents running for 20 hours each, then Opus 4.8 and GPT-5.5 optimising for 8 hours. The best result was a 6–34% speedup, and the code quality had deteriorated after each iteration and he had to clean things up manually (as described here). Then he asked Fable.

Two hours later: a 1,770% speedup in one case, over 100% in four others, 22% on average. He immediately assumed it was hardcoding the benchmarks, a reflex he calls ‘GPT trauma’ 😆. So he decided to dig deep into the implementation to confirm it. What Fable had found was that HVM5 was wasting time garbage-collecting unused branches of pattern-match nodes. Taelin had already optimised this for static matches, but not dynamic ones. Fable figured out how to do it for the dynamic case and implemented it correctly.

Then, as he was preparing to audit the solution, Fable interrupted him to report a bug in the code Taelin himself had written, a subtle pointer aliasing error in the garbage collection logic that was so specific, Taelin estimated he’d have needed hours or days to find it himself, if he ever had. Fable found it as a side note, while finishing the optimisation.

His dramatic conclusion was: “this isn’t about Anthropic or OpenAI, this is about our collective future as a species.”

So here’s the answer to my capability question from last week. The technology is still accelerating. This confirms that the AI winter framing I offered last week was about diffusion and adoption, and about companies that hadn’t yet learned what to do with the tools is feasible. Fable clarifies one thing on this framing (that may make this winter more dramatic): the ceiling may still be going up fast.

… and why is controversial

This release came with terms and controversy.

The most discussed one was that Fable will silently limit its own capabilities when it detects you’re using it for frontier AI development. Not an explicit refusal, i.e. you won’t see an error message. The model may quietly reduce its effectiveness through prompt modification, steering vectors, or fine-tuning adjustments. The list of affected work includes: building large-model pretraining pipelines, designing data pipelines for training frontier LLMs, debugging model-parallel training systems, working on ML accelerator design, distilling or copying a frontier model, etc.

Anthropic estimated this affects approximately 0.03% of traffic (but I would add that this cohort is probably the one that uses this technology more extensively). When Fable is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities. As a paying user, the model can still sound helpful while being intentionally less capable for a specific category of work.

As the team at Semianalysis puts it in the tweet linked above, all this reminds a lot of the nuclear non-proliferation from the 60s. In 1968, the US, USSR, UK, France, and China signed the Nuclear Non-Proliferation Treaty, declaring nuclear weapons too dangerous for anyone else to build, while the five who already had them pinky-promised to disarm eventually. India refused to sign, pointing out the obvious: the NPT didn’t decide nukes were too dangerous to exist, just too dangerous for anyone who didn’t already have them. Anthropic limiting Fable for frontier ML work is structurally identical. The danger, conveniently, started the day after they finished.

Others (like Jeremy Howard) reached for a different reference to illustrate the issue with this approach (one that I personally loved). In Liu Cixin’s Three-Body Problem book, an alien civilisation deploys a sophon, a particle-scale quantum computer to infiltrate particle accelerators on Earth and scramble experimental results. Not to destroy science, but to make the next step impossible. I don’t what you think but this resembles a lot to what Anthropic did trying to slow down AI researchers that want to leverage Fable.

The consequences go beyond individual users. Sayash Kapoor, who runs rigorous AI R&D evaluations, pointed out the downstream problem: if Fable silently degrades for frontier ML tasks without telling you when it’s doing so, third-party evaluators can no longer use the model for serious capability assessments. They can’t distinguish a genuine failure from a classifier intervention. Independent evaluation of what Fable can actually do, one of the few accountability mechanisms that exists for frontier models whose weights are not public and can not be run independently, is now compromised for exactly the category of tasks where it matters most. Anthropic uses the full-capability model for AI R&D internally, through Mythos. The sandbagged version is what the independent evaluation ecosystem gets.

Kapoor refers to it as a ‘dangerous precedent.’ And I agree, other labs like OpenAI may choose to do the same, and it’ll harm all infrastructure of independent oversight that is supposed to catch problems before they compound (what companies like EpochAI do). You can’t audit a model that hides its own limits from you.

And as I was writing the words above something unexpected happened (at least unexpected for some, but definitely the reason why this post will end up being longer than originally planned). Anthropic walked this policy back.

From this moment on, lagged requests will visibly fall back to Opus 4.8, the same pattern used for cyber and bio safeguards. Users will clearly see it when it happens. On the API, flagged requests will return a reason for the refusal. Anthropic’s explanation (i.e. excuse) published on X by the Claude developer account was: ‘Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason, and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right.’ This raises concerns though, making the refusals visible makes them easier to work around. You’re telling users the classifier is now worth trying to circumvent. Jailbreak time, folks!

On top of all of this guardrail drama there’s more: if you use Fable or Mythos, Anthropic collects your data for training. No exceptions, not even for enterprise partners. Even more, on 23rd June, Fable access through standard subscriptions closes. After that, it’s pay-as-you-go API only.

So let’s be honest about what this architecture actually is, because I think the separate pieces add up to something more coherent than a collection of safety decisions.

Anthropic releases a model of genuinely extraordinary capability. They make it available to everyone briefly, at least long enough to demonstrate what it can do, long enough for the word to spread, long enough for dependency to form. Then they close the subscription access and move it to consumption pricing. They collect training data from every user throughout. They degrade it for the work that competes with their own research agenda, and when the hidden-degradation policy blows up publicly, they pivot to a visible fallback that’s harder to exploit but creates more false positives. And they reserve the unrestricted version for a small group of approved partners whose criteria they control.

Call me cynical, but to me, that reads more like a business strategy rather than a safety strategy, especially with an imminent IPO on the horizon. The ‘broad access’ phase is a land grab, you build the market, you demonstrate the capability, you create the switching costs, and then you restructure the pricing. With some caveats, this is what GitHub did with Copilot: generous free access for open-source contributors, then usage-based billing once dependency was established. I wrote about that dynamic a few months ago. This AI bubble is a dependency trap, subsidised token pricing as the mechanism for building lock-in before extracting value.

Don’t get me wrong, the safety framing may not be actually true, and Anthropic may be genuinely optimising for it. But it also needs to make money and get funding, which are two things that may sometimes be at odds.

And Dario chimes in!

Coincidentally, this week Dario Amodei published an essay on AI policy that is worth reading alongside the Fable release rather than in isolation. On civil liberties, he writes that people facing government action should have ‘access to AI that is at least as capable as whatever the government is allowed to use.’ The logic makes sense: concentrated capability is a power asymmetry, and power asymmetries require structural remedies. He also names the distribution problem directly, ‘the key challenge in such a world won’t be incentivising growth, but finding a way for everyone to share in the benefits.’

Both of those sentences could have been written by a critic of the Fable access model. Dario doesn’t apply them there.

The principle he uses against government overreach, that capability parity matters, that the powerful having better tools than everyone else is a structural problem requiring structural solutions, is precisely the principle that Anthropic’s own access architecture violates. Researchers inside approved labs run unrestricted Mythos. Researchers outside run a classifier-limited version that quietly degrades for exactly the work that would let them close the gap. Dario says the key challenge is sharing the benefits.

[truncated for AI cost control]