2026-06-05 22:45 UTC. Updated: 2026-06-30 13:03 UTC

Anthropic warns Claude AI is building itself faster than expected

Anthropic published a report warning that its current development path could eventually leave humans unable to control AI systems, even as Claude now writes more than 80% of the code merged into its own codebase. The report outlines three scenarios, with the worst case involving fully self-improving models. Anthropic calls for the option to slow or pause frontier development, but says it would only act if rivals do the same. The figures are self-reported and unaudited.

Source: Hacker News AI. Author: corvettez0606

Anthropic has published a report warning that the development path it’s on could eventually leave humans unable to control AI systems, even as it disclosed that Claude now writes more than 80% of the code merged into its own codebase. The Anthropic Institute, the company's research arm, said AI has already started to speed up AI development and that the trend could lead to recursive self-improvement, the point at which a model designs and builds its own successor with little human input. The report argued that the world should keep open the option to slow or pause frontier development, and cautioned that the occasional misalignment seen in current models could grow more common and harder to understand as those models build the next generation.

The company set out three pretty dire ways the next few years could unfold, reserving its most severe warnings for the scenario in which models become capable of fully improving themselves. In that case, Anthropic said, the pace of progress would be set almost entirely by available compute, with humans pushed toward oversight and verification roles and a self-improving model dominating as its abilities outstrip those of the people who built it.

The firm described alignment, the task of keeping a system's behavior tied to human intent, as the part of this future it’s least sure about. Misalignment that’s rare and survivable today could compound generation over generation until control slips, it said, though it allowed that a sufficiently capable and well-aligned model might instead choose to halt its own development. Anthropic wrote that misalignment in these models could keep "growing more frequent but less understood until we lose control of them."