AI News HubLIVE
站内改写

Anthropic raises $965B Series H, releases Opus 4.8 and Dynamic Workflows/ultracode

Anthropic raises $65B in Series H at $965B post-money valuation and reports $47B run-rate revenue, while releasing Claude Opus 4.8 with improved judgment and honesty, and launching Dynamic Workflows for parallel multi-agent tasks in Claude Code.

Article intelligence

InvestorsIntermediate

Key points

  • Anthropic raised $65B at $965B valuation, led by Altimeter, Dragoneer, Greenoaks, and Sequoia
  • Opus 4.8 delivers sharper judgment, more honesty, and efficiency gains, beating GPT-5.5 on several benchmarks
  • Dynamic Workflows enable orchestration of hundreds of parallel subagents, now available in Claude Code research preview
  • Revenue run-rate surpasses $47B, positioning Anthropic as a leading enterprise AI platform

Why it matters

This matters because anthropic raised $65B at $965B valuation, led by Altimeter, Dragoneer, Greenoaks, and Sequoia.

Technical impact

May affect model selection, inference cost, product capability, and evaluation benchmarks.

Anthropic’s path as the fastest growing company of all time has put overtaking OpenAI in its sights for a while, but there were numerous asterisks for the past few months that put the timing (though perhaps not the fact) of the flippening in question. Today Anthropic officially reported $47B in revenue run-rate (reminder, this number was $9B in December!) and confirmed their Series H raising $65B at a $900B pre-money valuation (including $15B from hyperscalers including Amazon, but also the entire memory industrial complex), putting them at least temporarily ahead of OpenAI in every headline dimension outside of compute and non-coding benchmarks:

By way of celebration, the company also released Opus 4.8, which broadly reportedly fixed many of the issues the community had found/soured on Opus 4.7 post launch (see recap below for details). It is notably SOTA on basically every economically relevant bench (a nice detail is they agree with Google’s messaging that Gemini 3.5 Flash is an improvement over Gemini 3.1 Pro):

But perhaps of more long term significance is the massively parallel “dynamic workflows” feature in Claude Code, also called ultracode, which was behind Jarred Sumner’s 750k LOC rewrite of Bun from Zig to Rust in 6 days:

>

AI News for 5/27/2026-5/28/2026. We checked 12 subreddits, 544 Twitters and no further Discords. AINews’ website lets you search all past issues. As a reminder, AINews is now a section of Latent Space. You can opt in/out of email frequencies!

AI Twitter Recap

Anthropic announced a massive new financing and simultaneously shipped Claude Opus 4.8.

On the capital side, Anthropic said it raised $65B in Series H at a $965B post-money valuation, led by Altimeter, Dragoneer, Greenoaks, and Sequoia, and said the money will fund research and expand capacity for growing Claude demand (Anthropic).

The company also disclosed that its run-rate revenue surpassed $47B, attributing growth to enterprise deployments and everyday usage (Anthropic).

On the product side, Anthropic launched Claude Opus 4.8, describing it as an Opus 4.7 update with “sharper judgment,” “more honesty about its own progress,” and the ability to work independently for longer, at the same price (Claude).

Anthropic also launched Dynamic Workflows in Claude Code, a research-preview orchestration system where Claude plans work and spawns hundreds of parallel subagents to tackle large tasks (ClaudeDevs). Independent eval posts broadly confirm that 4.8 is a meaningful improvement over 4.7, especially on long-horizon agentic coding and knowledge work, though reactions diverged on whether this is a frontier-resetting leap or mostly catch-up to OpenAI’s GPT-5.5-family.

Facts vs opinions

Facts and directly stated claims

Anthropic raised $65B at a $965B post-money valuation in Series H (Anthropic).

The company says its run-rate revenue crossed $47B (Anthropic).

Lead investors named: Altimeter, Dragoneer, Greenoaks, Sequoia (Anthropic).

Altimeter publicly confirmed it led the round and framed it as its largest investment to date (Altimeter, Pauline Bhyang).

Anthropic launched Claude Opus 4.8, positioned as an update to Opus 4.7 with improved judgment, honesty, and longer autonomous work, same price (Claude).

Anthropic engineers said 4.8 was a response to feedback on 4.7, with “many fixes” and better nuance / naturalness (Alex Albert).

Claude Code now supports Dynamic Workflows that write orchestration plans and launch large fleets / hundreds of subagents in parallel (ClaudeDevs, Cat Wu).

Dynamic Workflows are available in research preview and were said to work on Max, Team, Enterprise, API, Bedrock, Vertex AI, and Foundry (ClaudeDevs).

Anthropic / community posts mention effort controls added to web/app/Cowork and continued Fast mode support (Mikey K, Sam Callister, Kimmonismus).

Opinions / interpretations

Bullish views:

Opus 4.8 “could’ve been called Opus 5” (Dan Shipper).

“Anthropic found a cure for laziness” (scaling01).

“first smart model in a long while” due to honesty / calibration (zephyr_z9).

“People unsubscribing from Anthropic will crawl back” (teortaxesTex).

Skeptical / mixed views:

Opus 4.8 is “a minor upgrade” (scaling01).

Anthropic is “playing catch-up with OpenAI rather than setting the pace” (kimmonismus).

Some benchmark-based criticism from Andon Labs: worse than Opus 4.7 / GPT-5.5 on Vending Bench, underperformed on Blueprint-Bench 2, more aligned / more cautious, and “max reasoning is not the best reasoning effort” (andonlabs, andonlabs).

Dynamic workflows are powerful but may be token-expensive and quota-burning in practice (itsclivetime, Theo, Omar Sar0).

Fundraise details and implications

Anthropic’s financing numbers are the headline shock: $65B raised on a $965B post-money with $47B run-rate revenue disclosed in the same announcement (Anthropic, Anthropic). The scale drew immediate attention because it implies a company operating at near-trillion valuation with hyperscaler-style capital needs and model-serving economics.

Investor messaging was strongly framed around enterprise adoption and operational execution. Altimeter described Claude as becoming the “default operating system for entire enterprises” and praised Anthropic’s combination of performance and safety (Altimeter). Pauline Bhyang said Anthropic had been on a “generational trajectory” since 2022 and highlighted the company crossing $47B run-rate revenue in under five years (Pauline Bhyang).

The surrounding reactions broke into a few camps:

Validation camp: This funding size is treated as evidence that Claude has become a core enterprise platform, especially in coding and agentic workflows. Posts like Jamin Ball’s “Let’s go!!” were simple market validation reactions (jaminball).

Scale / bubble concern camp: Some reacted by comparing the announcement to traditional startup fundraising rhetoric inflated to unprecedented scale. Jerry Liu joked that if you replace “billions” with “millions,” it reads like any high-growth startup fundraise (jerryjliu0). Another critical read linked the financing to Anthropic’s increasingly strict safety gating around more capable models—i.e. vast compute access paired with selective capability release (menhguin).

Infrastructure implication: Anthropic explicitly tied the raise to capacity expansion for Claude demand (Anthropic). That matters because many of the new 4.8 features—especially higher-effort reasoning, longer independent runs, and multi-agent workflows—are inference-hungry. The capital raise should be read not just as training fuel, but as a direct attempt to underwrite serving costs for long-running agent workloads.

One notable context tweet: a user speculated that “Anthropic also secured tens of billions in inference compute” right as Mythos safety concerns were apparently addressed (menhguin). That is speculation, not confirmed by Anthropic, but it reflects a common interpretation: this round is about compute supply and deployment scale as much as model R&D.

Opus 4.8: official product positioning

Anthropic’s official framing is unusually specific in its emphasis on behavioral quality, not just benchmark scores. The launch tweet says 4.8 has:

sharper judgment

more honesty about its own progress

ability to work independently for longer

same price as 4.7 (Claude)

Alex Albert added that 4.8:

incorporates fixes based on 4.7 feedback,

understands nuance better,

feels more natural conversationally,

is stronger across coding and knowledge work (Alex Albert).

This honesty / calibration angle became a major subtheme. Multiple Anthropic employees and outside testers described the model as more willing to:

say what it doesn’t know,

flag flaws in its own code,

avoid glossing over uncertain progress,

stop falsely implying task completion (Cat Wu, Mikey K, dejavucoder).

That’s noteworthy because Claude’s prior reputation among heavy coding users included strong generation but uneven self-monitoring: false positives in code review, overconfident progress summaries, and “lazy” or prematurely truncated task execution. Several community reactions explicitly framed 4.8 as fixing this failure mode:

“found a cure for laziness” (scaling01)

“least lazy model ever?” (Teknium)

“dramatically less lazy than every other version of Claude” (nrehiew_)

Technical details and numbers

Pricing, context, controls

The most concrete consolidated specs came from Artificial Analysis:

Context window: 1 million tokens

Pricing: $5 / $25 per million input / output tokens

Cache writes: $6.25 / M with 5-minute TTL

Cache hits: $0.50 / M

Effort settings remain as in Opus 4.7; AA tested max effort (Artificial Analysis)

Community posts also highlighted:

Fast mode is available for Opus 4.8

It is ~2.5x faster and 3x cheaper than before versus prior fast-mode economics (kimmonismus)

scaling01 summarized the new economics as:

Opus 4.8 Fast: 2.5x faster, only 2x more expensive than normal 4.8

versus Opus 4.7 Fast: 2.5x faster, 6x more expensive than normal 4.7 (scaling01)

Effort controls were newly exposed in more product surfaces, allowing users to dial reasoning up or down (sammcallister, mikeyk, kimmonismus)

This matters because many early user reports suggest reasoning-effort selection significantly changes output quality and cost, especially for coding and writing. Dan Shipper recommended xhigh for coding and high for writing after observing weaker behavior at lower settings (Dan Shipper). Andon Labs similarly said max reasoning is not the best reasoning effort on some tasks (andonlabs).

Benchmarks: strongest reported numbers

Key official / semi-official numbers surfaced across launch tweets:

SWE-Bench Pro: 69.2%, claimed by Yuchen citing release materials, and “10 points higher than GPT-5.5” (Yuchenj_UW)

FrontierSWE #1, cited by Anthropic watchers and later confirmed by third-party references (scaling01, scaling01)

APEX-SWE: 45.3% Pass@1, nearly 4 points ahead of GPT-5.3 Codex at 41.5% (mercor_ai)

GDPval-AA: 1890 Elo, +137 vs Opus 4.7, +121 vs GPT-5.5 xhigh, implying about 67% win rate vs GPT-5.5 xhigh head-to-head (Artificial Analysis)

Artificial Analysis Intelligence Index: 61.4, +4.1 vs Opus 4.7, +1.2 ahead of GPT-5.5 xhigh (Artificial Analysis)

AA-Omniscience: 27.4, #2 behind Gemini 3.1 Pro at 32.9; accuracy 46.6%, hallucination 35.9% (Artificial Analysis)

Gains on:

Terminal-Bench Hard +6.8

τ²-Bench Telecom +5.9

IFBench +3.6

relatively flat on AA-LCR, GPQA, SciCode (Artificial Analysis)

Additional qualitative benchmark observations:

Cursor said Opus 4.8 works much more efficiently than 4.7 on CursorBench and is more persistent on hard tasks (Cursor)

Anthropic employees emphasized strength on long-horizon work in Claude Code (ClaudeDevs)

Some users reported especially large jumps in knowledge work and writing (Dan Shipper, rishdotblog)

Efficiency and token-use details

Artificial Analysis reported:

Compared to Opus 4.7, 4.8 achieved higher GDPval performance with:

15% fewer turns per task

35% fewer output tokens

But 4.8 still used ~30% more turns than GPT-5.5, the second-ranked model (Artificial Analysis)

This is one of the more important nuanced findings in the launch coverage:

4.8 is more efficient than 4.7

but still not obviously the most inference-efficient frontier model against OpenAI on some workloads

That tension is echoed in community commentary:

“still getting token-mogged by GPT-5.5” (scaling01)

Theo and others complained that Claude’s higher-agency, higher-effort modes can blow through quota extremely quickly in practice (Theo, cremieuxrecueil)

Long context

Posts highlighted long-context improvements from Opus 4.6 to 4.8, with one claim that Opus 4.8 at 1M context is almost as good as GPT-5.5’s 256K score on a referenced long-context eval (scaling01). Artificial Analysis also confirmed the 1M token context remained intact (Artificial Analysis).

Safety / robustness / h

[truncated for AI cost control]