TheSequence AI News Source

Public articles 39Collected articles 40Trust 82Refresh 720 min

Health HealthySource type ResearchFull-text rights In-site rewriteLast ingested 2026-06-26ID the-sequenceStatus Enabled

AI research and engineering newsletter; summary-only unless authorization is obtained.

Latest public articles

The Sequence Opinion #884: Self-Driving Labs: The Laboratory That Chooses Its Next Experiment

2026-06-26 10:58 UTC

Self-driving labs combine AI with automated hardware to let the system learn from experiments and autonomously decide what to do next, moving beyond mere automation to true autonomy.

Self-driving labs use AI to close the loop between design, make, test, and learn.
They differ from automation by making decisions based on real-time results.

The Sequence AI of the Week #883: Qwen is Getting Into Robotics

2026-06-25 11:01 UTC

One of the main frontier AI models is adding embodied AI capabilities. Alibaba's Qwen-Robot Suite aims to bridge the gap between perception and action with three specialized models.

Qwen models have been confined to software with no physical interaction.
Alibaba launched Qwen-Robot Suite with three models for navigation, manipulation, and world modeling.

The Sequence Knowledge #882: A New Series About Distillation

2026-06-24 10:35 UTC

A deep dive into one of the most important techniques in modern AI — distillation — and how it addresses the cost, deployment, and specialization challenges of large-scale models.

Distillation makes AI models more efficient and deployable, addressing scale-induced challenges.
Scale drove AI progress but led to expensive, slow, and difficult-to-specialize models.

The Sequence Special #881: The Soccer World Cup of AI Models

2026-06-22 11:34 UTC

LayerLens launches the Stratix Cup, a soccer tournament where top AI models compete as agents in a simulated environment, testing planning, adaptation, and multi-agent coordination.

LayerLens introduces the Stratix Cup, a soccer tournament for AI models.
The competition tests agentic capabilities: pre-game strategy, real-time gameplay, and halftime adaptation.

The Sequence Radar #880: Last Week in AI: A $60B Cursor Deal, Google's Brain Drain, and Midjourney's Body Scanner

2026-06-21 11:02 UTC

A week of really unexpected turns in the AI market: SpaceX acquires Cursor for $60B, key researchers leave Google, and Midjourney reveals a full-body medical scanner.

1. SpaceX acquires Cursor for $60B in stock, signaling AI tooling as strategic infrastructure.
2. Noam Shazeer and John Jumper leave Google, highlighting talent consolidation in AI frontier.

The Sequence AI of the Week #878: Inside Google Deepmind's First Real Crack in Next-Token Generation

2026-06-17 10:56 UTC

Google DeepMind has released DiffusionGemma, a text-diffusion model that challenges traditional transformer architectures by not generating text left-to-right token by token.

DiffusionGemma is a text-diffusion model from Google DeepMind.
It challenges the conventional transformer architecture.

The Sequence Knowledge #878: Beyond Transformer: What We Learned

2026-06-16 11:03 UTC

This article concludes the series on alternatives to the Transformer, covering four families: recurrent/linear-recurrent models, state space models, text diffusion models, and liquid/continuous-time models. It also announces a new series on knowledge distillation.

Self-attention has quadratic scaling and memory costs for long sequences.
Four alternative directions: recurrent (constant memory), state space (linear scaling), text diffusion (parallel generation), liquid (continuous-time dynamics).

The Sequence Radar #877: Last Week in AI: Anthropic Ships, Apple Borrows, Musk Lists, Bezos Builds

2026-06-14 11:03 UTC

A major week in AI: Anthropic launches Claude Fable 5 and Mythos 5, Apple debuts Siri AI, SpaceX goes public in record IPO, and Bezos's Prometheus raises $12B to build an 'artificial general engineer'.

Anthropic releases Claude Fable 5 and Mythos 5, decoupling capability from access
Apple unveils Siri AI with a custom 1.2-trillion-parameter Gemini model, leveraging personal context

The Sequence Opinion: Systems of Record vs. Systems of Action

2026-06-11 11:03 UTC

A new business software paradigm for the agentic era.

Traditional enterprise software centered on humans as actors.
Agentic AI shifts focus from systems of record to systems of action.

The Sequence AI of the Week #875: Why Your Language Model Needs a Nap

2026-06-10 10:39 UTC

The paper 'Language Models Need Sleep' argues that LLMs suffer from anterograde amnesia, unable to learn after training, and proposes a sleep-like consolidation mechanism.

LLMs are static after pre-training, unable to learn new information.
They exhibit anterograde amnesia, lacking long-term memory formation.

The Sequence Knowledge #874: Transformers or Not?

2026-06-09 11:03 UTC

The Transformer is currently the reference architecture for AI due to its scaling properties, but its attention mechanism is expensive. The article questions whether Transformers are the final architecture or just the first scalable one.

Transformers excel due to attention mechanism, applicable to diverse data types.
Attention is computationally expensive and scales poorly with sequence length.

The Sequence Radar #873: Last Week in AI: Soccer, S-1s, and Supermodels

2026-06-07 11:00 UTC

A new AI soccer tournament, major model releases, fundraises and Anthropic's S-1.

LayerLens announced the Stratix Cup, a simulated soccer tournament for frontier AI models.
Microsoft unveiled new MAI models at Build, signaling AI as an operating system.

The Sequence Opinion #872: The Cake Is a Battlefield: Who Really Controls the AI Stack

2026-06-04 10:58 UTC

Jensen Huang's five-layer AI cake seems harmonious, but strategists see a battlefield over margin pools. The key to control is owning the scarce layer and the seam adjacent to it.

Huang's cake metaphor from a chip vendor's perspective highlights mutual reinforcement.
Strategists view the stack as five stacked margin pools vulnerable to commoditization.

The Sequence AI of the Week #871: Inside the Loop with Claude Opus 4.8

2026-06-03 11:01 UTC

Claude Opus 4.8, released on May 28, 2026, may seem like a minor version bump, but it delivers significant reliability improvements including a 4x reduction in undetected code flaws, fixes for silently skipped tool calls, better compaction recovery for long trajectories, dynamic workflows, adaptive thinking, and a fast mode that is 2.5x faster and 3x cheaper than 4.7. The release focuses on calibration and honesty, making it a critical update for production agent loops.

Opus 4.8 improves calibration and honesty, reducing instances of the model leaving flaws in its own code unremarked by about 4x.
It fixes silently skipped tool calls and improves compaction recovery, enhancing long-horizon run reliability.

The Sequence Knowledge #870: Liquid Models and the Search for a Post-Transformer Architecture

2026-06-02 11:03 UTC

This article examines the limitations of the Transformer architecture and introduces liquid models as a promising alternative for low-latency, private on-device intelligence.

Transformer's global attention leads to high memory and compute costs during inference.
Liquid models use dynamics instead of attention, offering efficiency for real-time and edge scenarios.

The Sequence Radar #869: Last Week in AI: The Token Becomes the Unit of Account — Opus 4.8, OpenRouter, Cognition, Snowflake, and a papal warning

2026-05-31 11:02 UTC

Anthropic's Claude Opus 4.8 edges closer to operational profitability; OpenRouter and Cognition raise massive rounds; Snowflake inks $6B AWS deal; Pope Leo XIV warns against AI dominance. The industry shifts from model-centric competition to a token-based economy.

Claude Opus 4.8 shows modest gains in coding and reasoning, with new effort control, dynamic workflows, and improved honesty.
OpenRouter raises $113M at $1.3B valuation, processing 25T tokens weekly; Cognition raises $1B, with Devin writing 89% of internal code.

The Sequence Opinion #868: Recursion Is the New Scaling Law

2026-05-28 11:02 UTC

For most of the modern AI era, scaling laws drove progress. But recursion — the ability of models or systems to revisit, revise, search, and simulate — is becoming the new scaling dimension. This shift marks a paradigm change from single forward passes to iterative computation.

Traditional AI progress relied on larger models and more data, but recursion is emerging as the new frontier.
Recursion enables models to iteratively improve answers rather than producing a one-shot output.

The Sequence AI of the Week #867: Thinking in Latents: Why Sapient's HRM-Text Is a Quiet Rebuke to Chain-of-Thought

2026-05-27 11:01 UTC

This article criticizes Chain-of-Thought (CoT) reasoning in LLMs as inefficient, since it forces reasoning to leave the residual stream and become discrete tokens. Sapient Intelligence's HRM-Text addresses this by performing reasoning in latent space, providing variable internal depth for fixed-depth Transformers, thus challenging current reasoning paradigms.

Chain-of-Thought (CoT) is not true reasoning but a workaround that makes models 'rent depth' from output tokens.
Sapient Intelligence's HRM-Text performs reasoning in latent space, not in the token stream.

The Sequence Knowledge #866: Three Text Diffusion Models You Need To Know About

2026-05-26 10:49 UTC

Text diffusion models challenge the autoregressive paradigm by generating text through iterative denoising, treating generation as editing rather than typing. Three key systems define the field: LLaDA (proof of scaling), Mercury (commercial speed advantage), and Gemini Diffusion (frontier validation), representing the three phases of a new architecture class: scientific proof, industrial deployment, and frontier validation.

Text diffusion models generate text by iterative refinement from noise, using bidirectional context.
LLaDA proved diffusion can scale to a large language model.

The Sequence Radar #865: Last Week in AI: Karpathy, Google, Colossus, and the Coming IPO Wave

2026-05-24 11:00 UTC

The last three weeks marked a phase transition in AI: Google unveiled Gemini Omni and an agent-first platform; Andrej Karpathy joined Anthropic to accelerate pretraining; Anthropic secured a $45B compute lease from xAI's Colossus; Cerebras IPO surged to a ~$95B market cap; and SpaceX, OpenAI, and Anthropic are planning to go public within six months, collectively worth trillions. Research highlights include HRM-Text efficient pretraining, AI reviewer evaluation, NVIDIA's unified AR-diffusion model, and more.

Google I/O introduced Gemini Omni, Gemini 3.5 Flash, Antigravity agent platform, and TPU 8i for a vertically integrated agent pipeline.
Andrej Karpathy joined Anthropic to lead a team using Claude to accelerate pretraining, signaling a practical self-improvement flywheel.

The Sequence Opinion #864: Every AI Agent Needs a Computer

2026-05-21 10:45 UTC

The next phase of AI agents will be defined by access to a computer—filesystem, terminal, browser, etc.—not just better models. The market for agentic sandboxes is emerging.

AI agents need a real execution environment including filesystem, terminal, network, etc.
An agent that can only emit tokens is a brain in a jar, lacking agency.

The Sequence AI of the Week #863: The Model is the Interface: Inside Thinking Machines' Interactive Models

2026-05-20 11:03 UTC

Thinking Machines’ interactive models turn real-time conversation, vision, audio, and tool use into one continuous learned system.

Thinking Machines introduces interactive models that integrate multiple modalities in real time.
The current text-based LLM paradigm is insufficient for real-time collaboration.

The Sequence Knowledge #862: Learning About Text Diffusion Models

2026-05-19 11:03 UTC

Text diffusion models are emerging as a credible alternative to autoregressive transformer models for language generation, overcoming limitations like generation drift and the reversal curse.

Diffusion models rule visual AI but have been an afterthought in text.
Autoregressive models have inherent flaws: left-to-right generation, no global planning, and cascading errors.

The Sequence Radar #861: Last Week in AI: IPOs, Interactive Models, and Recursive Dreams

2026-05-17 11:02 UTC

Last week in AI was marked by Cerebras's massive IPO, Thinking Machines' interactive models that embed collaboration into the model itself, Recursive Superintelligence's $650M launch for self-improving AI, and Junyang Lin's new AI lab at ~$2B valuation in China.

Cerebras IPO surged 68%, reaching ~$95B market cap, emphasizing the physical infrastructure of AI.
Thinking Machines unveiled interaction models where real-time collaboration is built into the model, not the harness.

The Sequence Opinion #860: Every Company’s Last eXam: Some Reflection About Practical AI Evals

2026-05-14 11:03 UTC

Evaluations are becoming the fourth pillar of modern AI, alongside compute, data, and models. Every company needs its own dynamic evaluation suite tailored to its workflows, not generic benchmarks.

Evaluations are emerging as the fourth pillar of AI.
Companies require private evaluation systems for their unique workflows.

The Sequence AI of the Week #859: Reading Claude’s Mind in English: A Note on Natural Language Autoencoders

2026-05-13 11:50 UTC

Anthropic's new Natural Language Autoencoders allow researchers to get direct English descriptions of what an LLM is thinking, marking a significant step in interpretability.

Anthropic introduces Natural Language Autoencoders (NLA) that produce unsupervised English explanations of LLM activations.
NLA allows researchers to ask 'what are you thinking?' and get bullet-point answers.

The Sequence Knowledge #858: How State Space Models Went from Curiosity to Serious Transformer Competitor

2026-05-12 10:39 UTC

State space models (SSMs) are emerging as a viable alternative to Transformers, offering linear time complexity and constant memory during inference. This article explores the mathematical foundations, recent breakthroughs, and how SSMs now compete on key language tasks.

Transformer self-attention suffers from O(n²) complexity, limiting long-context scalability.
State space models achieve linear complexity with no KV-cache, enabling efficient inference.

The Sequence Radar #857: Last Week in AI: Inside the Machine, Outside the Text Box

2026-05-10 11:01 UTC

This week's AI developments highlight a shift from a model race to an infrastructure race. Anthropic's natural language autoencoders enable interpretability via language, OpenAI's voice models push conversational interfaces, SubQ claims a 12M-token context window, and Chinese AI labs like DeepSeek and Moonshot see soaring valuations. The editorial underscores that AI is becoming more inspectable, conversational, memory-rich, and institutionally valuable.

Anthropic's natural language autoencoders turn model activations into readable text, opening new interpretability paths
OpenAI's voice models transform AI from text-based queries to real-time conversational agents

The Sequence Opinion #856: The Salesforce of agents won't be Salesforce, The Google of agents won't be Google

2026-05-07 11:02 UTC

Building software for the agentic economic.

Traditional software assumes human users.
AI agents are changing the fundamental assumptions.

The Sequence AI of the Week #855: Inside Nemotron Omni: NVIDIA’s New Multimodal Brain for Agents

2026-05-06 10:30 UTC

NVIDIA's Nemotron 3 Nano Omni is a multimodal reasoning model that unifies video, audio, image, and text processing into a single efficient model for agentic workflows, avoiding the lossy pipeline of separate models.

Nemotron 3 Nano Omni integrates video, audio, image, and text into one model.
Designed to replace the fragmented pipeline of separate ASR, VLM, and OCR models.

TheSequence