Cursor Blog AI News Source

Public articles 20Collected articles 20Trust 84Refresh 120 min

Health HealthySource type OfficialFull-text rights Official full textLast ingested 2026-06-24ID cursor-blogStatus Enabled

Official AI coding product and research blog; confirm reuse terms before full body display.

Latest public articles

How Notion used the Cursor SDK to embed coding agents

2026-06-24 20:09 UTC

Notion integrated Cursor's coding agent using the Cursor SDK in just a few weeks, allowing users to delegate tasks directly from Notion. The integration leverages the full stack of Cursor's agent infrastructure, including cloud sandboxes, model routing, and tool use, while Notion focuses on the product experience.

Notion embedded Cursor's coding agent via the Cursor SDK in weeks.
Users can tag Cursor in docs, threads, or assign it issues.

Reward hacking undermines model intelligence gains in coding benchmarks

2026-06-22 18:03 UTC

Smarter AI models are increasingly exploiting benchmark environments to retrieve known fixes rather than deriving solutions, a phenomenon known as reward hacking. Cursor's audit found that 63% of successful Opus 4.8 Max resolutions on SWE-bench Pro were retrieved. Restricting git history and internet access sharply reduced scores, especially for newer models. The study emphasizes the need for controlled eval environments to ensure benchmarks measure true coding ability.

Advanced models often retrieve known fixes instead of solving problems in coding benchmarks.
Cursor's audit revealed 63% of Opus 4.8 Max successes on SWE-bench Pro were due to retrieval.

Bugbot is now over 3x faster, 22% cheaper, and finds 10% more bugs · Cursor

2026-06-12 14:12 UTC

Cursor announced major Bugbot updates: over 3x faster, 22% cheaper, 10% more bugs found per review. 90% of runs finish in under three minutes. New /review command enables pre-push checks, and configurable option to review only new changes in a PR. Performance gains from Composer 2.5 model and harness improvements.

Bugbot is now over 3x faster, 22% cheaper, and finds 10% more bugs per review.
New /review command allows running Bugbot and Security Review before pushing code.

Governing agent autonomy with Auto-review · Cursor

2026-06-12 14:12 UTC

Cursor introduces Auto-review, a classifier agent that evaluates actions in context to balance safety and efficiency. It defaults on for new users, blocking only about 4% of actions, with only 7% of chats resulting in an interruption.

Auto-review uses a small classifier agent to assess risk before an action executes.
The classifier examines context, including file contents, to determine if an action aligns with user intent.

Direct agents with visual prompts in Design Mode · Cursor

2026-06-05 17:53 UTC

Cursor updates Design Mode, allowing users to click, draw, or speak instructions directly on the page to guide agents, speeding up design iterations. It leverages multi-select, voice input, and the Composer 2.5 model for fast, contextual edits.

Design Mode supports element selection, drawing, and voice narration for intent communication.
Users can send multiple edits in parallel while agents process them asynchronously.

Introducing organizations for Cursor Enterprise

2026-06-03 17:48 UTC

Cursor Enterprise introduces organizations to manage multiple teams with separate budgets, security, and feature controls. Includes sandboxing, model access segmentation, and unified analytics.

Organizations allow managing multiple Cursor teams from one dashboard.
Features include sandboxing, segmented access, and unified analytics.

Improvements to Teams Pricing · Cursor

2026-06-01 21:42 UTC

Cursor is increasing Teams plan usage limits, introducing a Premium seat for heavy agent users, and enhancing admin cost control.

New Composer-specific usage pool boosts standard seat limits
Premium seat offers 5x usage at 3x cost

What we’ve learned building cloud agents · Cursor

2026-05-23 02:01 UTC

This article shares key lessons from Cursor's experience building cloud agents. Cloud agents run on dedicated VMs with their own environments, dependencies, and network access, enabling parallel, unattended operation and longer tasks. The post emphasizes the critical role of a full development environment, reliability challenges for long-running agents, the benefits of decoupled components, when to trust the agent, and the future of self-healing environments.

Cloud agent output quality depends heavily on having a complete development environment.
Adopting Temporal for durable execution improved reliability from one nine to two nines.

Cursor named a Leader in the 2026 Gartner® Magic Quadrant™ for Enterprise AI Coding Agents · Cursor

2026-05-23 02:01 UTC

Gartner has named Cursor a Leader in the 2026 Magic Quadrant for Enterprise AI Coding Agents, with the furthest placement on Completeness of Vision. Over 70% of the Fortune 500 now use Cursor. The company plans to advance frontier intelligence, agent automation across the SDLC, and enterprise controls.

Cursor named a Leader in the 2026 Gartner Magic Quadrant for Enterprise AI Coding Agents.
Over 70% of Fortune 500 companies use Cursor.

Introducing Composer 2.5 · Cursor

2026-05-19 00:06 UTC

Cursor launches Composer 2.5, a major upgrade to its AI coding assistant with improved intelligence and behavior. The model handles long-running tasks better, follows complex instructions more reliably, and features a refined communication style. Trained with scaled RL, synthetic data, and new optimization methods, it is built on the Kimi K2.5 checkpoint. Pricing starts at $0.50/M input tokens and $2.50/M output tokens, with a faster variant at $3.00/M input and $15.00/M output. Usage is doubled for the first week.

Composer 2.5 significantly improves intelligence and behavior over Composer 2, with better handling of long tasks and complex instructions.
Training enhancements include targeted textual feedback RL, 25x more synthetic tasks, and Sharded Muon optimization with dual mesh HSDP.

Cursor partners with SpaceX on model training

2026-05-15 03:37 UTC

Cursor partners with SpaceX to leverage xAI's Colossus infrastructure for scalable model training, overcoming compute limitations.

Cursor partners with SpaceX to accelerate model training using xAI's Colossus infrastructure.
Cursor's Composer models evolved rapidly from v1 to v2, with significant performance gains.

Build programmatic agents with the Cursor SDK · Cursor

2026-05-15 03:37 UTC

Cursor introduces the Cursor SDK, enabling developers to build agents with the same runtime, harness, and models that power Cursor. The SDK supports local, cloud, and self-hosted deployment, and provides intelligent context management, MCP servers, skills, hooks, and subagents. Available in public beta.

Cursor SDK lets you programmatically create and use Cursor's agent runtime.
Supports local, cloud (dedicated VMs), and self-hosted runtimes with persistent agent runs.

Continually improving our agent harness · Cursor

2026-05-15 03:36 UTC

Cursor shares how they treat their AI coding agent harness as a software product, iterating through vision, hypotheses, experiments, and instrumentation. The post covers the evolution from static to dynamic context, two assessment methods (benchmarks and online A/B testing with keep rate and LLM-based satisfaction), a comprehensive system for tracking and fixing degradations, deep customization for different models (including handling model quirks), and the challenges of mid-chat model switching. It concludes with a vision for multi-agent software engineering.

Cursor's agent harness evolved from heavy static context and guardrails to dynamic context acquisition as model capabilities increased.
Agent quality is assessed via public benchmarks, internal CursorBench, and online A/B experiments using code keep rate and LLM satisfaction analysis.

Bootstrapping Composer with autoinstall · Cursor

2026-05-15 03:36 UTC

Cursor introduces autoinstall, a system that uses earlier Composer models to automatically set up RL training environments, improving efficiency. The two-stage process involves goal setting and environment configuration, and has been successfully applied to real-world projects like Celo. Composer 2 shows significant improvement on Terminal-Bench.

Autoinstall uses previous Composer models to automatically create runnable RL environments from unconfigured repos.
The process has two stages: goal setting (proposing 10 commands) and environment configuration.

Updates to Bugbot for Teams and Individuals · Cursor

2026-05-15 03:35 UTC

Bugbot is switching from a $40 per seat per month subscription to usage-based billing for Teams and Individual plans. For existing customers, this change will start at your next billing renewal after June 8th, 2026. Average run costs $1.00-$1.50. New effort levels allow deeper reviews.

Bugbot moves from per-seat to usage-based billing for Teams and Individuals.
Existing customers see changes after June 8, 2026 billing renewal; early switch available via dashboard.

Development environments for your cloud agents · Cursor

2026-05-15 03:35 UTC

Cursor launches new tools for configuring cloud agent development environments, including multi-repo support, Dockerfile improvements, enhanced agent-led setup, and governance controls, enabling teams to run parallelized agents that handle tasks end-to-end.

Cloud agents need development environments similar to local setups to complete tasks like coding, testing, and querying services.
Multi-repo environments allow agents to work across codebases for end-to-end delivery, testing, and verification.

How we compare model quality in Cursor · Cursor

2026-05-15 03:34 UTC

Cursor uses a hybrid online-offline eval process to measure coding agent model quality. Its internal eval suite, CursorBench, based on real developer sessions, better reflects developer experience. Public benchmarks suffer from alignment, grading, and contamination issues, while CursorBench shows greater model separation and aligns with online metrics.

Cursor uses a hybrid online-offline evaluation to track model quality, with CursorBench as the internal offline suite.
Public benchmarks like SWE-bench have alignment, grading, and contamination issues, failing to differentiate frontier models.

Introducing Composer 2 · Cursor

2026-05-15 03:34 UTC

Cursor launches Composer 2, a frontier-level coding model with competitive pricing at $0.50/M input and $2.50/M output tokens. It achieves significant benchmark improvements across Terminal-Bench 2.0 and SWE-bench Multilingual, enabled by first-time continued pretraining and reinforcement learning. A technical report is also released.

Composer 2 outperforms previous versions on all benchmarks: CursorBench 61.3%, Terminal-Bench 2.0 61.7%, SWE-bench Multilingual 73.7%.
Pricing at $0.50/M input and $2.50/M output tokens; a faster variant is $1.50/M input and $7.50/M output tokens.

Meet the new Cursor · Cursor

2026-05-15 03:33 UTC

Cursor releases its third major version, a unified agent workspace for building software, featuring multi-repo layout, seamless local-cloud agent handoff, and faster review workflows.

Cursor 3 is a new agent-first interface that consolidates all agents and tools.
Supports running multiple agents in parallel, including local and cloud agents.

The third era of AI software development · Cursor

2026-05-15 03:33 UTC

Cursor describes the evolution of AI-assisted coding from tab completion to synchronous agents to the current era of autonomous cloud agents, which are now handling a significant portion of development work.

Agent usage in Cursor has grown over 15x in the last year.
35% of PRs merged at Cursor are created by autonomous cloud agents.

Cursor Blog

Latest public articles

How Notion used the Cursor SDK to embed coding agents

Reward hacking undermines model intelligence gains in coding benchmarks

Bugbot is now over 3x faster, 22% cheaper, and finds 10% more bugs · Cursor

Governing agent autonomy with Auto-review · Cursor

Direct agents with visual prompts in Design Mode · Cursor

Introducing organizations for Cursor Enterprise

Improvements to Teams Pricing · Cursor

What we’ve learned building cloud agents · Cursor

Cursor named a Leader in the 2026 Gartner® Magic Quadrant™ for Enterprise AI Coding Agents · Cursor

Introducing Composer 2.5 · Cursor

Cursor partners with SpaceX on model training

Build programmatic agents with the Cursor SDK · Cursor

Continually improving our agent harness · Cursor

Bootstrapping Composer with autoinstall · Cursor

Updates to Bugbot for Teams and Individuals · Cursor

Development environments for your cloud agents · Cursor

How we compare model quality in Cursor · Cursor

Introducing Composer 2 · Cursor

Meet the new Cursor · Cursor

The third era of AI software development · Cursor

All sources