AI News HubLIVE
站内改写4 min read

Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities

Vercel has open-sourced Eve, an Apache-2.0 agent framework now in public preview. An agent is a directory of files, with durable execution, sandboxes, approvals, connections, channels, and evals built in. Scaffold with npx eve@latest init and deploy unchanged via vercel deploy.

SourceMarkTechPostAuthor: Asif Razzaq

Vercel has released eve, an open-source framework for building, running, and scaling agents. The project is published as the npm package eve, licensed under Apache-2.0.

Building an agent should mean defining what it does. It should not mean assembling all the plumbing that an agent needs to run in production.

eve is the framework Vercel builds and runs its own agents on. According to Vercel post, it runs more than a hundred agents in production today.

What is eve?

eve is a filesystem-first framework for durable backend agents. You create an agent as a directory on disk. The directory is the contract.

Each file describes one component of the agent. At a glance, the tree shows what an agent is and does. It also shows where it lives and when it acts on its own.

The smallest agent that runs is two files. One sets the model. The other sets the instructions.

Copy CodeCopiedUse a different Browser

// agent/agent.ts import { defineAgent } from "eve";

export default defineAgent({ model: "anthropic/claude-opus-4.8", });

The model is one line, and provider fallbacks are supported through AI Gateway. The instructions.md file becomes the system prompt that eve puts in front of every model call.

An agent is a directory

Vercel’s core idea is that agents have a shape. Every team kept rebuilding the same structure to meet the same needs. eve makes that shape into a framework.

The directory layout maps each capability to a folder. Here is the contract:

PathRoleFormat

agent.tsThe model it runs on, plus runtime configTypeScript

instructions.mdWho it is, prepended to every model callMarkdown

tools/What it can do; filename becomes the tool nameTypeScript

skills/What it knows; loaded only when the topic comes upMarkdown

connections/Secure links to MCP servers and OpenAPI APIsTypeScript

sandbox/Optional override of the agent’s sandbox; seeds workspace filesDirectory

subagents/Specialist child agents it delegates toDirectory

channels/Where it lives, like Slack or HTTPTypeScript

schedules/When it acts on its own, on a cronTypeScript

lib/Shared authored code used across the agentTypeScript

You add a tool, skill, channel, or schedule by adding a file. eve picks them up at build time and wires them in. There is no boilerplate to register them.

A tool is one TypeScript file with a Zod input schema. Its filename and place in the tree become its definition.

Copy CodeCopiedUse a different Browser

// agent/tools/run_sql.ts import { defineTool } from "eve/tools"; import { z } from "zod";

export default defineTool({ description: "Run a read-only SQL query.", inputSchema: z.object({ sql: z.string() }), needsApproval: ({ toolInput }) => estimateScanGb(toolInput.sql) > 50, async execute({ sql }) { /* ... */ }, });

What ships in the box

Vercel describes eve as ‘batteries included.’ Six production capabilities come with the framework:

Durable execution: Every conversation is a durable workflow, with each step checkpointed. A session can pause, survive a crash or a deploy, and resume where it stopped. This is built on the open-source Workflow SDK.

Sandboxed compute: Agent-generated code is treated as untrusted. Every agent gets its own sandbox for shell commands, scripts, and file reads and writes. The backend is an adapter, running on Vercel Sandbox when deployed and on Docker, microsandbox, or just-bash locally.

Human-in-the-loop approvals: Any action can be set to require approval. The agent pauses there and waits, indefinitely if needed, without consuming compute. Once approved, eve continues from where it left off.

Secure connections: A connection is a file pointing at an MCP server or an OpenAPI-compatible API. eve brokers the auth, and the model never sees the URL or credentials. At launch, agents can connect to Slack, GitHub, Snowflake, Salesforce, Notion, and Linear.

Channels: The same agent serves every surface. The HTTP API is on by default, with Slack, Discord, Teams, Telegram, Twilio, GitHub, and Linear included. One channel can hand off to another.

Tracing and evals: Every run produces a trace using standard OpenTelemetry spans. They export to Braintrust, Honeycomb, Datadog, or Jaeger. Evals are scored test suites you run locally or wire into CI.

Use cases, with real examples

Vercel published six agents it runs internally on eve:

d0, the data analyst: Its most-used internal tool, handling more than 30,000 questions a month. Every query is scoped to the asker’s own permissions.

Lead Agent, the autonomous SDR: It works every new lead and follows up on its own. Vercel says it costs about $5,000 a year and returns 32 times that, maintained part-time by one engineer.

Athena, the sales cockpit: RevOps built it in six weeks without engineers. It answers pipeline questions from Snowflake and Salesforce in plain language.

Vertex, the support engineer: It handles tickets across the help center, docs, and Slack. Vercel reports it solves 92% of tickets on its own and escalates the rest.

draft0, the content agent: It runs a review pipeline that catches glaring issues before a human editor sees the piece.

V, the routing agent: Tasks go to V in Slack first. V routes each one to the agent that can answer it.

Interactive Simulation

eve versus a hand-rolled agent stack

Most teams assemble these pieces themselves for each new agent. The table below maps that work against what eve provides.

CapabilityTypical DIY stackeve

AuthoringCustom loop, manual tool registrationFiles in a directory, discovered at build

DurabilityBespoke state and retry handlingCheckpointed durable workflow per session

Code executionSelf-managed container or VMPer-agent sandbox via swappable adapter

ApprovalsCustom pause-and-resume logicneedsApproval field on any action

SurfacesOne integration per channelOne adapter file per channel

ObservabilityStitched together from logsOpenTelemetry traces and evals built in

DeployProvision infrastructurevercel deploy, runs unchanged from local

The comparison reflects eve’s documented capabilities. Specifics of other frameworks vary by version and setup.

Getting started

You can scaffold and start a new agent with one command. It installs dependencies, scaffolds the project, and starts a dev server.

Copy CodeCopiedUse a different Browser

npx eve@latest init my-agent

eve dev runs the agent locally with an interactive terminal UI. eve eval runs your test suites. eve build compiles inspectable artifacts under .eve/.

Because an eve agent is an ordinary Vercel project, vercel deploy ships it to production unchanged. The sandbox swaps to Vercel Sandbox without a code change.

Key Takeaways

eve is Vercel’s open-source, Apache-2.0 agent framework, now in public preview.

An agent is a directory of files; each folder maps to one capability.

Durable execution, sandboxes, approvals, connections, channels, and evals ship built in.

Vercel runs 100+ agents on eve, including a data analyst handling 30,000 questions monthly.

Scaffold with npx eve@latest init, then deploy unchanged via vercel deploy.

Check out the Repo, Product page, Docs and Technical details. Also, feel free to follow us on Twitter and don’t forget to join our 150k+ML SubReddit and Subscribe to our Newsletter. Wait! are you on telegram? now you can join us on telegram as well.

Need to partner with us for promoting your GitHub Repo OR Hugging Face Page OR Product Release OR Webinar etc.? Connect with us

The post Vercel Releases Eve: An Open-Source AI Agent Framework Where Each Agent is a Directory of Files Mapped to Capabilities appeared first on MarkTechPost.