Building a Stateful AI Agent: Centri – A Memory-First Coding Agent
Centri is an open-source project that addresses AI agent forgetfulness through a durable memory spine. It uses an append-only event spine as the source of truth, with a derived, replayable memory graph and deterministic context curation. Key features include event spine, typed memory graph, deterministic curation, FTS5 verbatim recall, LLM consolidation, temporal recall, ACP-based coding delegation, tool contract, and history import. Benchmarks show Centri outperforms Letta in stale-fact handling.
Notifications You must be signed in to change notification settings
Fork 0
Star 1
BranchesTags
Open more actions menu
Folders and files
NameName
Last commit message
Last commit date
Latest commit
History
14,178 Commits
14,178 Commits
.github
.github
.husky
.husky
.vscode
.vscode
.zed
.zed
core
core
deploy
deploy
docs
docs
github
github
infra
infra
nix
nix
packages
packages
patches
patches
perf
perf
script
script
scripts
scripts
sdks/vscode
sdks/vscode
shell
shell
specs
specs
upstream/opencode
upstream/opencode
.dockerignore
.dockerignore
.editorconfig
.editorconfig
.env.example
.env.example
.gitattributes
.gitattributes
.gitignore
.gitignore
.gitleaksignore
.gitleaksignore
.oxlintrc.json
.oxlintrc.json
.prettierignore
.prettierignore
AGENTS.md
AGENTS.md
CONTEXT.md
CONTEXT.md
CONTRIBUTING.md
CONTRIBUTING.md
FORK-NOTES.md
FORK-NOTES.md
HANDOFF.md
HANDOFF.md
LICENSE
LICENSE
LICENSE-OPENCODE
LICENSE-OPENCODE
README.md
README.md
SECURITY.md
SECURITY.md
STATE-AND-HANDOFF.md
STATE-AND-HANDOFF.md
VERIFY.md
VERIFY.md
bun.lock
bun.lock
bunfig.toml
bunfig.toml
docker-compose.yml
docker-compose.yml
flake.lock
flake.lock
flake.nix
flake.nix
install
install
package.json
package.json
screenshot-uk.png
screenshot-uk.png
sst-env.d.ts
sst-env.d.ts
sst.config.ts
sst.config.ts
tsconfig.json
tsconfig.json
turbo.json
turbo.json
Repository files navigation
A memory-first coding agent. One durable memory spine remembers everything you and your tools have done, so every new turn starts warm instead of cold.
Agents forget. Context windows fill, sessions end, and every new chat starts from scratch — so you re-explain decisions you already made and watch the agent re-propose approaches you already rejected. Centri inverts this: an append-only event spine is the source of truth, memory is a derived index that can be thrown away and re-derived, and every per-turn context is assembled fresh by a deterministic curation function that attaches a receipt to every line. The context window is a cache, not storage.
What's in this repo
Three parts share one durable memory:
Part What Lives in
Centri core Python memory API: append-only event spine, typed memory graph with bi-temporal supersession, deterministic curation, optional LLM consolidation, REST/WS surface. core/
OpenCode fork The TypeScript/Bun OpenCode app shell, patched (each patch marked // CENTRI) so every turn recalls a brief from the core and runtime events are tapped back into the spine. All memory calls fail open. packages/opencode/src/centri/
Hermes plugin A deployable memory.provider translating Hermes memory calls into the core's HTTP API. deploy/hermes-plugin/centri/
The event spine is the source of truth; the memory graph is a derived, re-derivable index over it. Centri is a fork of the MIT-licensed OpenCode project — upstream attribution is preserved in LICENSE-OPENCODE, upstream/opencode/, FORK-NOTES.md, and docs/centri-app.md.
Key features
Append-only event spine — every tool call, file edit, decision, and result is durably logged with secret redaction on write. The spine is the system's source of truth.
Typed memory graph with bi-temporal supersession — decisions, facts, and open loops. New truth invalidates old truth, but history is retained: stale facts never resurface in a brief, while the full timeline stays auditable.
Deterministic curation with receipts — each per-turn brief is rendered by one pure curate() path; every line carries a source_event_id pointing back to the ledger event it came from. The same (graph, cue, budget, policy) yields a byte-identical brief. No LLM runs at read time.
FTS5 verbatim recall — a SQLite FTS5 index over the spine lets a brief lift exact prior tokens (file names, error strings, identifiers) into context alongside the typed graph.
LLM consolidation — an offline worker folds the raw spine into the ambient layer: identity, active projects, top open loops, a short recent narrative, and a user profile of preferences and conventions the agent has seen you repeat.
Temporal recall — ask "what changed since yesterday" or "where did we leave off" and get an answer grounded in the ledger's timeline, not a guess.
Coding delegation over ACP — work is handed to a coding agent (Agent Client Protocol, JSON-RPC over stdio) with live streaming progress, an approval gate for destructive actions, and failover to a fallback hand.
First-class tool contract — tools (Composio, e.g. Tavily search) sit beside the coding hands. Every invocation is event-ledgered, side-effectful tools pass through the approval gate, read-only results fold back into memory.
History import — a one-shot bootstrap plus a continuous tail of OpenCode, Claude Code, and Cursor histories, so a fresh install starts with complete memory instead of a blank slate.
Hermes structured ingestion — Hermes chat is ingested as typed, dedupable envelopes (hermes.user.message, hermes.assistant.message, hermes.tool.result, hermes.memory.write), not flattened text.
Architecture
Events are the source of truth; memory is a derived, re-derivable index over them.
Layer What it is
Web UI OpenCode fork web app — activity timeline, task cards, approvals.
Coordinator Python core loop: understand → decide → act → narrate → remember
Event spine Append-only SQLite ledger + in-memory bus, with secret redaction on write
Memory Typed decisions/facts/open-loops with bi-temporal supersession, derived from the spine
Hands Capability router over coding agents — real ACP client, OpenCode fallback
Tools ToolProvider contract with Composio; event-ledgered, approval-gated invocation
See docs/README.md for the full index, or docs/architecture.md, docs/memory-architecture.md, and docs/event-contract.md for detail.
Quickstart
Docker
cp .env.example .env # fill in your model gateway keys (BYOK) docker compose up -d # core on :8760, web shell on :8761
Manual
cd core python -m venv .venv . .venv/bin/activate pip install -e ".[dev]"
cp ../.env.example ../.env # fill in your keys (BYOK) python -m pytest tests/ -v # run the suite centri # start the server (or: python -m centri.cli)
The API listens on 127.0.0.1:8760 by default. Health checks:
curl localhost:8760/health curl localhost:8760/status
Configuration
All configuration is environment-driven — copy .env.example to .env and fill in your keys. Nothing is committed; .env and *.db are gitignored. Centri is BYOK (bring your own keys) and model-agnostic:
Model gateway — LITELLM_BASE_URL / LITELLM_API_KEY point at any OpenAI-compatible provider. Role models are set per task (MODEL_INTENT, MODEL_REASONING, …).
Auth — CENTRI_AUTH_TOKEN gates every REST route except /health (and the /events/stream WebSocket via ?token=). Empty means auth off; set it before exposing a port.
Tools — CENTRI_COMPOSIO_API_KEY enables Composio; CENTRI_COMPOSIO_TOOLS is a comma-separated allowlist (default TAVILY_SEARCH). With no key the provider reports unavailable-with-reason and never touches the network.
Embeddings (optional) — off by default (lexical + FTS5 recall only). Turn on semantic recall with CENTRI_EMBEDDING_*; see .env.example.
Ingest paths — CENTRI_INGEST_OPENCODE_PATHS, CENTRI_INGEST_CLAUDE_CODE_PATHS, and CENTRI_INGEST_CURSOR_PATHS override the per-platform probe for histories in unusual locations; CENTRI_INGEST_DISABLED_AGENTS opts an agent out.
Service startup
The reference deployment runs two services sharing one memory DB (~/.centri/state.db):
Service Port What
centri-core.service 8760 The Centri core (centri serve).
opencode.service 4096 The OpenCode fork web UI (opencode web … --port 4096), pointed at the core via CENTRI_URL / CENTRI_TOKEN.
sudo systemctl enable --now centri-core opencode systemctl is-active centri-core opencode curl -fsS http://127.0.0.1:8760/health # -> {"status":"ok",...}
Full deployment guide (systemd, Caddy/TLS, ports): docs/DEPLOY.md. The xdg-open ENOENT line in the opencode.service logs is harmless — OpenCode tries to auto-open a browser on a headless box and keeps running after the spawn fails.
Hermes integration
Centri also ships as a Hermes memory.provider. A thin plugin (CentriMemoryProvider) translates Hermes memory calls into the core's HTTP API: prefetch → POST /memory/recall, sync_turn / on_memory_write → batched POST /events/import. The deployable plugin lives at deploy/hermes-plugin/centri/.
~/.hermes/config.yaml
memory: provider: centri centri: api_base: http://127.0.0.1:8760 auth_token: # same value as the core + the fork's CENTRI_TOKEN
Restart Hermes after any plugin change. Full guide: docs/HERMES-INTEGRATION.md.
Memory import
A fresh install imports your existing coding-agent histories so memory is complete from day one. Discover what's available, then bootstrap once (idempotent — re-running imports nothing new):
curl localhost:8760/ingest/discover # "found N OpenCode messages, M Cursor sessions" curl -X POST localhost:8760/ingest/bootstrap \ -H 'content-type: application/json' -d '{}' # one-time full import
After bootstrap, the ambient tail keeps pulling new history each scheduler tick.
Benchmark
centri-bench is a falsifiable head-to-head: each engine assembles a per-turn brief from the same seeded history, scored on brief completeness, re-proposal rate, next-step correctness, and stale-fact handling. Native Centri is run against a real Letta server (v0.16.8, pgvector archival) — both a deterministic rubric and an LLM judge agree Centri wins, the gap entirely on stale-fact supersession.
Metric (avg over 3 personas) Centri Letta
brief completeness ↑ 1.00 1.00
re-proposal rate ↓ 0.00 0.00
next-step correct ↑ 1.00 1.00
stale-fact correct ↑ 1.00 0.67
composite ↑ 1.00 0.93
Letta is a benchmark comparison only, not a runtime dependency — Centri runs without it. Run it yourself:
cd core python -m centri.bench.run # human-readable report python -m centri.bench.run --json # machine-readable scores
Methodology: docs/centri-bench.md; raw results: docs/bench-results/.
Status
The core, memory graph, curation, ACP coding loop, tool contract, and history ingest are covered by a 385-test suite (cd core && python -m pytest tests/). The unit suite runs green offline; integration tests that need a live core, BYOK model keys, the opencode binary, or a real Letta server are environment-gated and skip cleanly when those are absent.
The OpenCode fork web app builds and typechecks. The roadmap lives in docs/ROADMAP.md; the full docs index is docs/README.md.
License
Centri is licensed under the MIT License — see LICENSE. The OpenCode app shell in packages/opencode/ is a fork of the MIT-licensed OpenCode project; its upstream license is preserved at LICENSE-OPENCODE and the original upstream README is kept at upstream/opencode/. Runtime dependencies are permissively licensed (MIT/BSD/Apache-2.0); letta-client is bench-only tooling, not a product dependency.
About
Memory-native OpenCode fork with Centri core, event-spine memory, verbatim recall, and Hermes integration.
Topics
memory
sqlite
opencode
hermes
fts5
ai-agent
llm
agent-memory
Resources
Readme
License
MIT, MIT licenses found
MIT
LICENSE
MIT
LICENSE-OPENCODE
Contributing
Contributing
Security policy
Security policy
Uh oh!
There was an error while loading. Please reload this page.
Activity
Stars
1 star
Watchers
0 watching
Forks
0 forks
Report repository
Releases
No releases published
Packages 0
Uh oh!
There was an error while loading. Please reload this page.
Contributors
Uh oh!
There was an error while loading. Please rel
[truncated for AI cost control]