Stop compaction drift. Keep the plan alive.
CodeFathom is a local semantic indexing service and MCP server that keeps CLI code agents coherent during long sessions by offloading repo context from the prompt into an on-device vector and symbol index.
$ codefathom index ./my-repo
✓ Indexed 1,247 files
✓ 3,892 symbols extracted
✓ Vector embeddings generated
Agent: codefathom.search_semantic("auth session schema")
→ 3 relevant snippets (2,100 tokens saved)
The problem
Context windows fill up. Compaction kicks in. Plans drift.
Serious development tasks require opening many files, tracing references, reading configs, and iterating on a plan—exactly what eats context.
To keep going, CLIs compact or summarize the conversation. That discards nuanced constraints, earlier discoveries, and the reasons the plan changed, especially in ideation-heavy sessions.
A large share of context budget is spent on raw code text, crowding out the thread's reasoning narrative—the plan itself.
The solution
Keep the thread. Offload the code.
CodeFathom prevents compaction drift by changing what goes into the prompt. It vectorizes and structures the repo locally, exposes MCP tools so the agent retrieves only the minimal code slices it needs, and keeps the conversation about intent, constraints, and decisions rather than pasted files.
Developer experience
Before CodeFathom
Dev: "Let's explore 3 approaches to migrate auth and decide."
Agent reads many files, traces references, and proposes a plan.
The session grows and compaction triggers.
After compaction, the agent loses nuanced constraints and drifts into a different plan.
After CodeFathom
Dev: "We're switching to approach #2."
Agent calls codefathom.search_semantic, codefathom.find_references, and codefathom.get_related.
Agent: "Proceeding without re-reading whole files. Here's the updated plan."
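For a sense of what one of those tool calls could look like on the wire, here is a minimal sketch. MCP tool invocations are JSON-RPC 2.0 requests with the method `tools/call`. The argument names `query` and `max_tokens` are assumptions for illustration, not a documented CodeFathom schema, and the helper `build_tool_call` is hypothetical.

```python
import json


def build_tool_call(request_id, tool_name, arguments):
    """Build a JSON-RPC 2.0 request in the shape MCP uses for tool calls."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    }


# Hypothetical arguments: "query" and "max_tokens" are assumed names,
# not taken from CodeFathom's actual tool schema.
request = build_tool_call(
    1,
    "search_semantic",
    {"query": "auth session schema", "max_tokens": 800},
)

print(json.dumps(request, indent=2))
```

The agent sends a short query and receives a handful of snippets, so the conversation carries the question and the answer instead of whole files.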
MCP tools
Agent-facing tools that reduce compaction drift
search_semantic: Finds relevant code by concept
search_exact: AST-aware precise search
find_references: Reference discovery
get_related: Dependency and coupling neighborhood
describe_arch: Module-level summary
Every tool response is budgeted (bounded output) so the agent does not accidentally blow the window with overly large retrieval payloads.
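One way a bounded response could work is sketched below. It is an illustration under stated assumptions, not CodeFathom's implementation: the 4-characters-per-token estimate is a rough heuristic, and the names `estimate_tokens` and `apply_budget` are hypothetical.

```python
def estimate_tokens(text):
    """Rough token estimate. Assumption: about 4 characters per token."""
    return max(1, len(text) // 4)


def apply_budget(snippets, max_tokens):
    """Keep ranked snippets, in order, until the next one would exceed the budget.

    Returns the kept snippets, the tokens used, and how many were omitted.
    """
    kept = []
    used = 0
    for snippet in snippets:
        cost = estimate_tokens(snippet)
        if used + cost > max_tokens:
            break  # Stop here so the response never exceeds the budget.
        kept.append(snippet)
        used += cost
    omitted = len(snippets) - len(kept)
    return kept, used, omitted


# Three ranked snippets of 40 characters each (about 10 tokens apiece).
ranked = ["a" * 40, "b" * 40, "c" * 40]
kept, used, omitted = apply_budget(ranked, max_tokens=25)
print(len(kept), used, omitted)  # 2 20 1
```

Reporting the omitted count lets the agent decide whether to ask a narrower follow-up question rather than silently missing results.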
Technical architecture
A local service that connects files on disk to a semantic index, and the index to MCP
AST and symbol graph (definitions, references, imports/exports)
Embeddings index for conceptual search
Chunking tuned for code (functions/classes, not naive paragraphs)
Incremental updates via file watch mode
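To illustrate code-aware chunking, the sketch below splits a Python file into one chunk per top-level function or class using the standard `ast` module. This shows the general technique only. CodeFathom's actual parser, supported languages, and chunk format are not specified here, and `chunk_python_source` is a hypothetical helper.

```python
import ast


def chunk_python_source(source):
    """Return one chunk per top-level function or class definition."""
    tree = ast.parse(source)
    lines = source.splitlines()
    chunks = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            # lineno and end_lineno are 1-based and inclusive (Python 3.8+).
            text = "\n".join(lines[node.lineno - 1 : node.end_lineno])
            chunks.append(
                {
                    "symbol": node.name,
                    "kind": type(node).__name__,
                    "start_line": node.lineno,
                    "end_line": node.end_lineno,
                    "text": text,
                }
            )
    return chunks


SAMPLE = '''import os

def login(user):
    return user.upper()

class Session:
    def refresh(self):
        return True
'''

chunks = chunk_python_source(SAMPLE)
for chunk in chunks:
    print(chunk["kind"], chunk["symbol"], chunk["start_line"], chunk["end_line"])
```

Each chunk keeps its symbol name and line range, so an embedding hit can be tied back to a definition instead of an arbitrary window of text.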
Why not a cloud vector database?
Repo memory without pushing code to a managed DB
| Aspect | Cloud Vector DBs | CodeFathom |
|---|---|---|
| Storage | Metered cloud storage fees | Free (local disk) |
| Ingestion | Upload pipeline required | Index locally |
| Privacy | Code leaves your machine | Code stays local |
| Interface | App database APIs | MCP-native agent tools |
| Context | Generic vector retrieval | Code-aware (symbols, refs) |
Who this is for
Built for engineers who feel compaction drift
CLI-First Agent Users
You use Claude Code, Codex CLI, or MCP-enabled agents for real work. Long sessions hit the context limit. You restart conversations too often.
Ideation-Heavy Engineers
You explore options, revise plans, and pivot mid-stream. Compaction loses the nuance of why decisions changed.
Security-Conscious Teams
You want semantic indexing for agents but cannot send repo contents to managed vector databases.
Pricing
Infrastructure-style pricing: base license plus metered usage
Indie
+$10 CBC commit
For individual developers who want smarter agents with full codebase context.
Team
+$12 CBC commit
For teams that need shared context and pooled bandwidth.
Enterprise
Committed volume
For organizations that need VPC deployment, audit logs, and committed volume.
CBC = Context Bandwidth Credits: 1 CBC = 1,000 tokens delivered via MCP
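As a quick worked example of that conversion, the sketch below turns delivered tokens into CBC. Only the 1 CBC = 1,000 tokens ratio comes from the definition above. Whether usage is billed fractionally or rounded is not stated, so the function returns the raw fractional value, and `tokens_to_cbc` is a hypothetical name.

```python
TOKENS_PER_CBC = 1_000  # From the definition: 1 CBC = 1,000 tokens delivered via MCP.


def tokens_to_cbc(tokens_delivered):
    """Convert delivered tokens to CBC. Rounding policy is left unspecified."""
    return tokens_delivered / TOKENS_PER_CBC


# A month in which MCP tools delivered 250,000 tokens to the agent.
print(tokens_to_cbc(250_000))  # 250.0
```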
14-day trial with full features. Credit card required.
Join the beta
Keep the thread. Offload the code.
We are looking for beta testers: teams of 3-5 to validate pooled usage, shared index latency, and relevance quality in esoteric repos.
