The problem everyone has. The one nobody's solved.
Why we built our own

You open a chat with your AI assistant. You explain your project. You describe the infrastructure. You walk through the last week of decisions. You get to the actual question you came to ask.

Then the context window fills up, or the chat ends, and it all disappears. Tomorrow you explain it again. Next week your teammate explains it again. The month after that somebody new joins and explains it from scratch. Every conversation starts at zero.

The industry has responses to this. They are all partial. They solve it for one developer, for one device, for one chat tool. They do not solve it for an organization — much less an organization that has multiple people on multiple accounts working on multiple projects and needs all of that work to flow into one shared memory.

That is what Lightning MCP is. The rest of this page is the receipts.

What's out there vs. what we built
Feature comparison · April 2026

We surveyed every serious self-hosted AI memory system we could find — the open-source MCP servers, the commercial products, the academic research, the cloud reference architectures. Here's how they stack up against what Lightning MCP does today.

System Verbatim capture Multi-user pooled memory Org-level rollup Per-tenant distillation Cross-account dispatch Self-hosted production
mem0 / OpenMemory MCPSelf-hosted MCP memory for Claude CodeNo — extracts facts onlyNoNoNoNoYes
Chroma MCPVector DB as external memoryPartial — manual chunkingShared knowledge base onlyNoNoNoYes
mcp-memory-serviceTurn-level memory for agent pipelinesTurn-level, per messageNoNoNoNoYes
LiteLLM Multi-TenantOrg/team/user routing for LLM APIsNo — routes calls, doesn't storeNoBilling hierarchy onlyNoAccess-levelYes
AWS Bedrock Multi-TenantEnterprise RAG reference architectureDocument RAG, not chatPooled knowledge baseIAM tenant scopingNoVia IAM policiesCloud-only
Collaborative MemoryarXiv 2505.18279 · May 2025 researchNot specifiedTwo-tier private + sharedUsers only — no orgsNoNoResearch paper
G-MemoryarXiv 2506.07398 · Hierarchical MAS memoryTrajectory summariesMulti-agentNoNoNoResearch paper
Hobbyist Second Brain stacksMac Mini + Docker MCP setupsVariesSingle userNoNoNoYes — for one person
Lightning MCPSchomp Technologies · what you're looking atFull word-for-wordPer tenant (Unimatrix)Multi-org rollupPer tenant · training-readyCross-account routingProduction · live tenants

→ Green = supported. Amber = partial. Rose = not supported.

Five things Lightning MCP does that nothing else does
Architectural differentiators

Any one of these you can find somebody attempting. All five combined into a single working production system — this is Lightning MCP alone.

01 · VERBATIM
Word-for-word capture
Every conversation, both sides, stored exactly as spoken. No summarization. No fact extraction. No "key points." When you need to find what was actually said, you get the real sentence back — not somebody's paraphrase of it. Full text lives in Postgres. Semantic search rides on Qdrant with the full text in every payload.
02 · UNIMATRIX
Multi-org tenant pools
Each tenant gets a Unimatrix — a pooled knowledge layer that aggregates memory from every account, every user, every project belonging to that tenant. One person's work becomes everyone's context. New team members onboard into a decade of institutional knowledge instead of staring at a blank chat.
03 · DISTILLATION
Harvested fix knowledge
Every outage, every fix, every "here's what finally worked" gets harvested from the memory stream and distilled into a training-ready knowledge layer. Per tenant. Over 30,000 distilled fixes indexed in our own Unimatrix today — covering Redis, Postgres, Docker, K3s, and everything else we run.
04 · DISPATCH
Cross-account routing
Work routes to the account that can actually do it. Need code validated through Forge? The job goes to whichever account in your Unimatrix holds that capability — even if you're operating from a different account. Tenant-internal load balancing across multiple authentications.
05 · INSTANT ACCESS
No connection plumbing
"Go work on the Snorkel website and run this process." Done. No API keys to paste. No sessions to wire up. No tunnels. The project's memory, its tools, its history — all of it is there the moment you reference it. Over 400 MCP tools covering infrastructure, ITSM, code validation, scheduling, and tenant ops — available instantly from any account in the Unimatrix.
What this actually feels like
The difference in real work
Before vs. After · A working day
"I gotta go work on the Snorkel website to run this process."

That is it. That is the whole instruction. No tunnel. No API key. No context re-explanation. No re-pasting last week's decisions. The assistant already knows what Snorkel is, which environment is which, who has Forge auth, what we tried last Tuesday that didn't work, and why we rolled that change back. The project's memory is the project's memory — not a text file on somebody's laptop.
— The actual workflow, running on Lightning MCP today

Without a memory platform Everybody else

The day you've probably had
  • Paste the project brief into the chat again
  • Paste last week's decisions again
  • Explain which env is which, who owns what
  • Hunt through Slack for the credential you pasted to yourself in March
  • Context window fills up before you get to the actual work
  • Teammate asks the same question tomorrow and starts over
  • New hire onboards and it all starts over a third time

With Lightning MCP Unimatrix

How it actually runs
  • "Go work on the Snorkel site." The platform already knows
  • Last week's decisions are in the shared memory
  • Credentials live in the vault; no paste, no ask
  • Your teammate's fix from Tuesday is already in the knowledge layer
  • Context window stays open for the actual problem
  • Another company in your Unimatrix fixed this yesterday — answer's already there
  • New team member logs in and inherits everything instantly
Nobody reinvents the wheel inside a Unimatrix
The compounding effect of pooled knowledge

Say another company in your Unimatrix — a tenant you share the pooled knowledge layer with — figured out how to fix a tricky Postgres connection issue last month. The fix got harvested. It's now a record in the tenant's troubleshooting knowledge base.

You hit the same issue today. You describe the symptom. The answer comes back already worked out — tested, verified, applied successfully before. No re-diagnosis. No second round of trial and error. The knowledge compounds across every project and every user that belongs to the Unimatrix.

This is what every consulting firm, agency, and dev shop says they want when they talk about "institutional knowledge" and "playbooks" and "runbooks" — and what they almost never actually build, because wikis go stale and Confluence rots. Lightning MCP doesn't need anyone to write a runbook. The runbook writes itself from what the team actually did.

Where we're going
Roadmap · Q2 2026 and beyond
NEXT · WEEKS
Full verbatim backfill across all tenants
Every historical memory and conscience entry re-indexed with full text in both Postgres and Qdrant. Finishing the transition from lossy extraction to true verbatim across every live Unimatrix.
Q2 · 2026
Per-tenant fine-tuned models
Use the distilled fix knowledge as training data for tenant-specific model adapters. Your Unimatrix stops being just a memory store — it becomes a model that already knows how your team operates.
Q3 · 2026
Commercial MSP onboarding
Lightning MCP as a managed service. Your company gets a Unimatrix, your team gets accounts, we run the infrastructure. First external tenants already onboarded in validation; general availability next quarter.
ONGOING
400+ MCP tools, growing
Every new integration becomes a tool in the platform — ITSM, home automation, voice, email, calendars, cloud DNS, code validation, monitoring, SSH, Kubernetes, Docker. All accessible through one instruction: "do the thing."

This is Lightning MCP.

Built for real infrastructure, running in production, backing real tenants. If you're tired of explaining your project to the AI every morning — we should talk.