Ingest and Distill
Raw events are normalized, deduplicated, and expanded into durable memories with lineage.
- Companion memories
- Dedup table
- Graph relationships
Hybrid vector + BM25 search, knowledge graphs, deterministic analytics, and fact supersession in a single Rust binary. Self-host or deploy on our platform with one click.
Experience Tellodb's real-time ingestion and recall loop. Store a fact, then retrieve it across model contexts.
Write Pipeline
Demo Memory Session Key
Memories are stored and queried using this unique key. It isolates your demo session and persists across reloads.
Ingest Latency (Engine)
---
Ingest Latency (Network RTT)
---
Read Substrate
Query Latency (Engine)
---
Query Latency (Network RTT)
---
Retrieved Memory Hits
Vector databases are giant warehouses of static text. They find words, but they do not understand life. They lose context, ignore the passage of time, and drown in their own noise.
Retrieves conflicting data from 2 years ago exactly like data from 2 minutes ago. No concept of evolving truth.
Cannot accurately aggregate or count facts (e.g. 'How many cars do I own?'). Relies entirely on the LLM to do math.
Stores every single conversational 'uh' and 'um' instead of maintaining a clean, structured user profile.
When life changes (e.g. moving from NYC to SF), Tellodb marks the old fact as stale, ensuring the LLM always gets the latest truth.
Built-in execution layer accurately computes numeric and temporal queries before hitting the LLM, fixing benchmark failures.
Distills thousands of words into compact, continuous user profiles. We track the deltas, you save on context windows.
Tellodb resolves standard vector search failures. In memory tasks with high fact-density, the local hybrid architecture ensures precise recall.
When newer truths supersede older context, the engine automatically tags prior states as stale, filtering them out from active agent context.
Raw chat logs are noise. Tellodb acts as a cognitive filter, distilling human rambling into a clean, queryable lattice of facts.
I used to love coffee, but now I only drink tea. Tellodb does not hallucinate your old preferences. It updates your profile in real time.
Our engine automatically discards greetings and filler, keeping only the high-value semantic facts that actually matter for personalization.
Our White Mercedes engine ensures your user's identity is not locked inside a single chat window.
Hey! I just bought a white Mercedes! What should I do first?
GPT-4o detects: User Ownership → Vehicle: Mercedes (White)
Fact Integration
What was that maintenance tip for my car?
Claude 3.5 recalls: "For your white Mercedes, I recommend..."
Tellodb replaces complex, slow orchestration chains with a unified, high-performance cognitive database engine.
Engineered in Rust with sub-100ms p99 query latencies, zero GC pauses, and compiled as a single air-gapped binary.
Time-aware rankings and TTL decay policies automatically tag older context as stale when new conflicting facts arrive.
Built-in arithmetic and count aggregation calculated directly on the database B-Tree indexes before LLM delivery.
Switch cognitive backends (GPT-4, Claude 3.5, Llama 3) without losing memory state or query syntax history.
Drop-in proxy gateway that automatically injects relevant context into OpenAI-compatible system instructions. In development.
Self-organizing RDF typed graph representations mapping relationships between user sessions and profile history.
Recall Engine Latency
Built natively in Rust. Delivers sub-100ms queries under heavy semantic and full-text loads.
Runtime Configuration
engine: Tellodb routes: /ingest /query/semantic /query/temporal /memory indexes: hnsw + bm25 + graph lineage policy: ttl + decay + supersession sdk: python + javascript
Tellodb is not just storage; it is a multi-stage cognitive processor that transforms raw noise into reliable agentic state.
Automatically detects if the user is asking for numbers, preferences, or narrative history.
Applies a secondary precision pass to ensure the top-k candidates are semantically perfect.
Computes aggregates (sums, counts) before delivery, preventing LLM arithmetic errors.
Experience how Tellodb organizes memories. Drag nodes to interact with the underlying graph logic where new facts supersede the old.
Nodes represent discrete semantic facts, preferences, and entities stored within the Rust engine.
Red nodes indicate **superseded memories**—stale data that has been automatically invalidated by more recent truths.
The product has a clear progression: ingest fidelity, retrieval intelligence, and operational reliability.
Raw events are normalized, deduplicated, and expanded into durable memories with lineage.
Semantic and lexical candidates are fused, reranked, then filtered by temporal policy before response.
Teams deploy one memory engine surface from local bench runs to hosted multi-tenant workloads.
Book A Session
Discuss database architecture, memory integration, and private deployment setups for your agentic applications.
One binary, five memory substrates, zero lock-in. Start building persistent, self-improving agents today.
Open Source Engine
The Rust-powered temporal memory engine that runs anywhere. Hybrid vector + BM25 search, knowledge graph traversal, deterministic analytics, and fact supersession — all in a single binary. No vendor lock-in.
Managed Cloud Service
A full SaaS experience on top of the core engine. Deploy clusters in one click, manage your team, track usage with analytics, explore knowledge graphs visually, and never worry about infrastructure.
We provide the tooling to make persistent memory a first-class citizen in your development workflow, from local testing to global scale.
An OpenAI-compatible gateway that automatically injects memories into your agent's system prompt. Zero code changes required. In development.
Unified command-line tool to manage your engine, run local benchmarks, and monitor memory logs in real-time.
Built-in support for the Model Context Protocol. Connect Tellodb directly to Claude Code, Cursor, and agentic IDEs.