True Memory: A Retrieval-First Architecture for Agent Recall

ai-technology · 2026-05-07

A recent study posted on arXiv introduces True Memory, a six-layer framework designed for agent memory that emphasizes retrieval rather than storage. This architecture accurately retains events in their original form and operates on standard CPUs with a single SQLite file, negating the necessity for external databases, vector indexes, graph stores, or GPUs. In evaluations using the LoCoMo benchmark (1,540 questions from 10 multi-session conversations), True Memory Pro achieved an impressive 93.0% accuracy (3-run mean), surpassing Mem0 (61.4%), Supermemory (65.4%), Zep (~71%), and EverMemOS (94.5%) with a matched gpt-4.1-mini answer model. It also scored 87.8% on LongMemEval (500 questions) and 76.6% on BEAM-1M (700 questions at the 1-million-token scale), exceeding Hindsight's previous best of 73.9%. The authors contend that extracting data during ingestion is an ineffective approach for agent memory since any discarded content before a query cannot be retrieved later.

Key facts

True Memory is a six-layer architecture for agent memory.
It shifts focus from storage schema to multi-stage retrieval pipeline.
Runs on commodity CPU with a single SQLite file, no external DB, vector index, graph store, or GPU.
On LoCoMo, True Memory Pro achieved 93.0% accuracy (3-run mean).
Outperformed Mem0 (61.4%), Supermemory (65.4%), Zep (~71%), and EverMemOS (94.5%) on LoCoMo.
On LongMemEval, True Memory Pro reached 87.8% (3-run mean).
On BEAM-1M, True Memory Pro scored 76.6% (3-run mean), above Hindsight's 73.9%.
Paper argues extraction at ingestion is wrong for agent memory.

Entities

Institutions

arXiv
LoCoMo
LongMemEval
BEAM-1M
Mem0
Supermemory
Zep
EverMemOS
Hindsight

Sources

arXiv cs.AI — 2026-05-07