KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions
Abstract
Long-horizon memory benchmarks based on autobiographical narratives evaluate models' ability to infer stable motivations and decision principles through evidence-linked questions spanning factual recall, subjective state attribution, and principle-level reasoning.
Existing long-horizon memory benchmarks mostly use multi-turn dialogues or synthetic user histories, which makes retrieval performance an imperfect proxy for person understanding. We present \BenchName, a publicly releasable benchmark built from long-form autobiographical narratives, where actions, context, and inner thoughts provide dense evidence for inferring stable motivations and decision principles. \BenchName~reconstructs each narrative into a flashback-aware, time-anchored stream and evaluates models with evidence-linked questions spanning factual recall, subjective state attribution, and principle-level reasoning. Across diverse narrative sources, retrieval-augmented systems mainly improve factual accuracy, while errors persist on temporally grounded explanations and higher-level inferences, highlighting the need for memory mechanisms beyond retrieval. Our data is in KnowMeBench{https://github.com/QuantaAlpha/KnowMeBench}.
Community
know me
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- RealMem: Benchmarking LLMs in Real-World Memory-Driven Interaction (2026)
- HiMem: Hierarchical Long-Term Memory for LLM Long-Horizon Agents (2026)
- Structured Episodic Event Memory (2026)
- MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents (2026)
- ES-Mem: Event Segmentation-Based Memory for Long-Term Dialogue Agents (2026)
- Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents (2026)
- CloneMem: Benchmarking Long-Term Memory for AI Clones (2026)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 0
No model linking this paper
Datasets citing this paper 1
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper