🧠 EgoMemReason – Leaderboard

Memory-driven reasoning over week-long egocentric video. 500 MCQs · Entity / Event / Behavior memory.

EgoMemReason

A Memory-driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding.

EgoMemReason is a 500-question multiple-choice benchmark over week-long egocentric videos (built on EgoLife). Models must answer questions whose evidence is sparsely distributed across hours or days, exercising three memory types:

  • Entity memory – Cumulative State Tracking, Temporal Counting
  • Event memory – Event Ordering, Event Linking
  • Behavior memory – Spatial Preference Inference, Activity Pattern Inference

500 Qs · avg. 5.1 evidence segments per question · avg. 25.9 h of memory backtracking. The strongest model in the paper reaches 39.6% Overall.

Submission

Upload a JSON file with 500 entries:

[
  {"example_id": 1, "predicted_answer": "A"},
  ...
]

Questions have 4–10 options (letters A–J); predicted_answer must be a letter that appears in that question's options dict. See SUBMISSION_FORMAT.md for the full spec.

License

  • Annotations (this Space and the public dataset): CC BY-NC 4.0.
  • Video frames: governed by the EgoLife data license – you must accept their terms separately.

Citation

@misc{wang2026egomemreasonmemorydrivenreasoningbenchmark,
      title={EgoMemReason: A Memory-Driven Reasoning Benchmark for Long-Horizon Egocentric Video Understanding},
      author={Ziyang Wang and Yue Zhang and Shoubin Yu and Ce Zhang and Zengqi Zhao and Jaehong Yoon and Hyunji Lee and Gedas Bertasius and Mohit Bansal},
      year={2026},
      eprint={2605.09874},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2605.09874},
}