ToM Improvements in LLMs Don't Always Benefit Human-AI Interactions

ai-technology · 2026-05-18

A new study from arXiv challenges the assumption that improving Theory of Mind (ToM) in Large Language Models (LLMs) directly enhances human-AI interactions. The researchers propose an interactive evaluation paradigm that shifts from static, third-person benchmarks to first-person, dynamic, and open-ended assessments. They tested four ToM enhancement techniques across goal-oriented tasks (coding, math) and experience-oriented tasks (counseling), using four real-world datasets and a user study. Results show that improvements on static benchmarks do not consistently translate to better interactive performance. The findings highlight the need for evaluation methods that reflect the actual dynamics of human-AI communication.

Key facts

Study examines whether ToM improvement in LLMs benefits human-AI interactions
Proposes interactive evaluation paradigm with perspective and metric shifts
Tests four ToM enhancement techniques
Uses four real-world datasets and a user study
Covers goal-oriented tasks (coding, math) and experience-oriented tasks (counseling)
Static benchmark improvements do not always translate to interactive settings
Published on arXiv with identifier 2605.15205
Study conducted by researchers affiliated with arXiv

ToM Improvements in LLMs Don't Always Benefit Human-AI Interactions

Key facts

Entities

Institutions

Sources