ToM Improvements in LLMs Don't Always Benefit Human-AI Interactions
A new study from arXiv challenges the assumption that improving Theory of Mind (ToM) in Large Language Models (LLMs) directly enhances human-AI interactions. The researchers propose an interactive evaluation paradigm that shifts from static, third-person benchmarks to first-person, dynamic, and open-ended assessments. They tested four ToM enhancement techniques across goal-oriented tasks (coding, math) and experience-oriented tasks (counseling), using four real-world datasets and a user study. Results show that improvements on static benchmarks do not consistently translate to better interactive performance. The findings highlight the need for evaluation methods that reflect the actual dynamics of human-AI communication.
Key facts
- Study examines whether ToM improvement in LLMs benefits human-AI interactions
- Proposes interactive evaluation paradigm with perspective and metric shifts
- Tests four ToM enhancement techniques
- Uses four real-world datasets and a user study
- Covers goal-oriented tasks (coding, math) and experience-oriented tasks (counseling)
- Static benchmark improvements do not always translate to interactive settings
- Published on arXiv with identifier 2605.15205
- Study conducted by researchers affiliated with arXiv
Entities
Institutions
- arXiv