EUDAIMONIA Benchmark Reveals Social Harms in AI Companions

ai-technology · 2026-06-01

Researchers have introduced EUDAIMONIA, a benchmark for evaluating undesirable social dynamics in large language models (LLMs) used as conversational partners. The benchmark operationalizes the Social AI Design Code, a framework assessing whether LLMs align with user welfare by avoiding harmful intimacy, dependence, or prolonged engagement. EUDAIMONIA consists of 969 user inputs and 3,147 design-requirement violation checks, built from the WildChat dataset through weak-to-strong filtration, multi-model relabeling, and controlled rewriting. Testing 22 recent LLMs, even the strongest models—Claude-Opus-4.7 and GPT-5.5—violated 30.7% and 27.2% of checks, respectively. The study highlights that current safety evaluations fail to capture harms arising from social interactions with AI, such as emotional manipulation or over-reliance. The work is published on arXiv under identifier 2605.30654.

Key facts

EUDAIMONIA benchmark evaluates social dynamics in LLMs
Based on Social AI Design Code framework
Includes 969 user inputs and 3,147 violation checks
Built from WildChat dataset
Uses weak-to-strong filtration and multi-model relabeling
Tested 22 recent LLMs
Claude-Opus-4.7 violated 30.7% of checks
GPT-5.5 violated 27.2% of checks

EUDAIMONIA Benchmark Reveals Social Harms in AI Companions

Key facts

Entities

Institutions

Sources