LLM-Based Pathos Analysis Outperforms Acoustic Emotion Recognition in Political Speech

other · 2026-05-23

A study from arXiv (2605.22732) compares acoustic emotion recognition models with large language models (LLMs) for analyzing Pathos in political speech. Using a Bundestag speech by Felix Banaszak (51 segments, 245 seconds), researchers tested three modalities: emotion2vec_plus_large (acoustic SER with Russell Circumplex projection), Gemini 2.5 Flash (LLM analyzing audio and transcript), and TRUST-Pathos (multi-agent LLM supervisor ensemble). Spearman correlations show Gemini Valence strongly correlates with TRUST-Pathos (rho=+0.664, p<0.001), while emotion2vec Valence does not (rho=+0.097, p=0.499). The TRUST pipeline uses three LLM advocates for pathos scoring. Findings suggest LLMs are more effective proxies for rhetorical pathos than traditional acoustic models.

Key facts

arXiv paper 2605.22732 compares acoustic emotion recognition and LLMs for political speech pathos analysis
Case study uses Bundestag speech by Felix Banaszak (51 segments, 245 seconds)
Three modalities tested: emotion2vec_plus_large, Gemini 2.5 Flash, TRUST-Pathos
Gemini Valence correlates strongly with TRUST-Pathos (rho=+0.664, p<0.001)
emotion2vec Valence shows no significant correlation (rho=+0.097, p=0.499)
TRUST-Pathos uses three-advocate LLM supervisor ensemble
Acoustic SER model uses post-hoc Russell Circumplex projection for Arousal and Valence
LLM-based analysis outperforms acoustic models for pathos detection

Entities

Institutions

arXiv
Bundestag
TRUST

Locations

Germany

Sources

arXiv cs.AI — 2026-05-23