New Method Detects Hallucinations in LLM Reasoning Steps
Researchers propose a method for detecting hallucinations in large language models during multi-step reasoning by analyzing hidden-state trajectories. A label-conditioned teacher builds a contrastive-PCA lens over those trajectories, and a BiLSTM student is trained for deployment. The method pinpoints the first error as a localized excursion in transport cost away from a stable manifold of coherent transitions.
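The paper's exact lens construction is not reproduced here; the following is a minimal sketch of standard contrastive PCA, assuming the label-conditioned teacher contrasts hidden states drawn from hallucinated and faithful reasoning steps. All names (`contrastive_pca_lens`, `h_halluc`, `h_faithful`, `alpha`) are illustrative, not from the paper.

```python
import numpy as np

def contrastive_pca_lens(h_halluc, h_faithful, k=8, alpha=1.0):
    """Top-k directions along which hallucinated-step hidden states
    vary more than faithful-step ones (standard contrastive PCA)."""
    a = h_halluc - h_halluc.mean(axis=0)      # (n_pos, d_hidden)
    b = h_faithful - h_faithful.mean(axis=0)  # (n_neg, d_hidden)
    cov_a = a.T @ a / (len(a) - 1)
    cov_b = b.T @ b / (len(b) - 1)
    # Contrast the covariances; the difference is symmetric, so eigh applies.
    evals, evecs = np.linalg.eigh(cov_a - alpha * cov_b)
    order = np.argsort(evals)[::-1]
    return evecs[:, order[:k]]                # (d_hidden, k) projection lens

# A trajectory of per-step hidden states is then projected through the lens:
# z = trajectory @ lens                       # (n_steps, k)
```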
Key facts
- arXiv:2605.13772v1
- Hallucination detection at step level
- Hidden-state trajectory analysis
- Contrastive PCA lens
- BiLSTM student model (sketched after this list)
- Single forward pass required
- Transport-separation objective
- First-error localization
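The student's architecture and threshold are not specified in this summary, so the sketch below assumes a small bidirectional LSTM that scores every step of a lens-projected trajectory in one forward pass, with the first error read off as the earliest score crossing a threshold, standing in for the paper's transport-cost excursion. `BiLSTMStudent`, `first_error_step`, `d_in`, and `tau` are hypothetical.

```python
import torch
import torch.nn as nn

class BiLSTMStudent(nn.Module):
    """Scores each reasoning step from the lens-projected hidden-state
    trajectory in a single forward pass (hypothetical student)."""
    def __init__(self, d_in=8, d_hid=64):
        super().__init__()
        self.rnn = nn.LSTM(d_in, d_hid, batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * d_hid, 1)

    def forward(self, z):                  # z: (batch, n_steps, d_in)
        out, _ = self.rnn(z)               # (batch, n_steps, 2 * d_hid)
        return self.head(out).squeeze(-1)  # per-step scores

def first_error_step(scores, tau=0.0):
    """Earliest step whose score exceeds tau, read as the first localized
    excursion from the coherent-transition regime; -1 if none."""
    hits = (scores > tau).nonzero(as_tuple=True)[0]
    return int(hits[0]) if hits.numel() else -1
```

Scoring every step rather than the sequence as a whole is what makes single-pass first-error localization possible: one forward pass yields the full score trajectory, and localization reduces to a threshold scan.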