New Method Measures Black-Box Confidence via Reasoning Trajectories

other · 2026-05-09

Researchers have introduced a novel technique for assessing confidence in chain-of-thought reasoning using text-based APIs, known as the trajectory-confidence score. This system visualizes reasoning paths and evaluates consistency with external benchmarks through a unique softmax approach, avoiding the need for hidden states or calibration methods. Experiments across six diverse scenarios, such as MedQA-USMLE and GPQA Diamond, utilizing Gemini 3.1 Pro and Claude Sonnet 4.6, indicated that integrating this score with coverage and verbalized confidence at K=4 achieved superior outcomes compared to self-consistency at K=8, achieving a median AUC of 0.78 versus 0.71, with no vendor-related discrepancies observed.

Key facts

Method is a black-box trajectory-confidence score.
Embeds CoT as sliding-window trajectory.
Measures convergence to external answer anchors with one-parameter softmax.
No logits, hidden states, or supervised calibrators needed.
Evaluated on MedQA-USMLE, GPQA Diamond, MMLU-Pro.
Models: Gemini 3.1 Pro and Claude Sonnet 4.6.
Fuses trajectory score with coverage and verbalized-confidence channels at K=4.
Outperforms self-consistency at K=8 in all six settings.
Median AUC 0.78 vs 0.71, deltaAUC=+0.075.
Fixed-pick control (+0.060) and E5 cross-embedder replication confirm robustness.

New Method Measures Black-Box Confidence via Reasoning Trajectories

Key facts

Entities

Institutions

Sources