ARTFEED — Contemporary Art Intelligence

New Method Measures Black-Box Confidence via Reasoning Trajectories

other · 2026-05-09

Researchers have introduced a novel technique for assessing confidence in chain-of-thought reasoning using text-based APIs, known as the trajectory-confidence score. This system visualizes reasoning paths and evaluates consistency with external benchmarks through a unique softmax approach, avoiding the need for hidden states or calibration methods. Experiments across six diverse scenarios, such as MedQA-USMLE and GPQA Diamond, utilizing Gemini 3.1 Pro and Claude Sonnet 4.6, indicated that integrating this score with coverage and verbalized confidence at K=4 achieved superior outcomes compared to self-consistency at K=8, achieving a median AUC of 0.78 versus 0.71, with no vendor-related discrepancies observed.

Key facts

  • Method is a black-box trajectory-confidence score.
  • Embeds CoT as sliding-window trajectory.
  • Measures convergence to external answer anchors with one-parameter softmax.
  • No logits, hidden states, or supervised calibrators needed.
  • Evaluated on MedQA-USMLE, GPQA Diamond, MMLU-Pro.
  • Models: Gemini 3.1 Pro and Claude Sonnet 4.6.
  • Fuses trajectory score with coverage and verbalized-confidence channels at K=4.
  • Outperforms self-consistency at K=8 in all six settings.
  • Median AUC 0.78 vs 0.71, deltaAUC=+0.075.
  • Fixed-pick control (+0.060) and E5 cross-embedder replication confirm robustness.

Entities

Institutions

  • arXiv

Sources