Prover-Verifier Deliberation Improves LLM Selective Prediction

ai-technology · 2026-05-26

A novel protocol for inference time, termed prover-verifier deliberation (PVD), has been developed to enhance selective prediction in extensive language models. This approach is based on interactive proof theory, where a prover substantiates a proposed answer with verifiable sub-claims, while a verifier poses specific challenges and responds with Accept, Challenge, or Reject. This system enables the reporting of high-confidence responses while allowing for abstention in uncertain scenarios. The primary experiment employs Claude Sonnet 4.6 as the prover and Claude Haiku 4.5 as the verifier, utilizing the GPQA dataset. The protocol is empirically assessed through its coverage-precision dynamics, as formal guarantees of soundness and completeness do not apply due to the limitations of models operating over a noisy channel.

Key facts

PVD is an inference-time protocol for selective prediction.
It is grounded in interactive proof theory.
A prover defends a candidate answer with checkable sub-claims.
A verifier issues targeted challenges and returns Accept, Challenge, or Reject.
The protocol allows reporting high-confidence answers and abstaining on uncertain cases.
Main experiment uses Claude Sonnet 4.6 as prover and Claude Haiku 4.5 as verifier.
The experiment is conducted on the GPQA dataset.
Formal soundness and completeness guarantees do not transfer due to imperfect models.

Entities

—

Sources

arXiv cs.AI — 2026-05-26