BALAR: Bayesian Agentic Loop Enhances LLM Active Reasoning
A new algorithm called BALAR (Bayesian Agentic Loop for Active Reasoning) enables large language models to conduct structured multi-turn interactions by maintaining beliefs over latent states and selecting clarifying questions via expected mutual information. The task-agnostic, fine-tuning-free outer-loop method dynamically expands state representations when needed. Evaluated on AR-Bench-DC, AR-Bench-SP, and iCraft-MD benchmarks, BALAR outperformed all baselines across tasks including detective cases, thinking puzzles, and clinical diagnosis.
Key facts
- BALAR is a Bayesian Agentic Loop for Active Reasoning
- It is a task-agnostic outer-loop algorithm requiring no fine-tuning
- Maintains structured belief over latent states
- Selects clarifying questions by maximizing expected mutual information
- Dynamically expands state representation when insufficient
- Evaluated on AR-Bench-DC, AR-Bench-SP, and iCraft-MD benchmarks
- Significantly outperforms all baselines across all benchmarks
Entities
Institutions
- arXiv