BALAR: Bayesian Agentic Loop Enhances LLM Active Reasoning

ai-technology · 2026-05-09

A new algorithm called BALAR (Bayesian Agentic Loop for Active Reasoning) enables large language models to conduct structured multi-turn interactions by maintaining beliefs over latent states and selecting clarifying questions via expected mutual information. The task-agnostic, fine-tuning-free outer-loop method dynamically expands state representations when needed. Evaluated on AR-Bench-DC, AR-Bench-SP, and iCraft-MD benchmarks, BALAR outperformed all baselines across tasks including detective cases, thinking puzzles, and clinical diagnosis.

Key facts

BALAR is a Bayesian Agentic Loop for Active Reasoning
It is a task-agnostic outer-loop algorithm requiring no fine-tuning
Maintains structured belief over latent states
Selects clarifying questions by maximizing expected mutual information
Dynamically expands state representation when insufficient
Evaluated on AR-Bench-DC, AR-Bench-SP, and iCraft-MD benchmarks
Significantly outperforms all baselines across all benchmarks

BALAR: Bayesian Agentic Loop Enhances LLM Active Reasoning

Key facts

Entities

Institutions

Sources