Alice: A System for Online Executable World Model Learning Under Prior Misalignment

other · 2026-05-20

A recent preprint on arXiv presents Alice, a closed-loop framework designed for online executable world-model learning in situations where prior knowledge is not aligned. The system interprets unsuccessful candidate updates as structural indicators: when a candidate accounts for a new transition but fails to retain previously explained ones, the resulting preservation conflict uncovers dynamics that were previously mixed by the current program. Alice transforms these conflicts into hypothesis classes that yield compact, class-stratified preservation counterexamples for updates, facilitating exploration towards new and less represented transitions. This research tackles the problem of deriving state-dependent dynamics solely from interaction data, without relying on rule descriptions, reward signals, or reliable lexical priors.

Key facts

arXiv:2605.16725v1
Announce Type: new
Alice is a closed-loop system
Treats failed candidate updates as structural signal
Preservation conflict reveals conflated dynamics
Refines conflicts into hypothesis classes
Provides compact preservation counterexamples
Guides frontier exploration toward novel transitions

Alice: A System for Online Executable World Model Learning Under Prior Misalignment

Key facts

Entities

Institutions

Sources