RC-aux: A Lightweight Correction for Latent World Model Planning
A new paper on arXiv (2605.07278) introduces the Reachability-Correction auxiliary objective (RC-aux) to address spatiotemporal mismatch in latent world models. Such models can predict accurately over short horizons, yet their latent spaces are often poorly aligned with long-horizon planning: Euclidean distance between latent states may not reflect whether one state is reachable from another within a finite action budget. RC-aux adds planning-aligned supervision along two axes. Along the time axis, it adds multi-horizon open-loop prediction; along the space axis, it adds budget-conditioned reachability supervision with temporal hard negatives. The correction leaves the world-model backbone unchanged and is designed for reconstruction-free latent world models. The stated aim is to identify which states are eventually reachable, and so to improve goal-directed search.
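The gap between Euclidean distance and reachability under an action budget can be seen in a toy setting. The sketch below is not from the paper: the grid layout, the function names `steps_to_reach` and `reachable_within`, and the 4-connected movement rule are all assumptions made for illustration.

```python
from collections import deque
import math

# Hypothetical 3x5 grid for illustration: '#' is a wall, '.' is free space.
GRID = [
    ".....",
    "####.",
    ".....",
]

def steps_to_reach(grid, start, goal):
    """Minimum number of 4-connected moves from start to goal, or None if unreachable."""
    rows, cols = len(grid), len(grid[0])
    queue = deque([(start, 0)])
    seen = {start}
    while queue:
        (r, c), dist = queue.popleft()
        if (r, c) == goal:
            return dist
        for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            nr, nc = r + dr, c + dc
            inside = 0 <= nr < rows and 0 <= nc < cols
            if inside and grid[nr][nc] != "#" and (nr, nc) not in seen:
                seen.add((nr, nc))
                queue.append(((nr, nc), dist + 1))
    return None

def reachable_within(grid, start, goal, budget):
    """True if goal can be reached from start in at most `budget` moves."""
    steps = steps_to_reach(grid, start, goal)
    return steps is not None and steps <= budget

start, goal = (0, 0), (2, 0)
euclidean = math.dist(start, goal)          # straight-line distance, ignores the wall
steps = steps_to_reach(GRID, start, goal)   # actual number of moves around the wall
print(euclidean, steps, reachable_within(GRID, start, goal, budget=5))
```

Here the two cells are 2.0 apart in Euclidean terms, but the wall forces a 10-move detour, so the goal is not reachable under a budget of 5. A planner that ranks candidate states by Euclidean distance alone would treat the goal as close.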
Key facts
- Paper arXiv:2605.07278 introduces RC-aux for latent world models.
- RC-aux addresses spatiotemporal mismatch in planning.
- The latent spaces of these models are often poorly aligned with long-horizon planning.
- RC-aux adds multi-horizon open-loop prediction along the time axis.
- Budget-conditioned reachability supervision is added along the space axis.
- Temporal hard negatives are used in the reachability supervision to help distinguish reachable states.
- RC-aux is a lightweight correction that keeps the backbone unchanged.
- The method targets reconstruction-free latent world models.
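The two supervision signals listed above could be sketched roughly as follows. This summary does not give the paper's exact losses or sampling scheme, so everything here is an assumption: the function names `open_loop_errors` and `reachability_pairs`, the squared-error measure, and in particular the rule that a later state from the same trajectory beyond the budget counts as a temporal hard negative (step gap in a logged trajectory is only a proxy for true reachability).

```python
import random

def open_loop_errors(predict, latents, actions, horizons):
    """Roll `predict` forward from latents[0], feeding back its own outputs
    (open loop), and return the squared error against latents[h] per horizon h."""
    errors = {}
    z = latents[0]
    for step in range(1, max(horizons) + 1):
        z = predict(z, actions[step - 1])
        if step in horizons:
            errors[step] = sum((a - b) ** 2 for a, b in zip(z, latents[step]))
    return errors

def reachability_pairs(trajectory_len, budget, rng):
    """Sample (anchor, other, label) index triples from a single trajectory.
    label 1: `other` lies within `budget` steps after `anchor`.
    label 0: assumed temporal hard negative, same trajectory but beyond the budget."""
    last = trajectory_len - 1
    pairs = []
    for t in range(last):
        pos_hi = min(t + budget, last)
        pairs.append((t, rng.randint(t + 1, pos_hi), 1))
        if pos_hi < last:
            pairs.append((t, rng.randint(pos_hi + 1, last), 0))
    return pairs

# Toy 1-D latents: true dynamics are z + a; the hypothetical model undershoots.
latents = [[float(i)] for i in range(5)]
actions = [1.0] * 4
model = lambda z, a: [z[0] + 0.9 * a]

errors = open_loop_errors(model, latents, actions, horizons={1, 2, 4})
pairs = reachability_pairs(trajectory_len=10, budget=3, rng=random.Random(0))
print(errors)
print(pairs[:4])
```

In this toy run the open-loop error grows with the horizon, which is the kind of drift that one-step training alone does not penalize. The sampled pairs carry the budget in their labels, so a classifier or distance head trained on them (not shown) would be conditioned on the same budget.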
Entities
Institutions
- arXiv