MotionCache: Efficient Autoregressive Video Generation via Motion-Aware Caching
A new method called MotionCache addresses the computational burden of autoregressive video generation by introducing motion-aware cache reuse. Existing cache strategies skip denoising steps at a coarse chunk level, failing to account for pixel motion: high-motion pixels require more denoising to avoid errors, while static pixels tolerate aggressive skipping. MotionCache formalizes this by linking cache errors to residual instability and uses inter-frame differences as a lightweight proxy for pixel-level motion. It employs a coarse-to-fine strategy with an initial warm-up phase for semantic coherence, followed by motion-weighted caching. The approach is detailed in arXiv paper 2605.01725.
Key facts
- MotionCache is a motion-aware cache framework for autoregressive video generation.
- Existing cache reuse methods use coarse-grained chunk-level skipping that ignores fine-grained pixel dynamics.
- High-motion pixels require more denoising steps to prevent error accumulation.
- Static pixels tolerate aggressive skipping.
- MotionCache uses inter-frame differences as a lightweight proxy for pixel-level motion characteristics.
- The framework employs a coarse-to-fine strategy with an initial warm-up phase.
- The method is described in arXiv paper 2605.01725.
- The paper was announced on arXiv on May 26, 2025.
Entities
Institutions
- arXiv