ARTFEED — Contemporary Art Intelligence

Mode-as-Sequence: New Framework for Multimodal Motion Prediction

other · 2026-05-26

A research paper on arXiv proposes Mode-as-Sequence, a unified decoding framework for multimodal motion forecasting. The framework translates an unordered mode set into an ordered mode sequence, explicitly modeling mode-to-mode dependency to address mode collapse and unreliable confidence ranking. Two instantiations are developed: ModeSeq, which performs recurrent mode decoding, and Parallel ModeSeq, which uses masked self-attention for parallel processing. The paper is available at arXiv:2605.24037.

Key facts

  • arXiv paper 2605.24037
  • Mode-as-Sequence framework
  • multimodal motion forecasting
  • mode collapse addressed
  • ModeSeq instantiation
  • Parallel ModeSeq instantiation
  • recurrent mode decoding
  • masked mode-to-mode self-attention

Entities

Institutions

  • arXiv

Sources