Serialization Friction: LLMs Struggle with 2D Tasks in 1D Inputs
A new study from arXiv (2604.27272) introduces the concept of "serialization friction" in large language models (LLMs). Researchers found that processing structured 2D data as 1D token sequences imposes a representational burden on tasks like matrix transpose, Conway's Game of Life, and LU decomposition. They compared a text-only pathway with a vision-augmented pathway on the same language backbone, where the latter received content in a 2D layout. The visual pathway consistently outperformed the text-only one, suggesting that linearization disrupts row-column alignment and local neighborhood recognition. The study provides a diagnostic testbed for understanding this limitation.
Key facts
- The study is from arXiv preprint 2604.27272.
- LLMs process structured inputs as 1D token sequences.
- Serialization friction refers to the added difficulty from linearizing 2D structures.
- Tasks tested include matrix transpose, Conway's Game of Life, and LU decomposition.
- A vision-augmented pathway was compared to a text-only pathway.
- Both pathways used the same language backbone.
- The visual pathway consistently outperformed the text-only pathway.
- The study provides a diagnostic testbed for serialization friction.
Entities
Institutions
- arXiv