Serialization Friction: LLMs Struggle with 2D Tasks in 1D Inputs

ai-technology · 2026-05-01

A new study from arXiv (2604.27272) introduces the concept of "serialization friction" in large language models (LLMs). Researchers found that processing structured 2D data as 1D token sequences imposes a representational burden on tasks like matrix transpose, Conway's Game of Life, and LU decomposition. They compared a text-only pathway with a vision-augmented pathway on the same language backbone, where the latter received content in a 2D layout. The visual pathway consistently outperformed the text-only one, suggesting that linearization disrupts row-column alignment and local neighborhood recognition. The study provides a diagnostic testbed for understanding this limitation.

Key facts

The study is from arXiv preprint 2604.27272.
LLMs process structured inputs as 1D token sequences.
Serialization friction refers to the added difficulty from linearizing 2D structures.
Tasks tested include matrix transpose, Conway's Game of Life, and LU decomposition.
A vision-augmented pathway was compared to a text-only pathway.
Both pathways used the same language backbone.
The visual pathway consistently outperformed the text-only pathway.
The study provides a diagnostic testbed for serialization friction.

Serialization Friction: LLMs Struggle with 2D Tasks in 1D Inputs

Key facts

Entities

Institutions

Sources