TinyLM Achieves 96.1% on ARC-AGI-2 Training Set Using Multi-Perspective Transformers
A recent machine-learning paper presents a multi-perspective transformer method for ARC-AGI-2, a benchmark of human-intuitive visual puzzles that evaluates a machine's ability to generalize from few examples, interpret symbolic meaning, and adapt rules across different contexts. The authors fine-tune TinyLM at test time, combining Test-Time Training (TTT) with a Product of Experts (PoE) over multiple perspectives. The approach reaches 96.1% accuracy on the training set and 21.7% on the evaluation set.
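The Product of Experts idea can be illustrated with a minimal sketch: each "expert" (e.g. a model view under a different grid perspective) outputs a probability distribution over possible cell values, and the distributions are multiplied elementwise and renormalized. This is a generic PoE illustration, not the paper's actual implementation; the expert distributions and class count here are invented toy values.

```python
import numpy as np

def product_of_experts(expert_probs):
    """Combine expert distributions by elementwise product (PoE).

    expert_probs: array of shape (n_experts, n_classes), each row a
    probability distribution. Returns one renormalized distribution.
    """
    combined = np.prod(expert_probs, axis=0)
    return combined / combined.sum()

# Three toy experts voting over 4 possible cell colors.
experts = np.array([
    [0.7, 0.1, 0.1, 0.1],
    [0.6, 0.2, 0.1, 0.1],
    [0.5, 0.3, 0.1, 0.1],
])
poe = product_of_experts(experts)
print(poe.argmax())  # -> 0 (the color all experts favor dominates)
```

Because the product sharply penalizes any class that even one expert considers unlikely, PoE tends to be more conservative than simple averaging, which suits puzzle tasks where a single wrong cell invalidates the answer.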
Key facts
- ARC-AGI-2 is a benchmark of human-intuitive visual puzzles
- The approach uses TinyLM with fine-tuning at test time
- Test-Time Training (TTT) and a Product of Experts (PoE) are employed
- Model achieves 96.1% accuracy on training set
- Model achieves 21.7% accuracy on evaluation set
- The paper is titled 'Multi-Perspective Transformers in ARC-AGI-2 Challenge'
- The paper is categorized under Computer Science > Machine Learning
- The paper is available on arXiv with ID 2605.01154
Entities
Institutions
- arXiv