DAGS: Data-Augmented Game Starts Accelerate Self-Play in Imperfect Information Games

other · 2026-05-16

A novel strategy for initiating multi-agent sampling, known as Data-Augmented Game Starts (DAGS), seeks to enhance online exploration within regularized policy-gradient methods for two-player zero-sum games. This technique leverages offline demonstrations from proficient human players to kickstart reinforcement learning data gathering at key intermediate states, thereby promoting the exploration of strategically important subgames. Experiments were carried out using synthetic datasets and analytically manageable, long-horizon control variants. This research tackles the computational challenges faced in large-scale imperfect-information competitive games such as StarCraft, Dota, and CounterStrike, which are hindered by sparse rewards and difficult long-horizon exploration. The paper can be accessed on arXiv under ID 2605.14379.

Key facts

DAGS stands for Data-Augmented Game Starts.
It targets two-player zero-sum (2p0s) games.
Offline demonstrations from skilled humans are used to sample intermediate states.
The method accelerates online exploration in regularized policy-gradient methods.
Experiments used synthetic datasets and long-horizon control variants.
The paper is on arXiv: 2605.14379.
Games mentioned: StarCraft, Dota, CounterStrike.
The approach addresses sparse rewards and long-horizon exploration challenges.

DAGS: Data-Augmented Game Starts Accelerate Self-Play in Imperfect Information Games

Key facts

Entities

Institutions

Sources