ARC-RL: Reinforcement Learning Environments for Game-Style Robot Morphologies
ARC-RL introduces a collection of four continuous-control environments based on MuJoCo, showcasing robotic designs inspired by the bestiary of the ARC Raiders game. The lineup features the 18-DoF tall hexapod named Queen, the 12-DoF armored hexapod called Bastion, the compact 18-DoF hexapod Tick, and the 12-DoF quadruped known as Leaper. Each robot utilizes a consistent observation framework, action protocol, simulation timing, and a singular closed-form reward function that varies slightly in weights and parameters across morphologies. This reward combines a velocity-tracking component with a survival bonus. This initiative seeks to bridge the divide between legged locomotion studies, which typically utilize real-world robotic designs, and game NPCs that often lack real-world equivalents. The environments serve as a testing ground for reinforcement learning, highlighting stylistic constraints not present in traditional sim-to-real robotics.
Key facts
- ARC-RL introduces four MuJoCo environments based on ARC Raiders creatures.
- The robots are Queen (18-DoF tall hexapod), Bastion (12-DoF armoured hexapod), Tick (18-DoF compact hexapod), and Leaper (12-DoF quadruped).
- All environments share unified observation, action, and simulation settings.
- A single reward function is used with per-morphology weight variations.
- The reward includes velocity-tracking and survival components.
- The work targets game NPCs with stylistic constraints not found in real robots.
- The paper is available on arXiv with ID 2605.19503.
- The research was announced in May 2025.
Entities
Institutions
- arXiv