ACE-MAPPO: A Hybrid Framework for Cooperative Air Combat

other · 2026-05-26

To tackle the difficulties in multi-agent reinforcement learning for cooperative air combat beyond visual range, researchers have introduced the Adversarial Curriculum and Evolutionary-enhanced Multi-agent Proximal Policy Optimization (ACE-MAPPO). This innovative framework combines evolutionary algorithms with MAPPO, enhancing exploration efficiency, sample usage, and policy generalization. It features a genetic soft update mechanism that promotes population diversity and prevents local optima, alongside an evolutionary-augmented prioritized trajectory replay strategy that increases sample efficiency. The focus of this research is on enabling autonomous decision-making for unmanned combat aerial vehicles operating in complex, adversarial settings.

Key facts

ACE-MAPPO integrates evolutionary algorithms with MAPPO.
Genetic soft update mechanism enhances population diversity.
Evolutionary-augmented prioritized trajectory replay improves sample utilization.
Addresses limitations in existing MARL methods for air combat.
Focuses on beyond-visual-range multi-aircraft cooperative engagements.
Aims to improve exploration efficiency and policy generalization.
Targets autonomous decision-making for UCAVs.
Published on arXiv with ID 2605.25091.

ACE-MAPPO: A Hybrid Framework for Cooperative Air Combat

Key facts

Entities

Institutions

Sources