ACE-MAPPO: A Hybrid Framework for Cooperative Air Combat
To tackle the difficulties in multi-agent reinforcement learning for cooperative air combat beyond visual range, researchers have introduced the Adversarial Curriculum and Evolutionary-enhanced Multi-agent Proximal Policy Optimization (ACE-MAPPO). This innovative framework combines evolutionary algorithms with MAPPO, enhancing exploration efficiency, sample usage, and policy generalization. It features a genetic soft update mechanism that promotes population diversity and prevents local optima, alongside an evolutionary-augmented prioritized trajectory replay strategy that increases sample efficiency. The focus of this research is on enabling autonomous decision-making for unmanned combat aerial vehicles operating in complex, adversarial settings.
Key facts
- ACE-MAPPO integrates evolutionary algorithms with MAPPO.
- Genetic soft update mechanism enhances population diversity.
- Evolutionary-augmented prioritized trajectory replay improves sample utilization.
- Addresses limitations in existing MARL methods for air combat.
- Focuses on beyond-visual-range multi-aircraft cooperative engagements.
- Aims to improve exploration efficiency and policy generalization.
- Targets autonomous decision-making for UCAVs.
- Published on arXiv with ID 2605.25091.
Entities
Institutions
- arXiv