SymphonyGen: AI Framework for 3D Hierarchical Orchestral Music Generation
A new AI framework called SymphonyGen has been introduced, designed specifically for composing symphonic music. This innovative system, developed by researchers and detailed in a paper on arXiv (2604.25498), utilizes a 3D hierarchical approach, breaking music generation into three levels: Bar, Track, and Event. This method boosts efficiency over traditional 1D or 2D models. It includes a cascading decoder and uses 'short-score' conditioning with a beat-quantized multi-voice harmony skeleton for better control and varied textures. The model is fine-tuned with Group Relative Policy Optimization (GRPO) and a cross-modal audio-perceptual reward, ensuring that the music produced aligns with modern acoustic standards, while a dissonance-averse sampling algorithm enhances quality and addresses complexity issues found in existing models.
Key facts
- SymphonyGen is a 3D hierarchical framework for symphonic music generation.
- It decomposes generation along Bar, Track, and Event axes.
- Uses a cascading decoder architecture for improved efficiency.
- Introduces 'short-score' conditioning with a beat-quantized multi-voice harmony skeleton.
- Refined using Group Relative Policy Optimization (GRPO) with cross-modal audio-perceptual reward.
- Implements a dissonance-averse sampling algorithm.
- Targets contemporary cinematic orchestration.
- Paper available on arXiv with ID 2604.25498.
Entities
Institutions
- arXiv