ARTFEED — Contemporary Art Intelligence

SymphonyGen: AI Framework for 3D Hierarchical Orchestral Music Generation

ai-technology · 2026-04-30

A new AI framework called SymphonyGen has been introduced, designed specifically for composing symphonic music. This innovative system, developed by researchers and detailed in a paper on arXiv (2604.25498), utilizes a 3D hierarchical approach, breaking music generation into three levels: Bar, Track, and Event. This method boosts efficiency over traditional 1D or 2D models. It includes a cascading decoder and uses 'short-score' conditioning with a beat-quantized multi-voice harmony skeleton for better control and varied textures. The model is fine-tuned with Group Relative Policy Optimization (GRPO) and a cross-modal audio-perceptual reward, ensuring that the music produced aligns with modern acoustic standards, while a dissonance-averse sampling algorithm enhances quality and addresses complexity issues found in existing models.

Key facts

  • SymphonyGen is a 3D hierarchical framework for symphonic music generation.
  • It decomposes generation along Bar, Track, and Event axes.
  • Uses a cascading decoder architecture for improved efficiency.
  • Introduces 'short-score' conditioning with a beat-quantized multi-voice harmony skeleton.
  • Refined using Group Relative Policy Optimization (GRPO) with cross-modal audio-perceptual reward.
  • Implements a dissonance-averse sampling algorithm.
  • Targets contemporary cinematic orchestration.
  • Paper available on arXiv with ID 2604.25498.

Entities

Institutions

  • arXiv

Sources