ARTFEED — Contemporary Art Intelligence

Cross-scale Alignment Problem in GAN Training Identified

other · 2026-05-27

A recent study questions the understanding of multi-stage GAN synthesis as a hierarchical process from coarse to fine. The researchers contend that traditional adversarial supervision by scale does not create an effective hierarchy, as each intermediate image is independently aligned with the real distribution at its specific resolution, failing to guarantee that outputs from different stages correspond to the same sample. Furthermore, the image generated at each stage is not utilized as a clear refinement goal for the next stage, which permits later stages to deviate. This issue is referred to as cross-scale alignment. The findings can be found on arXiv.

Key facts

  • Paper challenges coarse-to-fine interpretation of multi-stage GANs
  • Standard scale-wise adversarial supervision does not ensure identical sample across stages
  • Scale-specific outputs are not used as refinement targets for subsequent stages
  • Problem termed cross-scale alignment
  • Paper available on arXiv

Entities

Institutions

  • arXiv

Sources