ARTFEED — Contemporary Art Intelligence

Gaussian Mixture Models Improve DDIM Sampling Quality

other · 2026-05-23

A new approach enhances the speed of sampling from pre-trained Denoising Diffusion Probabilistic Models (DDPM) by using a Gaussian Mixture Model (GMM) as the reverse transition kernel within the Denoising Diffusion Implicit Models (DDIM) setup. This technique ensures that the first and second order central moments of DDPM forward marginals match by applying constraints to the GMM parameters. When tested on various models, including unconditional ones from CelebAHQ and FFHQ, as well as class-conditional models from ImageNet and text-to-image generation with Stable Diffusion v2.1 on COYO700M, the results showed that moment matching can produce samples that are just as good or better than those from the original DDIM with Gaussian kernels, especially when fewer sampling steps are taken.

Key facts

  • Proposes GMM as reverse transition kernel in DDIM framework
  • Matches first and second order central moments of DDPM forward marginals
  • Tested on CelebAHQ, FFHQ, ImageNet, and COYO700M datasets
  • Uses Stable Diffusion v2.1 for text-to-image generation
  • GMM kernel improves sample quality with fewer sampling steps
  • Moment matching sufficient for quality equal or better than original DDIM
  • arXiv paper ID: 2311.04938
  • Announcement type: replace-cross

Entities

Institutions

  • arXiv

Sources