ZeNO: Gradient-Free Noise Optimization for Generative Model Alignment

ai-technology · 2026-05-13

A new method called ZeNO (Zeroth-order Noise Optimization) enables reward alignment in generative models without backpropagation. Developed for diffusion and flow models, ZeNO treats noise optimization as a path-integral control problem solvable via zeroth-order reward evaluations. Using an Ornstein-Uhlenbeck reference process, the update implicitly targets a reward-tilted distribution via Langevin dynamics. The framework supports inference-time scaling and performs well across diverse generators and reward functions, including protein structure generation where backpropagation is infeasible. The paper is available on arXiv under reference 2605.11347.

Key facts

ZeNO is a gradient-free framework for reward alignment in generative models.
It formulates noise optimization as a path-integral control problem.
The method uses zeroth-order reward evaluations without backpropagation.
It instantiates an Ornstein-Uhlenbeck reference process.
The update connects to Langevin dynamics targeting a reward-tilted distribution.
ZeNO enables effective inference-time scaling.
It demonstrates strong performance on protein structure generation.
The paper is published on arXiv with ID 2605.11347.

ZeNO: Gradient-Free Noise Optimization for Generative Model Alignment

Key facts

Entities

Institutions

Sources