ARTFEED — Contemporary Art Intelligence

VERTIGO: AI Framework Optimizes Cinematic Camera Trajectories via Visual Preference

ai-technology · 2026-04-30

VERTIGO has been unveiled by researchers as the inaugural framework aimed at optimizing visual preferences for camera trajectory generators. Utilizing a real-time graphics engine, Unity, the system produces 2D previews based on the generated camera movements, which are evaluated by a vision-language model refined for cinematic purposes through a cyclic semantic similarity approach. This method ensures that the renders correspond with text prompts, effectively tackling challenges such as inadequate framing and characters appearing off-screen in current generative camera systems. The findings are elaborated in a paper available on arXiv (2604.02467v3).

Key facts

  • VERTIGO is the first framework for visual preference optimization of camera trajectory generators.
  • It leverages Unity to render 2D visual previews from generated camera motion.
  • A cinematically fine-tuned vision-language model scores previews using cyclic semantic similarity.
  • The mechanism aligns renders with text prompts.
  • Addresses poor framing, off-screen characters, and undesirable aesthetics in generative camera systems.
  • Paper available on arXiv with ID 2604.02467v3.

Entities

Institutions

  • arXiv

Sources