REVELIO Framework Reveals Failure Modes in Vision-Language Models

ai-technology · 2026-05-14

A new framework named REVELIO has been developed by researchers to systematically identify interpretable failure modes in Vision-Language Models (VLMs). These models are increasingly utilized in safety-sensitive contexts due to their extensive reasoning and generalization abilities, yet they can encounter severe failures in particular real-world scenarios. REVELIO characterizes a failure mode as a combination of interpretable, relevant concepts—like pedestrian proximity or adverse weather—where a specific VLM consistently fails. To tackle the challenge of navigating a vast discrete combinatorial space, REVELIO integrates two search methods: a diversity-aware beam search for mapping the failure landscape and a Gaussian-process Thompson Sampling approach for wider exploration. This framework seeks to enhance VLM reliability in critical applications, as detailed in a paper on arXiv (2605.12674).

Key facts

REVELIO is a framework for uncovering interpretable failure modes in VLMs.
VLMs are used in safety-critical applications due to broad reasoning and generalization.
Failure modes are compositions of interpretable, domain-relevant concepts.
Examples of concepts include pedestrian proximity and adverse weather conditions.
The search space is exponentially large and discrete combinatorial.
REVELIO uses diversity-aware beam search and Gaussian-process Thompson Sampling.
The paper is available on arXiv with ID 2605.12674.
The work aims to improve VLM reliability in critical situations.

REVELIO Framework Reveals Failure Modes in Vision-Language Models

Key facts

Entities

Institutions

Sources