MLLM Concept-Based Explanations Degrade Accuracy in Few-Shot ICL

ai-technology · 2026-05-28

A new study on arXiv (2605.28215) systematically evaluates concept-based explainability of frozen multimodal large language models (MLLMs) under few-shot in-context learning (ICL). Using five conditions of increasing formal rigour—from baseline classification to Description Logics (DL) axiom generation—the authors test four state-of-the-art MLLMs via an independent LLM-as-a-judge pipeline. Results show that generating formally structured, concept-based explanations degrades predictive accuracy monotonically from 93.8% to 90.1%, contradicting the assumption that explicit reasoning universally aids performance. The paper argues that explaining is genuinely harder than predicting alone, and that Chain-of-Thought prompting may not reflect true internal computation. The study was published on arXiv with ID 2605.28215v1.

Key facts

arXiv paper 2605.28215v1 evaluates concept-based explainability of MLLMs under few-shot ICL
Five conditions of increasing formal rigour tested: baseline to Description Logics axiom generation
Four state-of-the-art MLLMs evaluated via LLM-as-a-judge pipeline
Predictive accuracy dropped from 93.8% to 90.1% with formal concept-based explanations
Chain-of-Thought prompting may not reflect true internal computation
Explaining is harder than predicting alone
Study contradicts assumption that explicit reasoning universally aids performance
Published on arXiv with announcement type 'new'

MLLM Concept-Based Explanations Degrade Accuracy in Few-Shot ICL

Key facts

Entities

Institutions

Sources