Region-Aware Attention Recalibration Reduces Object Hallucination in LVLMs
A new training-free inference strategy called region-aware attention recalibration mitigates object hallucination in Large Vision-Language Models (LVLMs). The method computes an outlier-resistant statistical midpoint across attention heads to establish a stable anchor for visual representations, then uses inter-head disagreement mapped across regions to dynamically determine intervention budgets. This approach avoids expensive data-driven fine-tuning, high-latency contrastive decoding, and rigid attention head truncation, which often compromise computational efficiency or feature space continuity. The paper is published on arXiv with ID 2605.24957.
Key facts
- Object hallucination is a persistent challenge in LVLMs
- Current approaches include data-driven fine-tuning, contrastive decoding, and attention head truncation
- The new method is training-free and inference-time
- It uses region-aware adaptive weighting to correct semantic drift
- An outlier-resistant statistical midpoint is computed across attention heads
- Inter-head disagreement mapped across regions determines intervention budgets
- The approach avoids abrupt heuristic truncations
- The paper is on arXiv: 2605.24957
Entities
Institutions
- arXiv