ARTFEED — Contemporary Art Intelligence

Agri-CPJ: Training-Free Framework for Agricultural Pest Diagnosis Using LLM-as-a-Judge

other · 2026-04-29

The Agri-CPJ (Caption-Prompt-Judge) framework has been introduced to tackle two persistent issues in diagnosing crop diseases through field images: the generation of incorrect species names and unclear reasoning. This training-free few-shot approach employs a substantial vision-language model to create an initial structured morphological caption, which is then refined through multi-dimensional quality gating before any diagnostic queries are addressed. It generates two potential responses from different perspectives, with an LLM judge determining the more robust option based on specific domain criteria. Ablation studies reveal that caption refinement significantly impacts accuracy, as omitting it leads to consistent performance declines in both models evaluated. On CDDMBench, combining GPT-5-Nano with captions from GPT-5-mini shows enhanced results. The research can be found on arXiv under identifier 2604.23701.

Key facts

  • Agri-CPJ stands for Caption-Prompt-Judge.
  • It is a training-free few-shot framework.
  • It uses a large vision-language model to generate structured morphological captions.
  • Captions are iteratively refined through multi-dimensional quality gating.
  • Two candidate responses are generated from complementary viewpoints.
  • An LLM judge selects the stronger response based on domain-specific criteria.
  • Caption refinement has the largest individual impact on accuracy.
  • Skipping caption refinement consistently degrades downstream accuracy.
  • The framework was tested on CDDMBench.
  • Pairing GPT-5-Nano with GPT-5-mini-generated captions yields improved performance.
  • The paper is on arXiv with ID 2604.23701.

Entities

Institutions

  • arXiv

Sources