LLM Rubrics Boost Clinical Supervised Learning

ai-technology · 2026-05-23

A recent preprint on arXiv (2603.11679) introduces an agentic pipeline that enables large language models (LLMs) to create both global and local rubrics, enhancing supervised learning for intricate multimodal datasets. The global rubric is a programmatic specification derived from a varied collection of text-serialized examples, aimed at standardizing input formats. In contrast, local rubrics serve as interpretive summaries tailored to specific tasks. When evaluated on 15 clinical tasks from the EHRSHOT benchmark, the rubric methods surpassed count-feature models, basic LLM baselines, and a clinical foundation model that had been pretrained on similar datasets. This approach minimizes the necessity for domain expertise in designing input representations.

Key facts

arXiv:2603.11679v3
LLM agentic pipeline for supervised learning
Global rubric synthesized from diverse text-serialized examples
Local rubrics are task-conditioned interpretive summaries
Tested on 15 clinical tasks from EHRSHOT benchmark
Outperformed count-feature models, naive LLM baselines, and a clinical foundation model
Reduces need for domain expertise in input representation
Focuses on multimodal data: time-series, free text, structured records

LLM Rubrics Boost Clinical Supervised Learning

Key facts

Entities

Institutions

Sources