ARTFEED — Contemporary Art Intelligence

Budget-Aware Routing for Long Clinical Text

other · 2026-05-04

A recent study introduces RCD, a monotone submodular objective aimed at budgeted context selection for large language models handling lengthy clinical texts. This approach tackles constraints related to token costs and latency by choosing a limited number of document units within a defined budget. The researchers evaluate various unitization methods, including sentence, section, window, and cluster-based approaches, and present a routing heuristic that adjusts according to budgetary conditions. Findings from experiments conducted on MIMIC discharge notes, Cochrane abstracts, and L-Eval indicate that the most effective strategies vary depending on the evaluation context.

Key facts

  • arXiv:2605.00336v1
  • RCD is a monotone submodular objective balancing relevance, coverage, and diversity
  • Budgeted context selection chooses a subset of document units under a strict token budget
  • Unitization defines document segmentation; selection determines which units are kept
  • Compared sentence, section, window, and cluster-based unitization
  • Introduced a routing heuristic that adapts to the budget regime
  • Experiments on MIMIC discharge notes, Cochrane abstracts, and L-Eval
  • Optimal strategies depend on the evaluation setting

Entities

Sources