Budget-Aware Routing for Long Clinical Text
A recent study introduces RCD, a monotone submodular objective aimed at budgeted context selection for large language models handling lengthy clinical texts. This approach tackles constraints related to token costs and latency by choosing a limited number of document units within a defined budget. The researchers evaluate various unitization methods, including sentence, section, window, and cluster-based approaches, and present a routing heuristic that adjusts according to budgetary conditions. Findings from experiments conducted on MIMIC discharge notes, Cochrane abstracts, and L-Eval indicate that the most effective strategies vary depending on the evaluation context.
Key facts
- arXiv:2605.00336v1
- RCD is a monotone submodular objective balancing relevance, coverage, and diversity
- Budgeted context selection chooses a subset of document units under a strict token budget
- Unitization defines document segmentation; selection determines which units are kept
- Compared sentence, section, window, and cluster-based unitization
- Introduced a routing heuristic that adapts to the budget regime
- Experiments on MIMIC discharge notes, Cochrane abstracts, and L-Eval
- Optimal strategies depend on the evaluation setting
Entities
—