New Method Decomposes LLM Alignment for Multi-Stakeholder Tasks

ai-technology · 2026-05-27

Researchers propose DecompR, a method for aligning large language models (LLMs) in multi-stakeholder scenarios where users have conflicting preferences. Traditional holistic LLM judges conflate utility estimation and aggregation, introducing unstable implicit weights. DecompR fixes counterfactual-calibrated weights from query structure before candidate scoring and estimates per-role utilities independently, removing candidate-dependent weight drift and reducing estimation noise. Experiments show that weighting noise can create large score shifts when stakeholder satisfaction is dispersed, and these shifts increase with stakeholder count. The work is published on arXiv under computer science and artificial intelligence.

Key facts

DecompR decomposes utility estimation from aggregation in LLM alignment.
Holistic LLM judges conflate estimation and aggregation, causing unstable weights.
Weighting noise creates large score shifts when stakeholder satisfaction is dispersed.
Score shifts increase with stakeholder count.
DecompR uses counterfactual-calibrated weights fixed from query structure.
Per-role utilities are estimated independently in DecompR.
The method removes candidate-dependent weight drift.
The paper is available on arXiv under computer science and artificial intelligence.

New Method Decomposes LLM Alignment for Multi-Stakeholder Tasks

Key facts

Entities

Institutions

Sources