Semantic Flow Regularization Improves LLM Output Diversity

ai-technology · 2026-05-28

Researchers have developed a new method called Semantic Flow Regularization (SFR) to address Cross-Style Collapse in large language models. This issue happens when models fine-tuned for specific styles end up producing less diverse outputs. The underlying cause is the cross-entropy objective that restricts variation in shared representations. SFR serves as a simple additional goal, guiding the model with continuous sentence-encoder embeddings for future segments via conditional flow matching. It keeps multi-modality intact, and the flow-matching head is removed during inference, meaning no extra costs for deployment. Tests on the Qwen3-32B dataset, featuring 9 personas, show that SFR improves output diversity, style consistency, and response quality. Validation on LiveCodeBench-v5 confirms these enhancements. You can check out the study on arXiv, identifier 2605.27971.

Key facts

SFR addresses Cross-Style Collapse in LLMs.
Cross-Style Collapse is caused by cross-entropy objective suppressing diverse continuations.
SFR uses conditional flow matching with sentence-encoder embeddings.
Flow-matching head is discarded at inference, adding zero deployment cost.
Tested on Qwen3-32B with 9 personas.
Improves output diversity, style fidelity, and response quality over SFT.
Validated on LiveCodeBench-v5 with Qwen2.5-Coder-7B-Instruct.
Paper available on arXiv: 2605.27971.

Semantic Flow Regularization Improves LLM Output Diversity

Key facts

Entities

Institutions

Sources