ReasonRank: Boosting Passage Ranking with Reasoning Models
A new paper introduces ReasonRank, a listwise passage reranker that leverages large reasoning models (LRMs) to improve ranking performance on complex queries. The authors propose an automated framework to generate reasoning-intensive training data by sourcing queries and passages from diverse domains and using DeepSeek-R1 to produce high-quality labels. A two-stage training approach is employed: cold-start supervised fine-tuning followed by reinforcement learning. The method aims to address the scarcity of reasoning-intensive training data and enhance the ranking ability of rerankers in complex scenarios.
Key facts
- arXiv paper 2508.07050
- Announce type: replace-cross
- Uses Large Reasoning Models (LRMs) for listwise ranking
- Proposes automated reasoning-intensive training data synthesis
- Sources training data from diverse domains
- Applies DeepSeek-R1 to generate training labels
- Two-stage training: cold-start SFT and reinforcement learning
- Aims to improve ranking in complex scenarios
Entities
Institutions
- arXiv
- DeepSeek