CLOVER: Closed-Loop Value Estimation for Autonomous Driving Planning

ai-technology · 2026-05-16

To tackle the training-evaluation discrepancy in end-to-end autonomous driving planners, researchers have introduced CLOVER, a Closed-LOop Value Estimation and Ranking framework. Conventional planners rely on imitating a singular logged trajectory for training but assess performance using rule-based metrics such as safety, feasibility, progress, and comfort. This creates a situation where trajectories that closely resemble the logged path may breach rules, while those that deviate can still be valid and score highly. CLOVER employs a streamlined generator-scorer approach: the generator creates a variety of candidate trajectories, and the scorer estimates planning-metric sub-scores for ranking during inference. This enhances proposal support beyond mere single-trajectory imitation. The framework also develops an evaluator to boost candidate-set coverage and scorer ranking quality, which is crucial for proposal-selection planners. The research is published on arXiv (2605.15120).

Key facts

CLOVER stands for Closed-LOop Value Estimation and Ranking.
It addresses training-evaluation mismatch in autonomous driving planning.
Traditional planners imitate a single logged trajectory.
Evaluation uses rule-based metrics: safety, feasibility, progress, comfort.
CLOVER uses a generator-scorer formulation.
Generator produces diverse candidate trajectories.
Scorer predicts planning-metric sub-scores for ranking.
Framework improves candidate-set coverage and scorer ranking quality.

CLOVER: Closed-Loop Value Estimation for Autonomous Driving Planning

Key facts

Entities

Institutions

Sources