ProtDBench: Standardized Benchmark for Protein Binder Design

other · 2026-05-07

ProtDBench introduces a framework for evaluating de novo protein binder design. It focuses on standardized, throughput-aware assessment to address the inconsistent evaluation methods used across studies. The framework establishes common benchmark tasks, evaluation protocols, and success criteria, which makes it possible to examine systematically how evaluation design influences reported performance. Using a large wet-lab annotated dataset, it examines widely used structure prediction models in their role as evaluators (verifiers) and finds substantial verifier-dependent bias and low agreement between verifiers, even under the same filtering criteria. It also benchmarks a selection of open-source generative binder design methods across ten distinct protein targets under one consistent evaluation protocol. Finally, ProtDBench includes throughput-aware metrics that go beyond simple per-sequence success rates.
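To make "low agreement under the same filtering criteria" concrete, the following is a minimal sketch of one way agreement between verifiers could be quantified. It is not taken from ProtDBench: the verifier names, the scores, the 0.7 threshold, and the choice of Jaccard overlap and Cohen's kappa are all assumptions made for illustration.

```python
from itertools import combinations


def passing_set(scores, threshold):
    """Return the IDs of designs whose score meets the filter threshold."""
    return {design for design, score in scores.items() if score >= threshold}


def jaccard(a, b):
    """Overlap of two pass sets: |A & B| / |A | B| (1.0 if both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cohen_kappa(a, b, universe):
    """Chance-corrected agreement of two pass/fail labelings over `universe`."""
    n = len(universe)
    both_pass = len(a & b)
    both_fail = n - len(a | b)
    observed = (both_pass + both_fail) / n
    rate_a, rate_b = len(a) / n, len(b) / n
    expected = rate_a * rate_b + (1 - rate_a) * (1 - rate_b)
    if expected == 1.0:  # both verifiers are constant and identical
        return 1.0
    return (observed - expected) / (1 - expected)


def pairwise_agreement(scores_by_verifier, threshold):
    """Apply the same threshold to every verifier and compare each pair."""
    universe = set().union(*(s.keys() for s in scores_by_verifier.values()))
    passed = {
        name: passing_set(scores, threshold)
        for name, scores in scores_by_verifier.items()
    }
    return {
        (v1, v2): {
            "jaccard": jaccard(passed[v1], passed[v2]),
            "kappa": cohen_kappa(passed[v1], passed[v2], universe),
        }
        for v1, v2 in combinations(sorted(passed), 2)
    }


# Hypothetical confidence scores for four designs from two made-up verifiers.
example_scores = {
    "verifier_a": {"d1": 0.90, "d2": 0.80, "d3": 0.40, "d4": 0.30},
    "verifier_b": {"d1": 0.85, "d2": 0.50, "d3": 0.75, "d4": 0.20},
}
agreement = pairwise_agreement(example_scores, threshold=0.7)
```

In this toy example both verifiers accept two of the four designs, but they agree on only one of them. An identical threshold therefore selects largely different designs, and the chance-corrected agreement is zero.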

Key facts

ProtDBench is a standardized evaluation framework for protein binder design.
It defines unified benchmark tasks, evaluation protocols, and success criteria.
The framework uses a large wet-lab annotated dataset.
It analyzes structure prediction models as evaluation verifiers.
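It reveals substantial verifier-dependent bias and limited agreement between verifiers under the same filtering criteria.
It benchmarks open-source generative binder design methods across ten protein targets.
It incorporates throughput-aware metrics that go beyond per-sequence success rates.
It is published on arXiv with ID 2605.04118.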
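
The summary does not define the throughput-aware metrics. As a hedged illustration of why per-sequence success rate alone can mislead, the sketch below compares two hypothetical methods by successes per GPU-hour and by the number of designs needed to obtain at least one success. The class, the function, the method names, and all numbers are assumptions, not ProtDBench definitions or results.

```python
import math
from dataclasses import dataclass


@dataclass
class MethodRun:
    """Hypothetical summary of one design campaign (illustrative fields)."""
    name: str
    n_designs: int    # sequences generated
    n_success: int    # sequences passing the success criteria
    gpu_hours: float  # total compute spent generating them

    @property
    def success_rate(self):
        """Per-sequence success rate."""
        return self.n_success / self.n_designs

    @property
    def successes_per_gpu_hour(self):
        """Successful designs obtained per unit of compute."""
        return self.n_success / self.gpu_hours


def designs_for_one_hit(success_rate, confidence=0.95):
    """Smallest n with P(at least one success in n designs) >= confidence,
    assuming independent designs with equal success probability."""
    if success_rate <= 0:
        return math.inf
    if success_rate >= 1:
        return 1
    return math.ceil(math.log(1 - confidence) / math.log(1 - success_rate))


# Made-up numbers: a slow, accurate method versus a fast, less accurate one.
slow = MethodRun("method_x", n_designs=1_000, n_success=50, gpu_hours=100.0)
fast = MethodRun("method_y", n_designs=10_000, n_success=200, gpu_hours=100.0)

for run in (slow, fast):
    print(
        f"{run.name}: rate={run.success_rate:.3f}, "
        f"per GPU-hour={run.successes_per_gpu_hour:.2f}, "
        f"designs for one hit={designs_for_one_hit(run.success_rate)}"
    )
```

Here method_x has the higher per-sequence success rate (0.05 against 0.02), while method_y yields four times as many successes per GPU-hour. The ranking of the two methods depends on which metric is used.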
