PepBenchmark: A Standardized Benchmark for Peptide Machine Learning

Published: 26 Jan 2026, Last Modified: 08 May 2026ICLR 2026 PosterEveryoneRevisionsBibTeXCC BY 4.0
Keywords: peptide machine learning, benchmark, protein language models
TL;DR: We introduce PepBenchmark, a standardized framework with datasets, preprocessing, and evaluation tools that establish the first reproducible benchmark for peptide machine learning.
Abstract: Peptide therapeutics are widely regarded as the “third generation” of drugs, yet progress in peptide Machine Learning (ML) are hindered by the absence of standardized benchmarks. Here we present \textbf{PepBenchmark}, which unifies datasets, preprocessing, and evaluation protocols for peptide drug discovery. PepBenchmark comprises three components: (1) \textbf{PepBenchData}, a well-curated collection comprising 29 canonical-peptide and 6 non-canonical-peptide datasets across 7 groups, systematically covering key aspects of peptide drug development—representing, to the best of our knowledge, the most comprehensive AI-ready dataset resource to date; (2) \textbf{PepBenchPipeline}, a standardized preprocessing pipeline that ensures consistent dataset cleaning, construction, splitting, and feature transformation, mitigating quality issues common in ad hoc pipelines; and (3) \textbf{PepBenchLeaderboard}, a unified evaluation protocol and leaderboard with strong baselines across 4 major methodological families: Fingerprint-based, GNN-based, PLM-based, and SMILES-based models. Together, PepBenchmark provides the first standardized and comparable foundation for peptide drug discovery, facilitating methodological advances and translation into real-world applications. The data and code are publicly available at \href{https://github.com/ZGCI-AI4S-Pep/PepBenchmark/}{\texttt{https://github.com/ZGCI-AI4S-Pep/PepBenchmark/}}.
Supplementary Material: pdf
Primary Area: datasets and benchmarks
Submission Number: 11509
Loading