Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction

Pablo Rodenas Ruiz; Ali Saadat; Timothy T. Tran; Oliver Müller Smedt; Peng Zhang; Jacques Fellay

Benchmarking Fine-Tuned RNA Language Models for Intronic Branch Point Prediction

Pablo Rodenas Ruiz, Ali Saadat, Timothy T. Tran, Oliver Müller Smedt, Peng Zhang, Jacques Fellay

Published: 05 Mar 2025, Last Modified: 16 Apr 2025ICLR 2025 AI4NA PosterEveryoneRevisionsBibTeXCC BY 4.0

Track: tiny / short paper (up to 3 pages)

Keywords: Branch point, Language models, RNA splicing, Fine-tuning, Benchmark

TL;DR: Several RNA language models are fine-tuned for branch point prediction, achieving state-of-the-art performance, thus setting a new benchmark for future studies.

Abstract: Accurate prediction of RNA branch points is critical for understanding splicing mechanisms and identifying variants that may lead to genetic diseases. Despite their biological importance, few computational methods have been developed for reliably identifying branch points. In this work, we fine-tune several RNA language models for branch point prediction. The top-performing model, ERNIE-RNA, achieved an $F_1$ score of 0.811, a sequence accuracy of 0.790, and an average precision score of 0.868, outperforming previous leading models. These results showcase the potential of RNA-specific language models in capturing the subtle sequence features relevant to splicing. Our findings suggest that extended training and hyperparameter tuning could yield additional performance gains, positioning this study as a strong baseline for future research in RNA splicing.

Submission Number: 11

Loading