Leveraging pretrained deep protein language model to predict peptide collision cross section

Published: 05 May 2025, Last Modified: 18 May 2025, Communications Chemistry, Everyone, CC BY-NC-ND 4.0
Abstract: The collision cross section (CCS) of peptide ions provides an important separation dimension in liquid chromatography/tandem mass spectrometry-based proteomics that incorporates ion mobility spectrometry (IMS), and its accurate prediction is the basis for advanced proteomics workflows. This paper describes experimental data and a prediction model for challenging CCS prediction tasks, including longer peptides that tend to have higher charge states. The proposed model is based on a pretrained deep protein language model. Whereas the conventional prediction model must be trained from scratch, the proposed model can be trained in less time because it uses the pretrained model as a feature extractor. Experiments on the new experimental data show that the proposed model drastically reduces training time while matching or exceeding the prediction performance of the conventional method. Our approach opens the possibility of “greener” training of predictors for various peptide properties in proteomic liquid chromatography/tandem mass spectrometry experiments.
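To make the feature-extractor idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a hypothetical stand-in for the pretrained protein language model (a fixed, mean-pooled residue embedding table), appends peptide length and charge state as extra inputs, and trains only a small linear head. All names (`make_frozen_extractor`, `train_head`, `TOY_PEPTIDES`) and the toy targets are illustrative assumptions. The targets are generated by an arbitrary rule and are not CCS measurements.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def make_frozen_extractor(dim=8, seed=0):
    """Hypothetical stand-in for a pretrained protein language model.

    A fixed random embedding per residue, mean-pooled over the sequence.
    The table is never updated, mimicking a frozen feature extractor.
    """
    rng = random.Random(seed)
    table = {aa: [rng.gauss(0.0, 1.0) for _ in range(dim)] for aa in AMINO_ACIDS}

    def extract(sequence, charge):
        pooled = [sum(table[aa][k] for aa in sequence) / len(sequence)
                  for k in range(dim)]
        # Assumed extra inputs: scaled length, scaled charge, and a bias term.
        return pooled + [len(sequence) / 50.0, charge / 5.0, 1.0]

    return extract


def predict(weights, x):
    return sum(w * xi for w, xi in zip(weights, x))


def mse(weights, features, targets):
    return sum((predict(weights, x) - y) ** 2
               for x, y in zip(features, targets)) / len(targets)


def train_head(features, targets, lr=0.01, epochs=2000):
    """Fit only the linear head by batch gradient descent on squared error."""
    weights = [0.0] * len(features[0])
    n = len(features)
    for _ in range(epochs):
        grad = [0.0] * len(weights)
        for x, y in zip(features, targets):
            err = predict(weights, x) - y
            for k, xk in enumerate(x):
                grad[k] += 2.0 * err * xk / n
        weights = [w - lr * g for w, g in zip(weights, grad)]
    return weights


# Toy (sequence, charge) pairs; targets follow an arbitrary rule, NOT real CCS.
TOY_PEPTIDES = [("ACDEFGHIK", 2), ("LMNPQRSTVWYACDEFGHIK", 3),
                ("GASPVTCLINDQKEMHFRYW", 4), ("PEPTIDEK", 2),
                ("VLSEGEWQLVLHVWAK", 3), ("AAGHKDEFR", 2)]
TOY_TARGETS = [0.1 * len(seq) + 0.5 * z for seq, z in TOY_PEPTIDES]

extract = make_frozen_extractor()
# Features are computed once and reused across all training epochs,
# because the extractor itself is never updated.
FEATURES = [extract(seq, z) for seq, z in TOY_PEPTIDES]
HEAD_WEIGHTS = train_head(FEATURES, TOY_TARGETS)
```

Because the extractor is frozen, its outputs can be cached and only the head's few weights are optimized. This is the general mechanism by which a pretrained feature extractor can shorten training compared with training a full model from scratch.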