Model_rank,Model_name,Model type,Average_Spearman,Bootstrap_standard_error_Spearman,Function_Activity,Function_Binding,Function_Expression,Function_OrganismalFitness,Function_Stability,Low_MSA_depth,Medium_MSA_depth,High_MSA_depth,Taxa_Human,Taxa_Other_Eukaryote,Taxa_Prokaryote,Taxa_Virus,Depth_1,Depth_2,Depth_3,Depth_4,Depth_5+,Low_Plddt,Medium_Plddt,High_Plddt,Model details,References
1,S3F_MSA,Sequence & Structure & Surface & Alignment,0.496,0,0.502,0.44,0.479,0.476,0.581,0.469,0.509,0.546,0.5,0.55,0.53,0.496,0.498,0.331,0.369,0.33,0.383,0.402,0.512,0.542,,
2,S2F_MSA,Sequence & Structure & Alignment,0.487,0.001,0.498,0.428,0.472,0.472,0.567,0.461,0.501,0.536,0.493,0.539,0.522,0.486,0.49,0.301,0.338,0.303,0.362,0.399,0.504,0.532,,
3,S3F,Sequence & Structure & Surface,0.47,0.005,0.468,0.404,0.472,0.412,0.594,0.412,0.473,0.547,0.484,0.527,0.522,0.4,0.484,0.326,0.323,0.281,0.337,0.381,0.485,0.514,,
4,TranceptEVE L,Hybrid - Alignment & PLM,0.456,0.006,0.487,0.376,0.457,0.46,0.5,0.451,0.467,0.492,0.471,0.498,0.473,0.453,0.446,0.28,0.35,0.32,0.382,0.374,0.468,0.5,TranceptEVE Large model (Tranception Large & retrieved EVE model),"<a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. &  (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a>"
5,TranceptEVE M,Hybrid - Alignment & PLM,0.455,0.005,0.479,0.386,0.452,0.454,0.502,0.44,0.468,0.488,0.473,0.498,0.466,0.441,0.441,0.281,0.304,0.309,0.375,0.37,0.467,0.496,TranceptEVE Medium model (Tranception Medium & retrieved EVE model),"<a href='https://www.biorxiv.org/content/10.1101/2022.12.07.519495v1?rss=1'>Notin, P., Van Niekerk, L., Kollasch, A., Ritter, D., Gal, Y. & Marks, D.S. &  (2022). TranceptEVE: Combining Family-specific and Family-agnostic Models of Protein Sequences for Improved Fitness Prediction. NeurIPS, LMRL workshop.</a>"
6,GEMME,Alignment-based model,0.455,0.006,0.482,0.383,0.438,0.452,0.519,0.455,0.47,0.497,0.468,0.51,0.473,0.469,0.446,0.274,0.321,0.324,0.414,0.377,0.471,0.503,GEMME model,"<a href='https://pubmed.ncbi.nlm.nih.gov/31406981/'>Laine, _., Karami, Y., & Carbone, A. (2019). GEMME: A Simple and Fast Global Epistatic Model Predicting Mutational Effects. Molecular Biology and Evolution, 36, 2604 - 2619.</a>"
7,S2F,Sequence & Structure,0.454,0.005,0.459,0.384,0.456,0.403,0.568,0.394,0.458,0.528,0.471,0.505,0.506,0.38,0.468,0.278,0.283,0.241,0.297,0.377,0.468,0.496,,
8,TranceptEVE S,Hybrid - Alignment & PLM,0.452,0.004,0.475,0.396,0.443,0.449,0.497,0.449,0.46,0.484,0.468,0.49,0.467,0.433,0.435,0.275,0.304,0.304,0.372,0.373,0.459,0.494,TranceptEVE Small model (Tranception Small & retrieved EVE model),"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
9,EVE (ensemble),Alignment-based model,0.439,0.005,0.464,0.386,0.408,0.447,0.491,0.425,0.453,0.481,0.453,0.487,0.468,0.428,0.427,0.273,0.308,0.298,0.355,0.341,0.453,0.489,EVE model (ensemble of 5 independently-trained models),"<a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a>"
10,VESPA,Protein language model,0.436,0.007,0.468,0.366,0.404,0.44,0.5,0.427,0.455,0.484,0.438,0.492,0.49,0.432,0.434,0.183,0.357,0.302,0.328,0.364,0.456,0.482,VESPA model,"<a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a>"
11,Tranception L,Hybrid - Alignment & PLM,0.434,0.007,0.465,0.349,0.45,0.436,0.471,0.432,0.438,0.473,0.453,0.483,0.431,0.432,0.423,0.258,0.352,0.318,0.387,0.366,0.446,0.471,Tranception Large model (700M params) with retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
12,MSA Transformer (ensemble),Hybrid - Alignment & PLM,0.434,0.011,0.469,0.337,0.446,0.421,0.495,0.404,0.45,0.488,0.437,0.505,0.463,0.414,0.426,0.238,0.384,0.366,0.408,0.284,0.454,0.484,MSA Transformer (ensemble of 5 MSA samples),"<a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a>"
13,EVE (single),Alignment-based model,0.433,0.005,0.458,0.372,0.404,0.442,0.487,0.417,0.448,0.477,0.445,0.484,0.464,0.424,0.422,0.271,0.304,0.296,0.359,0.326,0.447,0.486,EVE model (single seed),"<a href='https://www.nature.com/articles/s41586-021-04043-8'>Frazer, J., Notin, P., Dias, M., Gomez, A.N., Min, J.K., Brock, K.P., Gal, Y., & Marks, D.S. (2021). Disease variant prediction with deep generative models of evolutionary data. Nature.</a>"
14,Tranception M,Hybrid - Alignment & PLM,0.427,0.007,0.448,0.361,0.441,0.422,0.465,0.417,0.432,0.456,0.451,0.476,0.404,0.415,0.41,0.247,0.24,0.27,0.353,0.356,0.444,0.451,Tranception Medium model (300M params) with retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
15,ESM-IF1,Inverse folding model,0.422,0.01,0.368,0.389,0.407,0.324,0.624,0.3,0.431,0.544,0.415,0.497,0.507,0.374,0.438,0.345,0.29,0.289,0.358,0.196,0.422,0.519,ESM-IF1 model,"<a href='https://www.biorxiv.org/content/10.1101/2022.04.10.487779v2.full.pdf+html'>Chloe Hsu, Robert Verkuil, Jason Liu, Zeming Lin, Brian Hie, Tom Sercu, Adam Lerer, Alexander Rives (2022). Learning Inverse Folding from Millions of Predicted Structures. BioRxiv.</a>"
16,MSA Transformer (single),Hybrid - Alignment & PLM,0.421,0.011,0.457,0.33,0.435,0.409,0.476,0.393,0.435,0.473,0.427,0.491,0.451,0.39,0.411,0.239,0.39,0.368,0.413,0.292,0.435,0.472,MSA Transformer (single MSA sample),"<a href='http://proceedings.mlr.press/v139/rao21a.html'>Rao, R., Liu, J., Verkuil, R., Meier, J., Canny, J.F., Abbeel, P., Sercu, T., & Rives, A. (2021). MSA Transformer. ICML.</a>"
17,DeepSequence (ensemble),Alignment-based model,0.419,0.007,0.455,0.363,0.39,0.413,0.476,0.383,0.428,0.473,0.442,0.469,0.46,0.344,0.404,0.264,0.313,0.309,0.378,0.342,0.419,0.475,DeepSequence model (ensemble of 5 independently-trained models),"<a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a>"
18,Tranception S,Hybrid - Alignment & PLM,0.418,0.006,0.436,0.372,0.42,0.411,0.452,0.428,0.415,0.444,0.438,0.463,0.396,0.405,0.397,0.24,0.249,0.272,0.352,0.365,0.43,0.439,Tranception Small model (85M params) with retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
19,ESM2 (650M),Protein language model,0.414,0.013,0.425,0.337,0.415,0.369,0.523,0.335,0.406,0.515,0.456,0.471,0.476,0.238,0.421,0.247,0.22,0.162,0.218,0.348,0.426,0.457,ESM2 model (650M params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
20,ESM-1v (ensemble),Protein language model,0.41,0.012,0.414,0.318,0.431,0.387,0.5,0.326,0.418,0.502,0.458,0.446,0.454,0.289,0.4,0.221,0.186,0.151,0.203,0.303,0.432,0.454,ESM-1v (ensemble of 5 independently-trained models),"<a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a>"
21,DeepSequence (single),Alignment-based model,0.408,0.008,0.447,0.349,0.372,0.397,0.473,0.382,0.415,0.465,0.436,0.465,0.448,0.323,0.391,0.252,0.277,0.293,0.37,0.338,0.408,0.467,DeepSequence model (single seed),"<a href='https://www.nature.com/articles/s41592-018-0138-4'>Riesselman, A.J., Ingraham, J., & Marks, D.S. (2018). Deep generative models of genetic variation capture the effects of mutations. Nature Methods, 15, 816-822.</a>"
22,ESM2 (3B),Protein language model,0.406,0.011,0.417,0.321,0.403,0.379,0.509,0.348,0.415,0.49,0.441,0.461,0.477,0.274,0.414,0.213,0.202,0.166,0.217,0.346,0.41,0.466,ESM2 model (3B params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
23,MIF-ST,Hybrid - Structure & PLM,0.402,0.01,0.39,0.321,0.438,0.373,0.486,0.376,0.403,0.456,0.404,0.415,0.463,0.396,0.431,0.265,0.334,0.298,0.298,0.304,0.417,0.442,MIF-ST model,"<a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a>"
24,ESM2 (15B),Protein language model,0.401,0.011,0.405,0.317,0.405,0.388,0.488,0.357,0.414,0.473,0.431,0.449,0.459,0.313,0.405,0.204,0.239,0.172,0.234,0.336,0.41,0.457,ESM2 model (15B params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
25,EVmutation,Alignment-based model,0.395,0.008,0.44,0.317,0.378,0.411,0.43,0.403,0.423,0.41,0.409,0.444,0.422,0.388,0.375,0.274,0.324,0.301,0.394,0.298,0.406,0.448,EVmutation model,"<a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a>"
26,ESM-1b,Protein language model,0.394,0.012,0.428,0.287,0.406,0.351,0.5,0.35,0.398,0.482,0.434,0.475,0.455,0.241,0.383,0.227,0.187,0.149,0.27,0.319,0.405,0.454,ESM-1b (w/ Brandes et al. extensions),"[1] Original model: <a href='https://www.biorxiv.org/content/10.1101/622803v4'>Rives, A., Goyal, S., Meier, J., Guo, D., Ott, M., Zitnick, C.L., Ma, J., & Fergus, R. (2019). Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences. Proceedings of the National Academy of Sciences of the United States of America, 118.</a> [2] Extensions: <a href='https://www.biorxiv.org/content/10.1101/2022.08.25.505311v1'>Brandes, N., Goldman, G., Wang, C.H., Ye, C.J., & Ntranos, V. (2022). Genome-wide prediction of disease variants with a deep protein language model. bioRxiv.</a>"
27,VESPAl,Protein language model,0.394,0.007,0.429,0.347,0.326,0.404,0.461,0.382,0.412,0.449,0.392,0.461,0.451,0.392,0.385,0.144,0.324,0.276,0.32,0.324,0.411,0.446,VESPAl model,"<a href='https://link.springer.com/article/10.1007/s00439-021-02411-y'>Marquet, C., Heinzinger, M., Olenyi, T., Dallago, C., Bernhofer, M., Erckert, K., & Rost, B. (2021). Embeddings from protein language models predict conservation and variant effects. Human Genetics, 141, 1629 - 1647.</a>"
28,Progen2 XL,Protein language model,0.391,0.01,0.402,0.302,0.418,0.387,0.445,0.354,0.405,0.444,0.384,0.442,0.439,0.391,0.385,0.184,0.28,0.219,0.28,0.289,0.401,0.44,Progen2 xlarge model (6.4B params),"<a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a>"
29,ESM2 (150M),Protein language model,0.387,0.015,0.391,0.326,0.402,0.305,0.51,0.306,0.358,0.497,0.449,0.46,0.407,0.137,0.386,0.243,0.146,0.173,0.231,0.314,0.377,0.436,ESM2 model (150M params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
30,MIF,Inverse folding model,0.383,0.011,0.327,0.336,0.43,0.295,0.524,0.349,0.374,0.446,0.397,0.39,0.417,0.352,0.409,0.269,0.257,0.24,0.24,0.261,0.382,0.432,MIF model,"<a href='https://www.biorxiv.org/content/10.1101/2022.05.25.493516v3'>Kevin K. Yang, Hugh Yeh, Niccolo Zanichelli (2023). Masked Inverse folding with Sequence Transfer for Protein Representation Learning. BioRxiv.</a>"
31,Progen2 L,Protein language model,0.38,0.01,0.406,0.293,0.427,0.379,0.396,0.348,0.381,0.42,0.41,0.414,0.369,0.333,0.371,0.144,0.232,0.199,0.258,0.305,0.398,0.398,Progen2 large model (2.7B params),"<a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a>"
32,ESM-1v (single),Protein language model,0.38,0.015,0.39,0.266,0.406,0.362,0.476,0.286,0.393,0.481,0.432,0.422,0.43,0.256,0.373,0.192,0.187,0.142,0.197,0.271,0.408,0.428,ESM-1v (single seed),"<a href='https://proceedings.neurips.cc/paper/2021/hash/f51338d736f95dd42427296047067694-Abstract.html'>Meier, J., Rao, R., Verkuil, R., Liu, J., Sercu, T., & Rives, A. (2021). Language models enable zero-shot prediction of the effects of mutations on protein function. NeurIPS.</a>"
33,Progen2 M,Protein language model,0.379,0.01,0.393,0.295,0.433,0.381,0.396,0.318,0.382,0.425,0.411,0.406,0.356,0.342,0.372,0.13,0.158,0.135,0.177,0.307,0.396,0.395,Progen2 medium model (760M params),"<a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a>"
34,Progen2 Base,Protein language model,0.378,0.01,0.396,0.294,0.437,0.379,0.383,0.342,0.368,0.423,0.421,0.408,0.331,0.328,0.369,0.13,0.145,0.157,0.208,0.309,0.396,0.388,Progen2 base model (760M params),"<a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a>"
35,Tranception L no retrieval,Protein language model,0.374,0.01,0.401,0.288,0.413,0.389,0.381,0.358,0.371,0.419,0.389,0.376,0.381,0.395,0.363,0.178,0.308,0.257,0.334,0.291,0.386,0.404,Tranception Large model (700M params) without retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
36,RITA XL,Protein language model,0.372,0.009,0.366,0.302,0.414,0.381,0.398,0.315,0.382,0.412,0.394,0.384,0.353,0.402,0.356,0.139,0.136,0.154,0.233,0.293,0.392,0.392,RITA xlarge model (1.2B params),"<a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a>"
37,CARP (640M),Protein language model,0.369,0.013,0.395,0.273,0.397,0.364,0.414,0.314,0.375,0.428,0.416,0.386,0.39,0.273,0.389,0.213,0.187,0.164,0.162,0.319,0.398,0.383,CARP model (640M params),"<a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a>"
38,RITA L,Protein language model,0.365,0.01,0.359,0.29,0.42,0.374,0.383,0.316,0.369,0.403,0.394,0.386,0.319,0.391,0.347,0.137,0.135,0.147,0.21,0.291,0.388,0.377,RITA large model (680M params),"<a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a>"
39,Site-Independent,Alignment-based model,0.359,0.01,0.369,0.344,0.343,0.382,0.358,0.426,0.373,0.32,0.379,0.385,0.316,0.383,0.336,0.235,0.226,0.267,0.35,0.337,0.371,0.366,Site-Independent model,"<a href='https://www.nature.com/articles/nbt.3769'>Hopf, T.A., Ingraham, J., Poelwijk, F.J., Schärfe, C.P., Springer, M., Sander, C., & Marks, D.S. (2017). Mutation effects predicted from sequence co-variation. Nature Biotechnology, 35, 128-135.</a>"
40,RITA M,Protein language model,0.35,0.013,0.352,0.273,0.405,0.371,0.348,0.304,0.349,0.39,0.378,0.349,0.31,0.385,0.336,0.114,0.134,0.168,0.222,0.276,0.373,0.357,RITA medium model (300M params),"<a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a>"
41,Tranception M no retrieval,Protein language model,0.348,0.011,0.349,0.284,0.406,0.362,0.342,0.293,0.349,0.379,0.379,0.335,0.314,0.349,0.331,0.142,0.107,0.15,0.21,0.257,0.368,0.352,Tranception Medium model (300M params) without retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
42,Unirep evotuned,Hybrid - Alignment & PLM,0.347,0.009,0.355,0.305,0.365,0.346,0.366,0.33,0.344,0.372,0.355,0.363,0.346,0.349,0.319,0.154,0.25,0.226,0.294,0.233,0.363,0.367,Unirep model w/ evotuning,"<a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a>"
43,Progen2 S,Protein language model,0.336,0.013,0.333,0.275,0.384,0.337,0.349,0.283,0.321,0.391,0.384,0.334,0.298,0.285,0.327,0.113,0.12,0.133,0.187,0.287,0.355,0.337,Progen2 small model (150M params),"<a href='https://arxiv.org/abs/2206.13517'> Nijkamp, E., Ruffolo, J.A., Weinstein, E.N., Naik, N., & Madani, A. (2022). ProGen2: Exploring the Boundaries of Protein Language Models. ArXiv, abs/2206.13517. </a>"
44,CARP (76M),Protein language model,0.328,0.014,0.342,0.282,0.369,0.269,0.377,0.247,0.301,0.406,0.387,0.355,0.301,0.15,0.334,0.203,0.11,0.134,0.127,0.258,0.348,0.322,CARP model (76M params),"<a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a>"
45,ESM2 (35M),Protein language model,0.321,0.016,0.314,0.291,0.343,0.218,0.439,0.239,0.271,0.451,0.37,0.394,0.324,0.102,0.305,0.242,0.134,0.164,0.229,0.249,0.311,0.358,ESM2 model (35M params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
46,RITA S,Protein language model,0.304,0.013,0.294,0.275,0.336,0.327,0.289,0.276,0.297,0.334,0.33,0.281,0.245,0.358,0.285,0.11,0.096,0.125,0.19,0.235,0.314,0.309,RITA small model (85M params),"<a href='https://arxiv.org/abs/2205.05789'>Hesslow, D., Zanichelli, N., Notin, P., Poli, I., & Marks, D.S. (2022). RITA: a Study on Scaling Up Generative Protein Sequence Models. ArXiv, abs/2205.05789.</a>"
47,Tranception S no retrieval,Protein language model,0.303,0.013,0.288,0.286,0.349,0.319,0.27,0.258,0.295,0.32,0.317,0.263,0.272,0.315,0.277,0.114,0.1,0.132,0.189,0.247,0.305,0.296,Tranception Small model (85M params) without retrieval,"<a href='https://proceedings.mlr.press/v162/notin22a.html'>Notin, P., Dias, M., Frazer, J., Marchena-Hurtado, J., Gomez, A.N., Marks, D.S., & Gal, Y. (2022). Tranception: Protein Fitness Prediction with Autoregressive Transformers and Inference-time Retrieval. ICML.</a>"
48,CARP (38M),Protein language model,0.279,0.016,0.285,0.268,0.312,0.217,0.315,0.196,0.24,0.357,0.321,0.298,0.252,0.125,0.277,0.169,0.103,0.137,0.143,0.232,0.285,0.269,CARP model (38M params),"<a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a>"
49,ProteinMPNN,Inverse folding model,0.258,0.013,0.197,0.163,0.198,0.165,0.566,0.173,0.28,0.434,0.282,0.395,0.354,0.248,0.292,0.257,0.171,0.186,0.278,0.133,0.292,0.375,ProteinMPNN model,"<a href='https://www.science.org/doi/10.1126/science.add2187'>J. Dauparas, I. Anishchenko, N. Bennett, H. Bai, R. J. Ragotte, L. F. Milles, B. I. M. Wicky, A. Courbet, R. J. de Haas, N. Bethel, P. J. Y. Leung, T. F. Huddy, S. Pellock, D. Tischer, F. Chan,B. Koepnick, H. Nguyen, A. Kang, B. Sankaran,A. K. Bera, N. P. King,D. Baker (2022). Robust deep learning-based protein sequence design using ProteinMPNN. Science, Vol 378.</a>"
50,ESM2 (8M),Protein language model,0.226,0.016,0.201,0.26,0.266,0.141,0.262,0.194,0.179,0.264,0.239,0.236,0.213,0.078,0.202,0.136,0.099,0.132,0.195,0.22,0.201,0.218,ESM2 model (8M params),"<a href='https://www.science.org/doi/abs/10.1126/science.ade2574'>Zeming Lin, Halil Akin, Roshan Rao, Brian Hie, Zhongkai Zhu, Wenting Lu, Nikita Smetanin, Robert Verkuil, Ori Kabeli, Yaniv Shmueli, Allan Dos Santos Costa, Maryam Fazel-Zarandi, Tom Sercu, Salvatore Candido, Alexander Rives (2023). Evolutionary-scale prediction of atomic-level protein structure with a language model. Science, Vol. 379.</a>"
51,Wavenet,Alignment-based model,0.217,0.019,0.219,0.186,0.195,0.303,0.182,0.207,0.255,0.207,0.145,0.305,0.293,0.283,0.177,0.059,0.218,0.181,0.258,0.123,0.212,0.267,Wavenet model,"<a href='https://www.nature.com/articles/s41467-021-22732-w'>Shin, J., Riesselman, A.J., Kollasch, A.W., McMahon, C., Simon, E., Sander, C., Manglik, A., Kruse, A.C., & Marks, D.S. (2021). Protein design and variant prediction using autoregressive generative models. Nature Communications, 12.</a>"
52,Unirep,Protein language model,0.19,0.017,0.182,0.202,0.216,0.141,0.21,0.181,0.161,0.209,0.213,0.219,0.165,0.057,0.174,0.071,0.111,0.141,0.191,0.197,0.184,0.176,Unirep model,"<a href='https://www.nature.com/articles/s41592-019-0598-1'>Alley, E.C., Khimulya, G., Biswas, S., AlQuraishi, M., & Church, G.M. (2019). Unified rational protein engineering with sequence-based deep representation learning. Nature Methods, 1-8.</a>"
53,ProtGPT2,Protein language model,0.188,0.012,0.176,0.149,0.193,0.166,0.256,0.175,0.173,0.253,0.242,0.235,0.133,0.141,0.18,0.138,0.041,0.034,0.078,0.16,0.224,0.186,ProtGPT2 model,"<a href='https://www.nature.com/articles/s41467-022-32007-7'>Ferruz, N., Schmidt, S., & Höcker, B. (2022). ProtGPT2 is a deep unsupervised language model for protein design. Nature Communications, 13.</a>"
54,CARP (600K),Protein language model,0.106,0.018,0.112,0.084,0.171,0.059,0.105,0.095,0.087,0.101,0.121,0.082,0.087,0.042,0.107,0.026,0.043,0.083,0.096,0.137,0.083,0.098,CARP model (600K params),"<a href='https://www.biorxiv.org/content/10.1101/2022.05.19.492714v4'>Kevin K. Yang, Nicolo Fusi, Alex X. Lu (2023). Convolutions are competitive with transformers for protein sequence pretraining. BioRxiv.</a>"