Visualizing and Annotating Protein Sequences using A Deep Neural Network

Published: 2020, Last Modified: 19 Feb 2025ACSSC 2020EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: It is critical for biological studies to annotate amino acid sequences and understand how proteins function. Protein function is important to medical research in the health industry (e.g., drug discovery). With the advancement of deep learning, accurate protein annotation models have been developed for alignment free protein annotation. In this paper, we develop a deep learning model with an attention mechanism that can predict Gene Ontology labels given a protein sequence input. We believe this model can produce accurate predictions as well as maintain good interpretability. We further show how the model can be interpreted by examining and visualizing the intermediate layer output in our deep neural network.
Loading