Learning multi-scale functional representations of proteins from single-cell microscopy dataDownload PDF

Published: 05 Apr 2022, Last Modified: 05 May 2023MLDD PosterReaders: Everyone
Keywords: protein, representation learning, microscopy, computer vision, subcellular, localization
TL;DR: We show that models can learn protein representations that encapsulate universal functional information through subcellular localizaition classification.
Abstract: Protein function is inherently linked to its localization within the cell, and fluorescent microscopy data is an indispensable resource for learning representations of proteins. Despite major developments in molecular representation learning, extracting functional information from biological images remains a non-trivial computational task. Current state-of-the-art approaches use autoencoder models to learn high-quality features by reconstructing images. However, such methods are prone to capturing noise and imaging artifacts. In this work, we revisit deep learning models used for classifying major subcellular localizations, and evaluate representations extracted from their final layers. We show that simple convolutional networks trained on localization classification can learn protein representations that encapsulate diverse functional information, and significantly outperform autoencoder-based models. We also propose a robust evaluation strategy to assess quality of protein representations across different scales of biological function.
0 Replies