everyone
since 05 Nov 2024">EveryoneRevisionsBibTeXCC BY-SA 4.0
This proposal outlines a novel approach for protein sequence generation, a key area in computational biology focused on generating sequences with specific functional and structural characteristics. Despite recent advances with deep learning models, challenges remain in handling long sequences and capturing biological dynamics. We propose implementing a Mamba architecture that leverages selective state-space updates to achieve efficient protein sequence generation. With linear computational complexity and effective handling of long-range dependencies, this architecture is designed to be more suited to biological dynamics while reducing resource requirements. A comprehensive evaluation on the UniRef50 dataset will demonstrate its potential to deliver competitive performance, providing a viable solution for research environments with limited computational resources.