Genomic Data Classification via Universal Compression

Yasmine Omri, Naomi Sagan, Eugene Min, Heewoong Choi, Taesup Moon, Tsachy Weissman

Published: 09 Apr 2025, Last Modified: 10 Nov 2025 | Crossref | CC BY-SA 4.0
Abstract: Efficient and accurate DNA sequence classification is a crucial task in genomic data analysis. In this work, we construct a lightweight DNA classifier based on the LZ78 lossless universal compressor, and optimize its performance through hyperparameter tuning. This classifier outperforms the state...