Using a hybrid neural network architecture for DNA sequence representation: A study on N4-methylcytosine sites

Published: 01 Jan 2024, Last Modified: 28 Sept 2024Comput. Biol. Medicine 2024EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•Our predictive models surpass existing approaches in identifying 4mC sites in Fragaria vesca, achieving superior accuracy and sensitivity.•Advanced word embedding techniques like fastText and variable k-mers capture key patterns crucial for accurate 4mC site prediction.•Our models uncover distinct sequence motifs at 4mC sites, highlighting key epigenetic mechanisms in gene regulation and adaptation.•These models support crop breeding strategies, environmental adaptation studies, and disease research by precisely identifying epigenetic traits.
Loading