Hygeia: A new framework for data-driven discovery of DNA methylation patterns

ICLR 2025 Workshop LMRL Submission81 Authors

13 Feb 2025 (modified: 18 Apr 2025)Submitted to ICLR 2025 Workshop LMRLEveryoneRevisionsBibTeXCC BY 4.0
Track: Tiny Paper Track
Keywords: epigenetics, DNA methylation, Bayesian modelling, change-point model
TL;DR: We introduce flexible Bayesian change-point models for DNA methylation and associated inference algorithms that yield a detailed probabilistic description of methylation signatures.
Abstract: Epigenetic variability is an essential modulator of phenotypic plasticity. To better understand complex epigenetic signals, we introduce Hygeia – a new framework for discovering DNA methylation patterns in whole-genome bisulfite sequencing (WGBS) data. Hygeia utilises a Bayesian statistical model, designed to match empirically observed methylation patterns. The model selects a regime for the methylation propensity at each cytosine-guanine dinucleotide (CpG) site, with regime changes permitted at any position. Thus, conventional means-based methods are replaced by probability-based MEthylaTion changE point Regimes (METEORs). Hygeia fits the model to WGBS data to produce METEOR annotation at the CpG level. We applied Hygeia to WGBS EpiATLAS data (N=445) from the International Human Epigenome Consortium (IHEC) to enrich the EpiATLAS resource with METEOR annotation. Hygeia is packaged as a Nextflow pipeline available on GitHub.
Submission Number: 81
Loading

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview