# Computational modeling of DNA methylation-mediated gene regulation

> **NIH NIH R01** · WASHINGTON UNIVERSITY · 2022 · $366,897

## Abstract

Abstract
Large numbers of complete methylomes are being acquired through clinical sequencing projects, such as
through The Cancer Genome Atlas, Blueprint Epigenome Project, and International Cancer Genome
Consortium. Furthermore, third-generation nanopore sequencers, which detect DNA methylation and genetic
variation in a single experiment, are nearly ready for routine clinical sequencing and will provide complete
methylomes for all patients where whole-genome sequencing is indicated. Current analysis tools however only
perform preliminary methylome processing and catalogue differentially methylated regions (DMRs). In order to
transform methylome analysis into a clinically useful diagnostic/prognostic test, we need to develop predictive
tools to interpret the functional and pathological consequences of identified methylation changes. Towards this
goal, we have published a series of papers demonstrating that machine-learning based models utilizing high-
resolution signatures of all methylation changes around a promoter vastly outperform conventional DMR
methods. Our models accurately predict expression states at genes potentially regulated by methylation and
reveal predictive methylation signatures that facilitate mechanistic interpretation. Nonetheless, several
challenges remain before we can achieve our goals of translating genome-wide methylation data for routine
clinical use: (1) To our knowledge, no current models integrate distal enhancers, whose activation is affected by
DNA methylation. Such integrative analysis is necessary to understand consequences of methylation changes
in cancers, whose genomes frequently undergo wide-spread methylation changes. In addition, such modelling
will be essential to understand the role of 5-hydroxymethylcytosine (5hmC), which may play both repressive and
activating roles in neurons depending on whether it is found at promoters or enhancers. (2) Our current models
(and conventional approaches) represent methylation data independent of DNA sequence despite mechanistic
studies demonstrating that methylation changes can have different functional effects depending on which
sequences change and depending on the context of the local regulatory grammar. In this proposal, we will meet
these challenges by first developing a predictive model that incorporates 5-methylcytosine and 5hmC at
promoters and enhancers to determine how these marks act in concert. In particular, we will examine the
hypothesized dual role of 5hmC as a repressor at promoters and as an activator at enhancers in cortical neurons.
We will then use new advances in natural language processing to model DNA sequence and methylation to
predict expression states. Our results will reveal which regulatory elements and transcription factors binding sites
are affected by DNA methylation and how changes at different sites collaborate to affect expression changes.
We will experimentally validate our in silico predictions using a combination of reporter assays ...

## Key facts

- **NIH application ID:** 10405488
- **Project number:** 5R01LM013096-04
- **Recipient organization:** WASHINGTON UNIVERSITY
- **Principal Investigator:** John R. Edwards
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2022
- **Award amount:** $366,897
- **Award type:** 5
- **Project period:** 2019-09-16 → 2024-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10405488

## Citation

> US National Institutes of Health, RePORTER application 10405488, Computational modeling of DNA methylation-mediated gene regulation (5R01LM013096-04). Retrieved via AI Analytics 2026-05-22 from https://api.ai-analytics.org/grant/nih/10405488. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
