# Leveraging biological pathways, gene networks, and functional annotations to understand the genetic architecture of diseases and complex traits

> **NIH NIH F31** · MASSACHUSETTS INSTITUTE OF TECHNOLOGY · 2021 · $23,429

## Abstract

Project Summary
 Over the past decade, data generated from genome-wide association studies (GWAS) have been used to
analyze heritability of human diseases and complex traits to improve understanding of genetic architecture,
including identifying the genetic variants associated with diseases and their effect sizes. These heritability
enrichment analyses have revealed underlying biological mechanisms such as regulatory elements involved in
disease pathways. This research proposes statistical methods and analysis to formally assess the contribution
of biological pathways, gene networks, and re-prioritized functional annotations to the genetic architecture of
94 human diseases and complex traits. Better understanding of disease architecture will improve genome-wide
associations and fine mapping and provides robust inference of disease etiology and mechanisms.
 The first aim is to determine the contribution of disease-associated gene pathways to disease heritability. It
is well known that an abnormality in individual genes rarely causes a human disease. Large-scale pathway
analysis will elucidate disease pathways by constructing functional annotations and applying an existing
polygenic method to partition heritability. Furthermore, pathway analyses on candidate genes identified via
whole-exome sequencing (WES) data will inform new biological inference. The resulting set of enriched
pathway-trait pairs will further help with independent validation of previously reported associations and for
novel associations enriched for trait heritability.
 The second aim is to determine the contribution of gene network to disease heritability across 94 diseases,
focusing on hub genes defined by a broad set of network connectivity metrics. The contribution of gene
network to disease heritability has been only partially answered, limited to few network connectivity metrics
across 42 diseases. Comprehensive analysis of gene sets informed by published gene networks will inform a
new direction of integrating gene networks to infer biological mechanisms of human diseases. In addition,
visualization method for large-scale gene networks will greatly enhance research translation, serving as a
useful resource for studying gene-gene interactions for potential drugs and therapeutic targets.
 The third aim is to de-noise and re-prioritize Mendelian annotations to infer candidate genes. Despite
considerable process on prioritizing rare variants impacting Mendelian diseases, little is known about the utility
of these Mendelian pathogenicity scores for common disease. I will conduct heritability enrichment analyses to
answer informativeness of Mendelian disease pathogenicity scores for common disease. To further improve
their informativeness, I propose a machine learning variant reprioritization framework. Ultimately, this project
will produce (1) a detailed understanding of disease-associated biological pathways to disease heritability, (2)
a set of functional annotations informed...

## Key facts

- **NIH application ID:** 10126720
- **Project number:** 5F31HG010818-02
- **Recipient organization:** MASSACHUSETTS INSTITUTE OF TECHNOLOGY
- **Principal Investigator:** Samuel Sungil Kim
- **Activity code:** F31 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2021
- **Award amount:** $23,429
- **Award type:** 5
- **Project period:** 2020-03-02 → 2021-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10126720

## Citation

> US National Institutes of Health, RePORTER application 10126720, Leveraging biological pathways, gene networks, and functional annotations to understand the genetic architecture of diseases and complex traits (5F31HG010818-02). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10126720. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
