# Analysis of Somatic Mutations in Longitudinal Whole-genome Sequencing Data

> **NIH NIH R00** · UNIVERSITY OF KENTUCKY · 2024 · $249,000

## Abstract

PROJECT SUMMARY
 While recent efforts have focused on better understanding the germline mutation rate of different
types of DNA variants, the dynamics of somatic mutations in the human genome are poorly understood.
Somatic mutations are a well-known cause of cancer and may play a causative role in other diseases and
aging. Analyses of somatic mutations have generally been limited to single-cell or low-coverage studies.
To accurately assess the rate of somatic mutations over time would require a longitudinal, high coverage
dataset. For ~450 individuals in the Utah CEPH (Centre d'Etude du Polymorphism Humain) pedigrees,
blood was drawn at two timepoints approximately 15 years apart. This proposal describes experiments to
determine the rate of somatic mutations throughout the human genome by utilizing high-coverage (720x)
whole-genome sequencing data at both of these timepoints in a subset of CEPH individuals. The genomes
included in this study will be used to determine the rate of somatic mutations and improve our
understanding of differences in this rate between individuals. Building on previous work performed at the
University of Utah, we will determine the relationship between the germline and somatic mutation rate.
Further, using detailed phenotypic data collected for these individuals, experiments will be performed to
determine how this rate is correlated with disease outcomes such as cancer and cardiovascular disease.
From this work, a series of filtering algorithms will be produced to allow for comprehensive analysis of
somatic mutations occurring between two timepoints in the same individual. This dataset provides me with
a unique opportunity to study somatic mutations over time using high-quality data.
This proposal builds upon my experience in analyzing mobile element-mediated somatic mutations
in longitudinal whole-genome data of cancer patients, and my recent work analyzing germline short
tandem repeat mutations in the CEPH pedigrees. The K99 portion of this project will focus on determining
the rate of somatic mutations in longitudinal, high-coverage whole-genome sequencing data while creating
filtering algorithms to improve our detection capabilities. The additional training I will receive during this
period will lay the foundation of an independent research program that will determine the significance of
somatic mutations in the human genome and how these relate to disease outcomes.

## Key facts

- **NIH application ID:** 10875490
- **Project number:** 5R00HG011657-04
- **Recipient organization:** UNIVERSITY OF KENTUCKY
- **Principal Investigator:** Cody Steely
- **Activity code:** R00 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $249,000
- **Award type:** 5
- **Project period:** 2023-07-01 → 2027-01-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10875490

## Citation

> US National Institutes of Health, RePORTER application 10875490, Analysis of Somatic Mutations in Longitudinal Whole-genome Sequencing Data (5R00HG011657-04). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10875490. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
