# Computational approaches to characterize heterogeneity and improve risk stratification in complex disease phenotypes

> **NIH NIH R00** · UNIVERSITY OF COLORADO DENVER · 2024 · $249,000

## Abstract

PROJECT SUMMARY/ABSTRACT
Recent technological breakthroughs have enabled the generation of clinical, environmental, and multi-omics data
at an unprecedented scale, providing a complete proﬁle of the patient for individualized disease diagnosis, prog-
nosis, and treatment. However, the precision medicine approach is yet to realize its potential in most multi-factorial
diseases, for which their highly polygenic nature, as well as phenotypic and genetic heterogeneity, complicate the
identiﬁcation of disease-associated cell type-speciﬁc transcriptional mechanisms. A better characterization of this
heterogeneity and an interpretable prediction of individuals at high risk of disease are crucial steps to deliver the
promises of precision medicine. In this context, polygenic risk scores (PRS) are likely to play a crucial role in
precision medicine for disease-risk prediction. However, it has been argued that PRS might accentuate dispari-
ties among non-European ancestries and have low stability at individual-level predictions, probably due to greater
underlying complexity in disease etiology that is not captured in a single score. Current efforts to mitigate health
disparities involve recruiting individuals from different population ancestries. However, if the underlying biological
complexity of disease etiology remains unaccounted, risk stratiﬁcation methods will continue to be limited.
The goal of this project is to develop machine learning methods to advance key computational aspects of precision
medicine. In the ﬁrst aim, an unsupervised method will be applied across large amounts of genetic studies to
detect gene sets associated with multiple human traits, which will also identify environmental risk factors. In the
second aim, new computational approaches will be developed to learn gene co-expression patterns optimized for
a better understanding of transcriptional mechanisms linked to complex traits and their therapeutical modalities.
This will detect gene modules (i.e., genes with similar expression proﬁles across the same cell types) with complex
gene relationships, and the approach will be validated by predicting known FDA-approved drug-disease links.
Finally, the outcomes of these aims will inform a gene module-based polygenic risk score for accurate and robust
disease-risk stratiﬁcation that will be portable across different population ancestries. Although the methods will
be initially applied to asthma, they are clearly extendable to other common diseases as well.
For the K99 phase of this project, the mentorship team's expertise covers all key areas of precision medicine,
including computational genetics, systems biology, environmental exposure studies, pharmacology, and trans-
lational medicine. Mentors and advisors are directly involved in precision medicine initiatives to enhance both
scientiﬁc discovery and its implementation in clinical care. For the R00 phase and beyond, all the conceptual
and methodological expertise previously learned...

## Key facts

- **NIH application ID:** 10840941
- **Project number:** 5R00HG011898-03
- **Recipient organization:** UNIVERSITY OF COLORADO DENVER
- **Principal Investigator:** Milton Pividori
- **Activity code:** R00 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $249,000
- **Award type:** 5
- **Project period:** 2023-05-12 → 2026-04-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10840941

## Citation

> US National Institutes of Health, RePORTER application 10840941, Computational approaches to characterize heterogeneity and improve risk stratification in complex disease phenotypes (5R00HG011898-03). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10840941. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
