# Harnessing the Electronic Health Record to Predict Risk of Cardiovascular Disease

> **NIH NIH R01** · MAYO CLINIC ROCHESTER · 2021 · $794,994

## Abstract

PROJECT SUMMARY / ABSTRACT
Cardiovascular disease (CVD) is the single largest killer in the United States for both men and women in every
racial/ethnic group. Thus, accurate and systematic evaluation of CVD risk represents an aspect of Precision
Medicine that will touch every patient. CVD risk scores that are currently the standard of care are derived from
research cohorts and are particularly inaccurate in women, older patients, and those with missing data. The
goal of this Precision Medicine based application is to capitalize on the depth and breadth of clinical data within
electronic health record (EHR) systems to revolutionize CVD risk prediction, thereby optimizing personalized
care for every patient. Our proposed approach is innovative in that we have identified and addressed the most
significant barriers to development of an EHR-based risk score. Novel aspects of this research include: 1) use
of complete EHR data to develop and validate algorithms to define a variety of risk factors (e.g., reproductive
history), thus building a comprehensive risk profile for each patient that incorporates diagnosis and procedure
codes, laboratory values, clinical test results, patient provided information (e.g., alcohol use), and natural
language processing of unstructured clinical text; 2) incorporation of age at onset of risk factors; 3) use of
highly flexible machine learning techniques in the form of generalized boosted regression modeling; 4)
exploration of a new deep learning model for censored EHR data; and 5) determination of the extent of risk
reclassification in multiple geographically-defined populations, including an underserved minority population.
Furthermore, genetic studies demonstrate that incorporating variants into current risk models improves risk
prediction and use of an individual's genetic risk could further enhance our ability to deliver precision medicine
to every patient. Therefore, we seek to develop a sex-specific next-generation CVD risk prediction score using
EHR data in combination with genetic variants. This paradigm is a significant departure from the current one
that relies on scores derived from relatively small research cohorts that use only a restricted set of clinical
parameters that differentially misclassify an individual's risk, especially in women. Our access to empirical
clinical EHR data for hundreds of thousands of patients uniquely positions us to 1) develop a sex-specific risk
prediction model for incident CVD using data from the EHR; 2) assess the performance of the sex-specific
EHR risk score in an independent non-urban and rural population; and 3) identify and characterize patients for
whom genetic information improves CVD prediction beyond the clinical risk score. Successful completion of
these aims has the potential to impact all adult patients, drive clinical practice changes to systematically collect
sex-specific risk factors, and inform attempts to embed the next-generation CVD risk score into EHR syst...

## Key facts

- **NIH application ID:** 10063011
- **Project number:** 5R01HL136659-04
- **Recipient organization:** MAYO CLINIC ROCHESTER
- **Principal Investigator:** Suzette Janine Bielinski
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2021
- **Award amount:** $794,994
- **Award type:** 5
- **Project period:** 2017-12-15 → 2022-11-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10063011

## Citation

> US National Institutes of Health, RePORTER application 10063011, Harnessing the Electronic Health Record to Predict Risk of Cardiovascular Disease (5R01HL136659-04). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10063011. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
