# Digital Phenotyping and Cardiovascular Health

> **NIH NIH R01** · UNIVERSITY OF PENNSYLVANIA · 2020 · $767,681

## Abstract

Digital data from social media, online searches, and smartphones can reveal a detailed narrative about an
individual's day-to-day activities. Information about lifestyle and health behaviors (e.g. exercise habits, food
consumption, smoking status) are often revealed with significant detail through these electronic platforms. Of
importance, many commonly shared health behaviors may be associated with cardiovascular disease,
treatment, and management. Digital phenotypes derived from these electronically mediated data sources can
shape our assessment of human illness and have substantial value beyond our traditional approaches to
characterizing a disease phenotype (e.g. physical exam, laboratory values), and ultimately expand our ability to
identify and diagnose health conditions and predict healthcare utilization. Central to this proposal is the
recognition that person-to-person communication and online activities that were previously private are now
observable. It is the observability of these new communication channels that provides both innovation and
promise to this area of inquiry. Our first aim will entail consenting patients to share access to their digital data
(e.g. social, search, and mobile data) and merge this information with validated health record data in a
research database. We will then extensively process the digital data so that it is in an interpretable format that
can be incorporated in traditional predictive models. Aim 2 will focus on assessing the incremental benefit of
adding digital data to the Framingham risk score to evaluate the contribution of digital data for predicting
cardiovascular risk. In the future, this data could inform patients about their personalized risk and ways to
concretely change that risk. The digital platforms used to post or share data could also be used to directly
provide feedback to patients on the medium they use and in direct response to their stated inputs.
The third aim will focus on incorporating digital data in models to predict cost of care. This approach offers
promise for better understanding the factors contributing to healthcare utilization, which correlate with
morbidity, mortality, and the economic burden of cardiovascular disease. Through this project we seek to learn
new insights about collecting and analyzing digital data while being attentive to issues of ethics and privacy
that may be associated with these data. We will incorporate digital data in models to predict important targets
like coronary heart disease risk and healthcare use. Overall, the areas of focus for this grant represent new
frontiers in precision medicine and digital phenotyping for cardiovascular health.

## Key facts

- **NIH application ID:** 9984899
- **Project number:** 5R01HL141844-02
- **Recipient organization:** UNIVERSITY OF PENNSYLVANIA
- **Principal Investigator:** Raina Merchant
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $767,681
- **Award type:** 5
- **Project period:** 2019-08-01 → 2023-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/9984899

## Citation

> US National Institutes of Health, RePORTER application 9984899, Digital Phenotyping and Cardiovascular Health (5R01HL141844-02). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/9984899. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
