# SoS:B10:ldentifying and Encouraging Connections among Data Reuse, Scientific Innovation, and Scientific Careers

> **NIH NIH R01** · UNIVERSITY OF MICHIGAN AT ANN ARBOR · 2024 · $244,581

## Abstract

PROJECT SUMMARY {See instructions): 
This project has two main goals: measuring the impact of research data reuse on diversity and novelty, 
and identifying synergistic data-method-researcher combinations to spur scientific discovery. 
Access to scientific data is critical for advancing research quality and efficiency. Yet, bibliometric studies 
have uncovered biases in publication impact and citation patterns, raising concerns that such disparities 
might affect how researchers reuse data. Developing robust metrics to identify and rectify these 
imbalances is the focus of this work. 
The project will construct networks and graph databases that connect research objects such as 
publications, analysis code, datasets, and variables. These networks will provide insight into datasets' 
influence on research diversity, author interactions, and code reuse for scientific advancements. Focusing 
on biomedical research allows for a varied examination of the types of data, outputs, and potential 
high-impact findings, like new therapeutics or research methods. 
Initially, the project will create knowledge graphs from ICPSR and PhysioNet datasets, supplemented with 
author and code metadata from databases like Dimensions and OpenAlex. Subsequent phases will 
assess the impact of data through diversity and novelty metrics, refined in consultation with stakeholders. 
Finally, the project will recommend strategic partnerships to promote equitable data use and 
groundbreaking reuse applications.

## Key facts

- **NIH application ID:** 11115891
- **Project number:** 1R01GM158694-01
- **Recipient organization:** UNIVERSITY OF MICHIGAN AT ANN ARBOR
- **Principal Investigator:** Libby Hemphill
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $244,581
- **Award type:** 1
- **Project period:** 2024-09-16 → 2026-08-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/11115891

## Citation

> US National Institutes of Health, RePORTER application 11115891, SoS:B10:ldentifying and Encouraging Connections among Data Reuse, Scientific Innovation, and Scientific Careers (1R01GM158694-01). Retrieved via AI Analytics 2026-05-22 from https://api.ai-analytics.org/grant/nih/11115891. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
