# Data Science and Statistics Core

> **NIH P01** · UNIVERSITY OF CALIFORNIA-IRVINE · 2023 · $445,034

## Abstract

The Data Sciences and Statistics Core (DSSC) will be a multi-site core that integrates experts in data
management, biostatistics, epidemiology and pathogen genomics. First, the DSSC will serve as an inventory
hub for all data associated with this program, including sample inventory, epidemiologic data, and microbial
genomic sequence data. With complex epidemiologic data and terabytes of genomic sequence data from over
16,000 isolates and nearly 3,000 plate metagenomes, a detailed inventory is essential to ensure organized
access. Second, the DSSC will be a data sharing hub across all Projects. The DSSC will be responsible for
facilitating secure and frictionless data transfer between Projects and for maintaining consistent nomenclature
across data types, along with detailed data dictionaries. This resource will be essential to enable synergistic
collaboration between researchers across multiple institutions and projects, working with large, complex
datasets. Third, the DSSC will serve as an innovation hub for integrating biostatistics traditionally used for
epidemiologic studies with computational biology for genomic analysis. Researchers with expertise in statistical
modeling, epidemiology and bioinformatics will work closely to develop novel methodology for the
simultaneous analysis of complex epidemiological and genomic data. The DSSC will provide data science and
biostatistical expertise across all projects. This will enhance the synergy among the projects by unifying data
and statistical consultation. In convening an integrated core with expertise in biostatistics, computational
biology, epidemiology, and genomics, we will form a unique and powerful platform for developing novel
approaches and provide the clearest view of multidrug-resistant organism (MDRO) carriage and transmission
within nursing homes (NHs) to date.

## Key facts

- **NIH application ID:** 10549489
- **Project number:** 1P01AI172725-01
- **Recipient organization:** UNIVERSITY OF CALIFORNIA-IRVINE
- **Principal Investigator:** Colin Worby
- **Activity code:** P01
- **Funding institute:** NIH
- **Fiscal year:** 2023
- **Award amount:** $445,034
- **Award type:** 1
- **Project period:** 2023-07-11 → 2028-04-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10549489

## Citation

> US National Institutes of Health, RePORTER application 10549489, Data Science and Statistics Core (1P01AI172725-01). Retrieved via AI Analytics 2026-05-22 from https://api.ai-analytics.org/grant/nih/10549489. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
