Data Repository and Management Core

NIH RePORTER · NIH · U2C · $882,461 · view on reporter.nih.gov ↗

Abstract

DATA REPOSITORY AND MANAGEMENT CORE PROJECT SUMMARY The Data Repository and Management Core (DRMC) will advance scientific understanding of environmental exposures by expanding our successful CHEAR Data Center (DC) data portal and repository to encompass humans at all life stages. Our proposed HHEAR DC data portal and repository, along with our effective user support team, will help researchers add comprehensive exposure analysis to their studies. Our demonstrated commitment to FAIR principles and our unique harmonized data sets will enhance research productivity and general public understanding by (1) providing access to cleaned and harmonized data and larger data sets for greater statistical power and maximal reuse; (2) interoperating with relevant national data sets such as the Metabolomics Workbench, ECHO, and others to facilitate data submission of new data types; and (3) facilitating access to modern collaborative data analysis tools such as Jupyter notebooks and Google's Colaboratory. We will employ best practices for high reliability and security and follow HIPAA guidelines. To develop effective and focused infrastructure, services, and processes tailored for the CHEAR community, our interdisciplinary team developed strong partnerships with the Coordinating Center (CC), the Lab Network, the ECHO DC, the Metabolomics Workbench, and others. These relationships, along with our existing CHEAR infrastructure, singular expertise, and established processes, will carry over to the creation of the HHEAR DC and will help accelerate its rollout. These services will give the HHEAR community the ability to discover new correlations and relationships between multi-scale and multimodal data sets, thus progressing toward the promise of big data to help solve the major challenges of human environmental health research across the lifecourse. Combining data from a set of individual studies would likely require substantial work if it were attempted without resources similar to those of the HHEAR DMRC; the design of the data repository will facilitate manageable and efficient combining of existing data sets; the availability of common vocabularies developed by the DSR will contribute to maximizing the usable data from each study; and the SSAR will ultimately receive a dataset to which they can apply their exposome-related analytic methods to address hypotheses on the environmental health of the pooled study population. Our state-of-the-art DRMC has been fulfilling these roles within the CHEAR program, and will build on and extend our capabilities as the HHEAR DC. In sum, leveraging our existing infrastructure and expertise will overcome the need for a long implementation process fraught with challenges — we have already encountered and overcome many such challenges in implementing the CHEAR DC, and will be able to flexibly respond to the needs of HHEAR Network.

Key facts

NIH application ID
10173773
Project number
5U2CES026555-04
Recipient
ICAHN SCHOOL OF MEDICINE AT MOUNT SINAI
Principal Investigator
Patricia Kovatch
Activity code
U2C
Funding institute
NIH
Fiscal year
2021
Award amount
$882,461
Award type
5
Project period
2015-09-30 → 2024-05-31