# Data Management and Analysis Core (DMAC)

> **NIH NIH P42** · UNIVERSITY OF CALIFORNIA BERKELEY · 2023 · $267,003

## Abstract

CORE C: SUMMARY
The Center investigators seek to understand and remediate potential health risks posed by complex exposure
scenarios present at hazardous waste sites using a systems approach. Several of the proposed projects will use
cutting-edge analytical chemistry, sequencing and other approaches to produce high-dimensional “omic” data
with thousands of parallel measurements on a specific endpoint. These data will be analyzed to identify biological
processes that are perturbed in complex environmental exposure scenarios. The Data Management and
Analysis Core (DMAC) will support the all aspects of the scientific data process: from statistical design, data
management and QA/QC to providing high performance computing platforms, with secure access to Center
data, consulting on data science, biostatistical analysis, to development of new methodology and its
dissemination for project goals. The DMAC will thus support the acquisition, storage, analysis, and sharing of
large, complex datasets through the development of tools, infrastructure and expertise. It will develop data-
driven, machine-learning methods to find patterns in high-dimensional data sets in order to understand biological
perturbations and potential health risks associated with exposures. These efforts include proposals for new
statistical algorithms (and resulting software) on discovering which patterns of chemical mixtures have greatest
potential human health impacts. As these methods require a lot of computing power, Core C leaders will work
with the Berkeley Research Computing group to provide a platform for computation that provides for fast scalable
solutions, as the resulting system will have over 100 CPU’s. This platform will also have direct access through
the integration with Box file management system. Finally, the DMAC will manage access to Center data,
metadata, analysis plans and other supporting material releases using the Open Scientific Framework (osf). In
conclusion, Core C is an integral and critical component of the overall program that supports the whole life history
of Center data.

## Key facts

- **NIH application ID:** 10690465
- **Project number:** 5P42ES004705-35
- **Recipient organization:** UNIVERSITY OF CALIFORNIA BERKELEY
- **Principal Investigator:** Alan E Hubbard
- **Activity code:** P42 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2023
- **Award amount:** $267,003
- **Award type:** 5
- **Project period:** 1997-04-01 → 2027-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10690465

## Citation

> US National Institutes of Health, RePORTER application 10690465, Data Management and Analysis Core (DMAC) (5P42ES004705-35). Retrieved via AI Analytics 2026-05-22 from https://api.ai-analytics.org/grant/nih/10690465. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
