# Data Core

> **NIH U2C** · NORTHWESTERN UNIVERSITY · 2023 · $713,601

## Abstract

Data and Statistical Analysis Core Summary
The Data and Statistical Analysis Core (Data Core) will provide the data management and analytic support
tools needed throughout the Mobile Toolbox 2 initiative. We will provide a standard data processing pipeline and
centralized data storage center that documents the original data, as well as data provenance for derived scores,
metadata, and paradata regarding measure completion. These processes are necessary to ensure the
reproducibility of results arising from the Mobile Toolbox, as well as the reliability and validity of the measures.
The Data Core will support the Project by providing statistical and psychometric expertise in the development of
new measures and analysis of longitudinal observational studies. Further, the centralized repository developed
and maintained by the Data Core will be a key component of the Extension and Dissemination Core in their
efforts to disseminate data for secondary research purposes. The Mobile Toolbox system and data management
infrastructure were built as an extension of the Sage Synapse Data Platform, a cloud-based data storage and
organization system developed to facilitate sharing data, data analysis, and computational methods distributed
across research teams. The Data Core will be integral in serving the needs of Mobile Toolbox 2. The primary
aims of the core include: Aim 1: Conduct statistical analysis and methodological consultation services for
the Mobile Toolbox; Aim 2: Extend and maintain a central repository for data aggregation, curation, and
dissemination; and Aim 3: Create a learning system to assist in the deployment, design, and execution
of remote studies. The Data Core will manage and conduct analyses needed for psychometric validation and
norming of any new measures identified for inclusion in the Mobile Toolbox assessment library through the
scientific community-engaged approach utilized within the Project. We will develop new normative values and
conduct multi-pronged validation analyses as warranted within diverse and representative samples. This work
will be performed by the team of statisticians and psychometricians responsible for norming and validating the
NIH Toolbox®, Patient Reported Outcome Measurement Information System®, and International Cognitive Ability
Resource. The Data Core will ensure that any new measures can be included within the automated data
processing pipeline, including ensuring that the measures are well-defined using JSON schemas so that the data
are self-describing. This allows the data to be seamlessly integrated into the centralized data repository. Further,
this repository allows data aggregation across studies, supporting novel and innovative “big data” approaches.
The Data Core will enable learning systems to aggregate data and implement unsupervised and semi-supervised
machine learning approaches across studies.
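
The abstract's point about JSON schemas making data self-describing can be illustrated with a minimal sketch. Everything below is an assumption for illustration: the schema, the field names (`measure_id`, `participant_id`, `score`), and the `validate_record` helper are hypothetical and are not taken from the Mobile Toolbox pipeline. The helper handles only a small subset of JSON Schema (`type`, `required`, `properties`); a production pipeline would more plausibly rely on a full validator.

```python
import json

# Hypothetical schema for one assessment record. The field names are
# illustrative only and do not come from the Mobile Toolbox itself.
MEASURE_SCHEMA = {
    "type": "object",
    "required": ["measure_id", "participant_id", "score"],
    "properties": {
        "measure_id": {"type": "string"},
        "participant_id": {"type": "string"},
        "score": {"type": "number"},
    },
}

# Map the JSON Schema type names used above to Python types.
_JSON_TYPES = {"string": str, "number": (int, float), "object": dict}


def validate_record(record, schema):
    """Return a list of problems; an empty list means the record conforms.

    Only `type`, `required`, and `properties` are checked, which is a
    small subset of what JSON Schema can express.
    """
    if not isinstance(record, _JSON_TYPES[schema["type"]]):
        return [f"expected {schema['type']}"]
    errors = []
    for key in schema.get("required", []):
        if key not in record:
            errors.append(f"missing required field: {key}")
    for key, rule in schema.get("properties", {}).items():
        if key in record:
            value = record[key]
            expected = _JSON_TYPES[rule["type"]]
            # bool is a subclass of int in Python, so reject it explicitly.
            if isinstance(value, bool) or not isinstance(value, expected):
                errors.append(f"{key}: expected {rule['type']}")
    return errors


if __name__ == "__main__":
    raw = '{"measure_id": "example_measure", "participant_id": "p001", "score": 12.5}'
    print(validate_record(json.loads(raw), MEASURE_SCHEMA))
```

Because the schema travels with the data, a repository could apply the same kind of check to records from any study before aggregating them, without study-specific parsing code.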

## Key facts

- **NIH application ID:** 10729443
- **Project number:** 2U2CAG060426-06
- **Recipient organization:** NORTHWESTERN UNIVERSITY
- **Principal Investigator:** Aaron James Kaat
- **Activity code:** U2C
- **Funding institute:** NIH
- **Fiscal year:** 2023
- **Award amount:** $713,601
- **Award type:** 2
- **Project period:** 2018-09-15 → 2028-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10729443

## Citation

> US National Institutes of Health, RePORTER application 10729443, Data Core (2U2CAG060426-06). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10729443. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
