# Statistical Methods for Integrative Analysis of Large Scale Neuroimaging Data

> **NIH NIH R01** · UNIV OF NORTH CAROLINA CHAPEL HILL · 2022 · $369,209

## Abstract

Abstract
Integrative analysis methods are in great needs as multimodal multi-cohort neuroimaging data rapidly emerge in
neuro science. In Alzheimer's Disease (AD) studies, many research relies on multimodal neuroimaging data to
identify key image biomarkers for the early diagnosis of AD. Despite great endeavors in data collection, there still
lacks rigorous statistical methods and efﬁcient computational tools to properly integrate big neuroimaging data in
a statistical model and carry out inference to address practical problems. Important problems such as missing
data and adjustment for between-subject heterogeneity still remain unsolved. In this proposal, we propose to
build two integrative models, one handles multimodal data and the other handles longitudinal multi-cohort data.
They will be built under a generic M-estimation framework that covers many widely used statistical models as its
special cases. We will provide various inference tools for these models and develop efﬁcient algorithms to solve
the M-estimation problem in presence of block missing values. In Aim 1, we propose a factor-adjusted integrative
model for multimodal data and provide a complete set of inference tools. These tools can test the signiﬁcance of
one whole data modality as well as the signiﬁcance of multiple linear combinations of predictors from one or more
modalities. In Aim 2, we provide a powerful computational tool to handle block missing values of multimodal data.
Such a tool does not need to perform ad-hoc imputation on missing values, but rather relies on an innovative mini-
batch gradient descent algorithm to yield a good estimator. In Aim 3, we will develop an interactive factor model
to jointly model longitudinal data coming from multiple cohorts. We show that such a model includes the standard
random effects model as a special case and is more ﬂexible modeling the longitudinal data and accounting for the
between-subject heterogeneity. The proposed research will likely transform how we analyze neuroimaging data
and enhance our understanding of Alzheimer's Disease and its relation to public health.

## Key facts

- **NIH application ID:** 10470397
- **Project number:** 5R01AG073259-02
- **Recipient organization:** UNIV OF NORTH CAROLINA CHAPEL HILL
- **Principal Investigator:** Quefeng Li
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2022
- **Award amount:** $369,209
- **Award type:** 5
- **Project period:** 2021-09-01 → 2026-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10470397

## Citation

> US National Institutes of Health, RePORTER application 10470397, Statistical Methods for Integrative Analysis of Large Scale Neuroimaging Data (5R01AG073259-02). Retrieved via AI Analytics 2026-05-21 from https://api.ai-analytics.org/grant/nih/10470397. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
