# Statistical Models of Suicidal Behavior and Brain Biology Using Large Data Sets

> **NIH NIH P50** · NEW YORK STATE PSYCHIATRIC INSTITUTE DBA RESEARCH FOUNDATION FOR MENTAL HYGIENE, INC · 2020 · $148,773

## Abstract

SUMMARY-PROJECT 6
The primary objective of the Conte Center is to better understand the causes of suicidal behavior and,
ultimately, to translate that knowledge into tools that have the potential for detecting patients at higher risk for
more lethal suicide attempts. Towards that aim, a great deal of valuable data is being collected. This project is
primarily concerned with very high-dimensional data, including brain imaging data, DNA genetic and epigenetic
data, and gene expression data. In particular, we are interested in using such data as predictors in statistical
models. Our ultimate goal is to build and implement models that can be used to distinguish depressed suicide
attempters from other depressed patients.
The greatest hurdle is dealing with very high dimensional data, a situation that classical statistical modeling
techniques are not equipped to handle. Our approach to modeling with such high-dimensional data seeks to
achieve three primary goals: (1) to provide good predictive accuracy; (2) to build interpretable models that can
give meaningful insight into the relationship between the various predictors and the outcome variable; (3) to
ensure stability of the models through validation studies. To accomplish these goals our approach will
incorporate a regularization component in the model selection and fitting process by applying a penalty to
model complexity. In applications, the amount of penalization will be determined by cross-validation or another
related technique. In both model development and application, we will consider a wide range of outcome
variables including continuous or nearly continuous (e.g., severity of depression, aggressive traits), ordinal
(e.g., lethality of suicide attempts), categorical (e.g., diagnosis), and binary (yes/no, e.g., suicide attempt), as
well as time-to-event data (e.g., time to a suicide attempt), which will draw on survival analysis techniques.
This project focuses specifically on situations in which the number of possible predictors is in the tens of
thousands or more. Examples of existing data and data to be gathered as part of the Conte Center include
brain imaging data (positron emission tomography (PET) images of the serotonin system, diffusion tensor
imaging (DTI) scans, functional magnetic resonance imaging (fMRI) scans, magnetic resonance spectroscopy
(MRS) scans), genome-wide DNA genotyping and methylation data, and gene expression data.

## Key facts

- **NIH application ID:** 9959478
- **Project number:** 5P50MH090964-08
- **Recipient organization:** NEW YORK STATE PSYCHIATRIC INSTITUTE DBA RESEARCH FOUNDATION FOR MENTAL HYGIENE, INC
- **Principal Investigator:** TODD OGDEN
- **Activity code:** P50 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $148,773
- **Award type:** 5
- **Project period:** 2013-07-19 → 2023-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/9959478

## Citation

> US National Institutes of Health, RePORTER application 9959478, Statistical Models of Suicidal Behavior and Brain Biology Using Large Data Sets (5P50MH090964-08). Retrieved via AI Analytics 2026-05-25 from https://api.ai-analytics.org/grant/nih/9959478. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
