# UCSC-Buck Genome Data Analysis Center for the Genomic Data Analysis Network v2.0

> **NIH NIH U24** · UNIVERSITY OF CALIFORNIA SANTA CRUZ · 2024 · $366,578

## Abstract

ABSTRACT
Tumor heterogeneity -- the complex mix of tumor subclones, the cell-of-origin that first became transformed,
the evolution of tumor subclones under selective pressures of the body and due to treatment, and the interplay
of these cells with the tumor microenvironment (TME) -- contributes to the character, behavior, and mystery of
tumors and is a key determinant of cancer progression and a patient’s response to therapy. Large-scale
genomics projects like the Cancer Genome Atlas (TCGA) and the Genome Data Analysis Network (GDAN)
have revealed important characteristics and patterns from a multi-omics overview of various tumor types.
However, it remains a mystery on how to maximize the use of these data to choose the best course of
treatment for an individual patient. The proposed GDAN will close this gap in knowledge by collecting clinical
information and outcomes endpoints alongside the multiple omics platforms that will provide key linkages upon
which to train supervised computational approaches. We propose to contribute our key competencies of
pathway analysis, integrative machine-learning, mRNA-seq analysis, assessment of driving somatic mutations,
and visualization of high-throughput datasets to serve the future GDAN analysis working groups (AWGs) to
achieve these goals. We will collect and share widely a database of gene expression signatures that capture
cell state information gleaned from the large collection of single-cell mRNA sequencing data such as from the
Human Cell Atlas (Aim 1). In addition, we will contribute our existing, and novel extensions to,
machine-learning approaches like AKIMATE to maximally use these signatures and others in combination with
AWG-approved omics datasets as features to train accurate predictors of response for the GDAN’s studies like
ALCHEMIST (Aim 2). Our proposal will adapt the TumorMap to benefit weekly analysis and bolster the
exploration and publication of results. Specifically, we will work with the group to create new maps that show
the TME and TIC comparisons of the patient samples separately to help elucidate new important subtypes
implied by the collected data (Aim 3). As we have done for the past twelve years for TCGA and the GDAN, we
propose to continue working closely with the consortium in these endeavors to significantly enrich our
understanding of the molecular and cellular basis of tumor heterogeneity and its influence on cancer
progression and treatment response.

## Key facts

- **NIH application ID:** 10911803
- **Project number:** 5U24CA264009-04
- **Recipient organization:** UNIVERSITY OF CALIFORNIA SANTA CRUZ
- **Principal Investigator:** Christopher Benz
- **Activity code:** U24 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $366,578
- **Award type:** 5
- **Project period:** 2021-09-07 → 2026-08-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10911803

## Citation

> US National Institutes of Health, RePORTER application 10911803, UCSC-Buck Genome Data Analysis Center for the Genomic Data Analysis Network v2.0 (5U24CA264009-04). Retrieved via AI Analytics 2026-05-25 from https://api.ai-analytics.org/grant/nih/10911803. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
