# Big Data Training for Cancer Research

> **NIH NIH R25** · UNIVERSITY OF CALIFORNIA-IRVINE · 2024 · $324,000

## Abstract

PROJECT SUMMARY
This project, focused on biomedical big data analysis in cancer research, represents a response to the ongoing
need for initiatives that enhance the capabilities of researchers in the field. As the volume and complexity of
cancer-related data continue to grow, equipping researchers with data analysis skills and conceptual
understanding is paramount. The 12-day workshop component of the renewal proposal aims to provide
participants with the essential knowledge and practical skills necessary for proficiently handling and interpreting
large-scale datasets, especially different types of omics data in cancer research. By fostering competence and
confidence in data analysis, the workshop will contribute to a more informed and skilled research community,
thereby enhancing the utility and value of the NCI-funded data. Through its rigorous curriculum and hands-on
training, our proposal aligns with current data analysis challenges while providing flexibility and adaptability over
the performance period. The proposal is a follow-up to two previous rounds of funding and underscores our
ongoing commitment to advancing cancer research through robust data analysis techniques. The continued
demand for big data training is supported by the results from the survey of our workshops and the substantial
number of applications to the workshop each year.
This proposal includes an updated course that leverages the successes and insights gained from the current big
data courses for cancer researchers and the previous courses on big data training for biomedical researchers
supported by NIH/BD2K, ensuring the topics are adapted to the constantly evolving nature of big data in cancer
research. Our multidisciplinary team includes expertise in basic cancer research, clinical oncology,
bioinformatics, biostatistics, and computer science by harnessing the collective expertise from the Chao Family
Comprehensive Cancer Center of the University of California, Irvine (UCI), University of Colorado Cancer Center,
and the Anvil supercomputer (number 143 on the Top500 list of the world's most powerful supercomputers).
University of California, Irvine is in a unique position to lead a big data education program through partnerships
with the current American Association for Cancer Research (AACR) epidemiology workshop team, the Frederick
National Laboratory for Cancer Research (FNLCR), the Cancer Informatics for Cancer Centers (Ci4CC), and the
Big Ten Cancer Research Consortium (Big Ten CRC). By targeting cancer researchers who appreciate the value
of big data but lack the analytical skills necessary, this renewal proposal will incorporate new topics in current
cancer research and establish a cohesive big data community for sustainable communications of workshop
participants. The overarching goal of the proposed renewal project is to empower the participants with skills
and confidence to efficiently manage, visualize, analyze, integrate big data, and derive meaningful ins...

## Key facts

- **NIH application ID:** 10935873
- **Project number:** 2R25CA233429-06
- **Recipient organization:** UNIVERSITY OF CALIFORNIA-IRVINE
- **Principal Investigator:** MIN ZHANG
- **Activity code:** R25 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $324,000
- **Award type:** 2
- **Project period:** 2019-09-17 → 2029-11-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10935873

## Citation

> US National Institutes of Health, RePORTER application 10935873, Big Data Training for Cancer Research (2R25CA233429-06). Retrieved via AI Analytics 2026-06-08 from https://api.ai-analytics.org/grant/nih/10935873. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
