# NextGen Random Forests

> **NIH NIH R01** · UNIVERSITY OF MIAMI SCHOOL OF MEDICINE · 2020 · $347,834

## Abstract

Project Summary/Abstract
Building from the PI's current R01, we propose next generation random forests (RF) designed for unprecedented
accuracy and computational scalability to meet the challenges of today's complex and big data in the health
sciences. Superior accuracy is achieved using super greedy trees which circumvent limitations on local adaptivity
imposed by classical tree splitting. We identify a key quantity, forest weights, and show how these can be
leveraged for further improvements and generalizability. In one application, improved survival estimators are
applied to worldwide esophageal cancer data to develop guidelines for clinical decision making. Richer RF
inference is another issue explored. Cutting edge machine learning methods rarely consider the problem of
estimating variability. For RF, bootstrapping currently exists as the only tool for reliably estimating conﬁdence
intervals, but due to heavy computations is rarely applied. We introduce tools to rapidily calculate standard errors
based on U-statistic theory. These will be used to increase robustness of esophageal clinical recommendations
and to investigate survival temporal trends in cardiovascular disease. In another application, we make use of
our new massive data scalability for discovery of tumor and immune regulators of immunotherapy in cancers.
This project will set the standard for RF computational performance. Building from the core libraries of the highly
accessed R-package randomForestSRC (RF-SRC), software developed under the PIs current R01, we develop
open source next generation RF software, RF-SRC Everywhere, Big Data RF-SRC, and HPC RF-SRC. The
software will be deployable on a number of popular machine learning workbenches, use distributed data storage
technologies, and be optimized for big-p, big-n, and big-np scenarios.

## Key facts

- **NIH application ID:** 9929599
- **Project number:** 5R01GM125072-08
- **Recipient organization:** UNIVERSITY OF MIAMI SCHOOL OF MEDICINE
- **Principal Investigator:** Hemant Ishwaran
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $347,834
- **Award type:** 5
- **Project period:** 2017-09-01 → 2021-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/9929599

## Citation

> US National Institutes of Health, RePORTER application 9929599, NextGen Random Forests (5R01GM125072-08). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/9929599. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
