Super Greedy Trees

NIH RePORTER · NIH · R35 · $422,125 · view on reporter.nih.gov ↗

Abstract

Project Summary/Abstract We identify critical weaknesses with Classification and Regression Trees (CART), a widely used base learner for machine learning of big omic-data analysis, and propose to replace these with a fundamentally different type of base learner we call super greedy trees (SGT's). SGT's cut the space in a fundamentally different manner, resulting in a richer partition structure with provable consistency and superior empirical performance. The project will develop a unified SGT framework for big data analysis using machine learning including the treatment of time varying covariate survival analysis, unsupervised learning, highly imbalanced data and multivariate regression. The SGT framework will be deployed within scalable and extensible open source software that will allow NIGMS researchers to deploy them to deal with their challenging big data problems.

Key facts

NIH application ID
10407442
Project number
5R35GM139659-02
Recipient
UNIVERSITY OF MIAMI SCHOOL OF MEDICINE
Principal Investigator
Hemant Ishwaran
Activity code
R35
Funding institute
NIH
Fiscal year
2022
Award amount
$422,125
Award type
5
Project period
2021-06-01 → 2026-05-31