# Enabling Next Generation Machine Learning for Large Scale Image Analysis

> **NIH NIH R44** · RNET TECHNOLOGIES, INC. · 2024 · $888,411

## Abstract

Project Summary/Abstract
Deep learning has transformed medical image analysis by delivering clinically meaningful results on challenging
problems like prostate cancer detection and lung cancer screening. FDA approval of whole-slide digital pathology
imaging (WSIs) for primary diagnosis is further increasing interest, adoption, and investment in artificial intelli-
gence (AI) technology for pathology. Learning from large medical images using patient-level labels (PLLs) has
become an active computational pathology research area. PLLs such as pathology diagnosis or clinical outcomes
are generated through healthcare operations and are often readily available. In contrast to learning paradigms
that depend on the expert annotation of images (e.g., delineating tumor regions) and are therefore time-intensive
and limited to smaller cohorts, training directly from WSIs using PLLs will allow the development of realistic
training datasets containing tens-of-thousands of subjects that can produce models with clinically-meaningful ac-
curacy. GPU accelerators have played a significant role in advancing deep learning methods for computational
pathology tools. Machine Learning Frameworks (MLFs), e.g., Pytorch and TensorFlow, provide researchers with
abstractions to quickly develop models that utilize GPUs. The evolution of GPUs and MLFs has been driven by
the analysis of small images, and so applying these tools directly to WSIs or other large medical images like
volumetric magnetic resonance or computed tomography is challenging. Adapting medical imaging problems to
the small image paradigm leads to many compromises resulting in suboptimal performance, increased imple-
mentation effort, and increased software/design complexity (e.g., patch based techniques or multiple instance
learning). As a result, the development of scalable ML models from PLLs by directly processing WSI images
through a deep learning pipeline is infeasible today on GPUs. Recent efforts that use unified GPU memory or
streaming approaches to overcome GPU memory limits and attempt to perform end-to-end training at WSI scale
have demonstrated superior performance to annotation or MIL. However, these approaches are either slow (due
to suboptimal data movement strategies), complex to adapt/use, or highly specific to a given network architecture
(limiting the ability to develop and explore new architectures). More general-purpose, efficient, and user-friendly
frameworks are needed to allow the development of WSI scale deep learning.
 This project will develop a robust software framework to facilitate seamless development and use of scalable
ML models, without the imposition of any limits on the sizes of handled images, unhindered by the limited memory
capacity in GPUs. The proposed SSTEP (Seamless Scalable Tensor-Expression Execution via Partitioning) soft-
ware framework will allow scalable and portable neural network models that directly process full high-resolution
images of arbitrary size for ...

## Key facts

- **NIH application ID:** 10850949
- **Project number:** 5R44EB032722-03
- **Recipient organization:** RNET TECHNOLOGIES, INC.
- **Principal Investigator:** Gerald Michael Sabin
- **Activity code:** R44 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $888,411
- **Award type:** 5
- **Project period:** 2021-09-30 → 2026-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10850949

## Citation

> US National Institutes of Health, RePORTER application 10850949, Enabling Next Generation Machine Learning for Large Scale Image Analysis (5R44EB032722-03). Retrieved via AI Analytics 2026-05-26 from https://api.ai-analytics.org/grant/nih/10850949. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*