Investigating the Effect of Synthetic Medical Images on Fairness in Medical Deep Learning Research

NIH RePORTER · NIH · R21 · $212,928 · view on reporter.nih.gov ↗

Abstract

PROJECT SUMMARY or ABSTRACT This grant proposal describes a research project that aims to improve the performance of deep learning models for pathology detection in chest radiographs. Understudied racial and ethnic minority groups in the United States experience higher rates of illness and death across a wide range of health conditions. With the adoption of Artificial Intelligence (AI) and Deep Learning (DL) in healthcare, there is growing concern about increased differences through the use of algorithms. To address this issue, the research team proposes to leverage generative modeling to better represent underserved and understudied groups in training data. Specifically, they will train DL models that detect 14 pathologies from publicly available chest radiographs with patient age, sex and race information. They will use Denoising Diffusion Probabilistic Models (DDPMs) to create synthetic data and augment the dataset with more diverse images. The research team expects that engineered image synthesis will train DL models that reliably detect chest pathologies without disproportionate performance on different race or sex groups. To achieve this goal, they have proposed three aims: 1) Establish a baseline for pathology detection in chest radiographs; 2) Augment the real radiographs with synthetic chest radiographs representing minorities; and 3) Assess the impact of synthetic data on model performance. The proposed research leverages the power of DL image generation algorithms to potentially improve the performance of pathology detection in chest radiographs. Additionally, the generative model will be released publicly as a foundational model for researchers without access to the required computational resources to train such models. This approach has the potential to improve healthcare outcomes for underserved populations and advance this field of AI research.

Key facts

NIH application ID
10867700
Project number
1R21MD019360-01
Recipient
MAYO CLINIC ROCHESTER
Principal Investigator
BRADLEY J. ERICKSON
Activity code
R21
Funding institute
NIH
Fiscal year
2024
Award amount
$212,928
Award type
1
Project period
2024-09-22 → 2026-06-30