Investigating the Effect of Synthetic Medical Images on Fairness in Medical Deep Learning Research

NIH RePORTER · NIH · R21 · $212,928 · view on reporter.nih.gov ↗

Abstract

PROJECT SUMMARY or ABSTRACT This grant proposal describes a research project that aims to improve the performance of deep learning models for pathology detection in chest radiographs. Understudied racial and ethnic minority groups in the United States experience higher rates of illness and death across a wide range of health conditions. With the adoption of Artificial Intelligence (AI) and Deep Learning (DL) in healthcare, there is growing concern about increased differences through the use of algorithms. To address this issue, the research team proposes to leverage generative modeling to better represent underserved and understudied groups in training data. Specifically, they will train DL models that detect 14 pathologies from publicly available chest radiographs with patient age, sex and race information. They will use Denoising Diffusion Probabilistic Models (DDPMs) to create synthetic data and augment the dataset with more diverse images. The research team expects that engineered image synthesis will train DL models that reliably detect chest pathologies without disproportionate performance on different race or sex groups. To achieve this goal, they have proposed three aims: 1) Establish a baseline for pathology detection in chest radiographs; 2) Augment the real radiographs with synthetic chest radiographs representing minorities; and 3) Assess the impact of synthetic data on model performance. The proposed research leverages the power of DL image generation algorithms to potentially improve the performance of pathology detection in chest radiographs. Additionally, the generative model will be released publicly as a foundational model for researchers without access to the required computational resources to train such models. This approach has the potential to improve healthcare outcomes for underserved populations and advance this field of AI research.

Key facts

NIH application ID: 10867700
Project number: 1R21MD019360-01
Recipient: MAYO CLINIC ROCHESTER
Principal Investigator: BRADLEY J. ERICKSON
Activity code: R21
Funding institute: NIH
Fiscal year: 2024
Award amount: $212,928
Award type: 1
Project period: 2024-09-22 → 2026-06-30