Building Safety Guards into LLMs for Trustworthy Automatic Simplification of Medical Documents

NIH RePORTER · NIH · R01 · $363,790 · view on reporter.nih.gov ↗

Abstract

Abstract Texts describing medical advances are of keen interest to the general public. However, reliable medical evidence is largely disseminated in peer-reviewed journal articles describing new findings. Because such articles are wrięen in technical language intended for experts, this “primary literature” is effectively inaccessible to the general population. With very large language models (LLMs) like ChatGPT now widely available, lay people are increasingly turning to them for medical information. One potentially promising use is for LLMs to provide simplified versions of medical papers. However, while LLMs can capably simplify texts automatically, they can also still generate inaccurate, unsupported, and/or potentially misleading information, posing a risk. This proposal seeks to develop novel natural language processing (NLP) technologies to mitigate risks and improve the reliability of LLM outputs for the task of medical text simplification. Given the high-stakes of healthcare information, we focus on building controllable, transparent LLMs that are moderately sized, and design tools that enable communities to strike a balance between using LLMs safely and perceiving their outputs critically, while (potentially) improving health literacy by eventually empowering the public with more reliable access to high-quality, newly published medical findings. We propose several methodological safeguards. To begin, we will design the first error detection model for LLM-generated simplifications of medical texts, trained with expert-annotated data focusing on factual correctness. This tool will then allow us to build safer knowledge distillation methods, i.e., training much more efficient, smaller models on examples elicited from massive, closed models like GPT-4 calibrated by estimated confidence of their correctness. With full access to the parameters of the distilled model, we propose innovative ways to improve the factuality and readability of the output, and to estimate the model’s (un)certainty of its own output. We will then integrate these safety guards into a prototype, such that they can be evaluated by medical experts and lay readers.

Key facts

NIH application ID
10944659
Project number
1R01LM014600-01
Recipient
UNIVERSITY OF TEXAS AT AUSTIN
Principal Investigator
Junyi Jessy Li
Activity code
R01
Funding institute
NIH
Fiscal year
2024
Award amount
$363,790
Award type
1
Project period
2024-09-18 → 2028-08-31