Building Safety Guards into LLMs for Trustworthy Automatic Simplification of Medical Documents

NIH RePORTER · NIH · R01 · $363,790 · view on reporter.nih.gov ↗

Abstract

Abstract Texts describing medical advances are of keen interest to the general public. However, reliable medical evidence is largely disseminated in peer-reviewed journal articles describing new ﬁndings. Because such articles are wrięen in technical language intended for experts, this “primary literature” is eﬀectively inaccessible to the general population. With very large language models (LLMs) like ChatGPT now widely available, lay people are increasingly turning to them for medical information. One potentially promising use is for LLMs to provide simpliﬁed versions of medical papers. However, while LLMs can capably simplify texts automatically, they can also still generate inaccurate, unsupported, and/or potentially misleading information, posing a risk. This proposal seeks to develop novel natural language processing (NLP) technologies to mitigate risks and improve the reliability of LLM outputs for the task of medical text simpliﬁcation. Given the high-stakes of healthcare information, we focus on building controllable, transparent LLMs that are moderately sized, and design tools that enable communities to strike a balance between using LLMs safely and perceiving their outputs critically, while (potentially) improving health literacy by eventually empowering the public with more reliable access to high-quality, newly published medical ﬁndings. We propose several methodological safeguards. To begin, we will design the ﬁrst error detection model for LLM-generated simpliﬁcations of medical texts, trained with expert-annotated data focusing on factual correctness. This tool will then allow us to build safer knowledge distillation methods, i.e., training much more eﬃcient, smaller models on examples elicited from massive, closed models like GPT-4 calibrated by estimated conﬁdence of their correctness. With full access to the parameters of the distilled model, we propose innovative ways to improve the factuality and readability of the output, and to estimate the model’s (un)certainty of its own output. We will then integrate these safety guards into a prototype, such that they can be evaluated by medical experts and lay readers.

Key facts

NIH application ID: 10944659
Project number: 1R01LM014600-01
Recipient: UNIVERSITY OF TEXAS AT AUSTIN
Principal Investigator: Junyi Jessy Li
Activity code: R01
Funding institute: NIH
Fiscal year: 2024
Award amount: $363,790
Award type: 1
Project period: 2024-09-18 → 2028-08-31