# Building Safety Guards into LLMs for Trustworthy Automatic Simplification of Medical Documents

> **NIH NIH R01** · UNIVERSITY OF TEXAS AT AUSTIN · 2024 · $363,790

## Abstract

Abstract
Texts describing medical advances are of keen interest to the general public. However, reliable
medical evidence is largely disseminated in peer-reviewed journal articles describing new
ﬁndings. Because such articles are wrięen in technical language intended for experts, this
“primary literature” is eﬀectively inaccessible to the general population. With very large
language models (LLMs) like ChatGPT now widely available, lay people are increasingly
turning to them for medical information. One potentially promising use is for LLMs to provide
simpliﬁed versions of medical papers. However, while LLMs can capably simplify texts
automatically, they can also still generate inaccurate, unsupported, and/or potentially
misleading information, posing a risk.
This proposal seeks to develop novel natural language processing (NLP) technologies to
mitigate risks and improve the reliability of LLM outputs for the task of medical text
simpliﬁcation. Given the high-stakes of healthcare information, we focus on building
controllable, transparent LLMs that are moderately sized, and design tools that enable
communities to strike a balance between using LLMs safely and perceiving their outputs
critically, while (potentially) improving health literacy by eventually empowering the public
with more reliable access to high-quality, newly published medical ﬁndings.
We propose several methodological safeguards. To begin, we will design the ﬁrst error
detection model for LLM-generated simpliﬁcations of medical texts, trained with
expert-annotated data focusing on factual correctness. This tool will then allow us to build safer
knowledge distillation methods, i.e., training much more eﬃcient, smaller models on examples
elicited from massive, closed models like GPT-4 calibrated by estimated conﬁdence of their
correctness. With full access to the parameters of the distilled model, we propose innovative
ways to improve the factuality and readability of the output, and to estimate the model’s
(un)certainty of its own output. We will then integrate these safety guards into a prototype,
such that they can be evaluated by medical experts and lay readers.

## Key facts

- **NIH application ID:** 10944659
- **Project number:** 1R01LM014600-01
- **Recipient organization:** UNIVERSITY OF TEXAS AT AUSTIN
- **Principal Investigator:** Junyi Jessy Li
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $363,790
- **Award type:** 1
- **Project period:** 2024-09-18 → 2028-08-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10944659

## Citation

> US National Institutes of Health, RePORTER application 10944659, Building Safety Guards into LLMs for Trustworthy Automatic Simplification of Medical Documents (1R01LM014600-01). Retrieved via AI Analytics 2026-05-26 from https://api.ai-analytics.org/grant/nih/10944659. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
