Statistical Understanding of Adversarial Training in Neural Networks

NSF Award Search · 01002526DB NSF RESEARCH & RELATED ACTIVIT · $180,000 · view on nsf.gov ↗

Abstract

The increasing use of AI raises concerns about its reliability in real-world environments. Modern AI models, especially deep learning, can produce incorrect predictions when inputs are slightly and deliberately altered, a phenomenon known as adversarial attacks. These vulnerabilities can lead to critical errors in healthcare, scientific research, and security, where AI models guide important decisions. The project improves the robustness of AI by founding a new statistical framework. By improving the trustworthiness of data-driven tools, the project supports advances in multiple scientific fields. This project will prepare undergraduate and graduate students to be competitive in robust data analysis, and will increase interests in Science and Mathematics at the pre-college level through K-12 outreach. The research objective of this project is to establish statistical frameworks for robust adversarial training in neural networks and extend them to modern pre-training and fine-tuning paradigms. Specifically, the research goals include: (1) developing a theoretical foundation for adversarial training in two-layer neural networks; (2) designing scalable adversarial training algorithms that leverage dynamic attack strategies and selective sampling for computational efficiency; and (3) creating robust fine-tuning methods for pre-trained foundation models used in downstream tasks. These theoretical and algorithmic advances contribute to a deeper understanding of robustness in st

Key facts

NSF award ID: 2515194
Awardee: Michigan State University (MI)
SAM.gov UEI: R28EKN92ZTZ9
PI: Yue Xing
Primary program: 01002526DB NSF RESEARCH & RELATED ACTIVIT
All programs: Artificial Intelligence (AI), Machine Learning Theory
Estimated total: $180,000
Funds obligated: $180,000
Transaction type: Standard Grant
Period: 08/15/2025 → 07/31/2028