Taming sequential decision-making with reinforcement learning: non-stationarity, heterogeneity, and online/offline comparison

NSF Award Search · 01002526DB NSF RESEARCH & RELATED ACTIVIT · $149,997 · view on nsf.gov ↗

Abstract

Many decision-making tasks in healthcare, business, and economics can be naturally framed as online sequential decision-making problems, where decisions are made and outcomes are observed iteratively to achieve long-term objectives. Reinforcement learning (RL) offers a powerful framework and has achieved significant success in engineering domains, including robotics and gaming. However, human-centered tasks — such as those in healthcare and business — pose substantial new challenges for RL. These high-stakes tasks are more complex (e.g., non-stationary environments, heterogeneity across objects, and tension between leveraging historical data and the need to perform well in an interactive online setting) and impose more requirements (e.g., interpretability, performance guarantees, and computational efficiency). This project will address these challenges by conceptualizing and gaining insight into the aforementioned complications and by developing well-rounded methodologies that can effectively handle them and meet all requirements. The research outcomes will be broadly applicable to diverse fields, including but not limited to healthcare (e.g., patient treatment, mobile health), business (e.g., operations management, marketing, financial strategies), and economics (e.g., public policy). This project also integrates research and education by providing research training opportunities for students and incorporating the findings into course materials. In more detail, the proje

Key facts

NSF award ID
2515896
Awardee
Washington University (MO)
SAM.gov UEI
L6NFUM28LQM5
PI
Ran Chen
Primary program
01002526DB NSF RESEARCH & RELATED ACTIVIT
All programs
Artificial Intelligence (AI), Machine Learning Theory, STATISTICS
Estimated total
$149,997
Funds obligated
$149,997
Transaction type
Standard Grant
Period
09/01/2025 → 08/31/2028