# Category III: Next-Generation Metadata Management Infrastructure for Enabling Intelligent and Efficient Scientific Data Sharing and Discovery at National-Scale

> **NSF 01002627DB NSF RESEARCH & RELATED ACTIVIT** · College of William and Mary (VA) · $500,000

## Abstract

Modern scientific progress in fields such as fusion energy, materials research, climate science, and biomedical imaging depends on researchers' ability to find and reuse the vast amounts of data produced across the national research ecosystem. National investments such as Globus and domain-specific data repositories have made scientific data transfer and storage efficient and reliable, but they were not designed to help researchers find data by its scientific meaning, and they do not provide the descriptive information that artificial intelligence tools need to interpret datasets and recommend related work across disciplines. As a result, much valuable scientific data remains underused. By laying the groundwork for an intelligent, AI-ready scientific data discovery ecosystem, this project advances the progress of science and supports national prosperity through faster, more open scientific discovery.

The project will assess current data discovery practices and requirements across diverse scientific communities, and identify the metadata and semantic information needed to support concept-driven and AI-driven data discovery. The project will develop a community-informed architectural plan for a scalable, AI-ready metadata service that interoperates with existing software and hardware ecosystems. The plan will define representative scientific use cases, technical requirements, design principles, and a governance model. The findings are expected to benefit not only the engage…

## Key facts

- **NSF award ID:** 2609536
- **Awardee organization:** College of William and Mary (VA)
- **SAM.gov UEI:** EVWJPCY6AD97
- **PI:** Jie Ren
- **Primary program:** 01002627DB NSF RESEARCH & RELATED ACTIVIT
- **All programs:** Artificial Intelligence (AI)
- **Estimated total:** $500,000
- **Funds obligated:** $500,000
- **Transaction type:** Standard Grant
- **Period:** 08/01/2026 → 07/31/2028

## Primary source

NSF Award Search: https://www.nsf.gov/awardsearch/showAward?AWD_ID=2609536

## Citation

> US National Science Foundation, Award 2609536, Category III: Next-Generation Metadata Management Infrastructure for Enabling Intelligent and Efficient Scientific Data Sharing and Discovery at National-Scale. Retrieved via AI Analytics 2026-06-08 from https://api.ai-analytics.org/grant/nsf/2609536. Licensed CC0.

---

*[NSF Awards dataset](/datasets/nsf-awards) · CC0 1.0*
