# Data Infrastructure Core

> **NIH U19** · SLOAN-KETTERING INST CAN RESEARCH · 2022 · $3,812,100

## Abstract

A unique commonality among the diverse Projects and Cores of this Center is the need for rigorous and
reproducible data. While the successful discovery of novel antivirals is inherently multidisciplinary, data must be
generated, collected, annotated, and acted upon at each step of the process by biologists and chemists from
target validation to late-stage lead optimization. The goal of this Core is to create a next-generation data
infrastructure that allows for the efficient undertaking of these data tasks, as well as the dissemination of open data
to the larger scientific community. Innovative software will build on initial architecture developed for the
open-science COVID Moonshot initiative for rapid antiviral discovery, which led to novel inhibitors of the SARS-CoV-2
Main Protease, while also supporting hundreds of projects across the globe through open-data practices.
Early end-to-end collection of data and tracking of drug discovery project progression will enable accelerated
internal development and provide sources of data crucial for development of the global antiviral pipeline. All
internal results, as well as compound logistics and prioritization, will be tracked and made available, thus
providing the high-quality data for machine learning models to augment rapid development, and for the
computational community to build on. Data workflows and processes, in addition to data and metadata ranging
from target validation to preclinical development, will ultimately be shared with the global community, spurring rapid
global progress in antiviral discovery.

## Key facts

- **NIH application ID:** 10513870
- **Project number:** 1U19AI171399-01
- **Recipient organization:** SLOAN-KETTERING INST CAN RESEARCH
- **Principal Investigator:** John Damon Chodera
- **Activity code:** U19
- **Funding institute:** NIH
- **Fiscal year:** 2022
- **Award amount:** $3,812,100
- **Award type:** 1
- **Project period:** 2022-05-16 → 2025-04-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10513870

## Citation

> US National Institutes of Health, RePORTER application 10513870, Data Infrastructure Core (1U19AI171399-01). Retrieved via AI Analytics 2026-06-01 from https://api.ai-analytics.org/grant/nih/10513870. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
