# Data Management for Molecule Simulation : A Throughput-Oriented Approach

> **NIH NIH R01** · UNIVERSITY OF SOUTH FLORIDA · 2021 · $287,698

## Abstract

Project Summary
We will design and develop a molecular simulation data management system, we call P2DMS, to analyze large dis-
tributed molecular dynamics (MD) simulation data. The salient features of the system include: (1) a push-based local
query engine design that handles data in a batch processing manner and processes many queries at the same time;
(2) optimized MD analytics tools using modern many-core hardware such as GPUs; and (3) efﬁcient management and
access to distributed data over wide area networks, which is quite common for large scale MD simulations. This will
be done by building a data analysis layer on top of state-of-the-art distributed big data management systems. The out-
come of this project will not only improve the efﬁciency of MD data processing, but also enable new knowledge discov-
ery that is currently regarded difﬁcult or infeasible. In particular, we will integrate the P2DMS program into existing MD
simulation packages, and validate the new design with important real-world biological and MD methodological prob-
lems. In particular, we will (1) model the structure-function relationships of how the spike protein of SARS-CoV-2 inter-
acts with the human angiotensin converting enzyme 2 (ACE2) receptor; and (2) enhance the performance of a recently
developed parameter optimization software for active control of MD simulations.

## Key facts

- **NIH application ID:** 10299289
- **Project number:** 1R01GM140316-01A1
- **Recipient organization:** UNIVERSITY OF SOUTH FLORIDA
- **Principal Investigator:** Yicheng Tu
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2021
- **Award amount:** $287,698
- **Award type:** 1
- **Project period:** 2021-09-22 → 2025-08-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10299289

## Citation

> US National Institutes of Health, RePORTER application 10299289, Data Management for Molecule Simulation : A Throughput-Oriented Approach (1R01GM140316-01A1). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10299289. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
