# Open Data-driven Infrastructure for Building Biomolecular Force Field for Predictive Biophysics and Drug Design

> **NIH NIH R01** · UNIVERSITY OF COLORADO · 2020 · $225,000

## Abstract

PROJECT SUMMARY/ABSTRACT
Molecular simulation is a powerful tool to predict the properties of biomolecules, interpret biophysical experiments,
and design small molecules or biomolecules with therapeutic utility. However, a number of obstacles have impeded
the development of quantitative, cloud-scale research workﬂows involving biomolecular simulation. Two main ob-
stacles are the insufﬁcient accuracy of current atomistic models for biomolecules and small molecule therapeutics
and the lack of interoperability in simulation toolchains used in both academic and industrial biomolecular research.
Our original R01, “Open Data-driven Infrastructure for Building Biomolecular Force Fields for Predictive Bio-
physics and Drug Design,” seeks to solve the ﬁrst problem. It helps fund our effort, the Open Force Field Initiative
(https://openforceﬁeld.org) to develop open, extensible, and shared software and data infrastructure, implementing
statistically robust methods of parameterizing force ﬁelds and choosing new force ﬁelds in a statistically sound
manner. This work is designed to create not just a new generation of force ﬁelds, but an open technology to
continue advancing force ﬁeld science.
However, even with improved molecular models, putting together complete workﬂows of biomolecular simulations
involves interfacing substantial numbers of different tools. However the majority of the existing molecular
simulation workﬂows are mutually incompatible, with differing representations of the molecular models.
The Open Force Field Initiative effort already includes the development of molecular data structures that we can ex-
port into existing molecular simulation tools. We propose to extend the existing scope of our R01 to create an
extensible common molecular simulation representation and translators to and from this representation.
Such a set of tools will immediately make it signiﬁcantly easier to combine the disparate workﬂows developed for
different sets of molecular simulation tools. Researchers will be able to set up and build the biophysical simulations
using their usual tools, but run and analyze them with currently incompatible tools, enabling better matching of
computational resources and methods to problems. It will help avoid trapping in a single software framework, and
enable combinations of functionalities previously impossible without substantial developer time and effort.
We will (Aim 1) work with partners to generalize our modular, extensible object model for representing
parameterized biomolecular systems in a manner that accommodates the force ﬁeld terms currently supported
by most popular biomolecular simulation packages. We will engineer it to be extensible to advanced interaction
forms, such as polarizability and other multibody terms, and machine learning models for intermolecular forces. We
will (Aim 2): enable easy conversion between components of molecular simulation workﬂows by allowing
other molecular simulation packages to easily ...

## Key facts

- **NIH application ID:** 10166314
- **Project number:** 3R01GM132386-01A1S2
- **Recipient organization:** UNIVERSITY OF COLORADO
- **Principal Investigator:** Michael R Shirts
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $225,000
- **Award type:** 3
- **Project period:** 2020-03-01 → 2024-02-29

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10166314

## Citation

> US National Institutes of Health, RePORTER application 10166314, Open Data-driven Infrastructure for Building Biomolecular Force Field for Predictive Biophysics and Drug Design (3R01GM132386-01A1S2). Retrieved via AI Analytics 2026-05-26 from https://api.ai-analytics.org/grant/nih/10166314. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
