# The AnVIL Data Ecosystem

> **NIH NIH U24** · BROAD INSTITUTE, INC. · 2020 · $217,232

## Abstract

PROJECT SUMMARY
In response to the COVID-19 pandemic, the Host Genetics Initiative (HGI) has formed to generate, share, and
analyze data to identify the underlying genetic determinants of the SARS-CoV-2 infection and disease,
including the severity in symptomatic presentation and associated outcomes, as well as the development of
hypotheses for drug repurposing. To achieve the aims of the consortium, the need for a mechanism to ingest
data and make it broadly available for researchers arose. With the National Human Genome Research
Institute’s commitment to the same goals, the AnVIL, which is predicated on providing a cloud environment for
the analysis of large genomic and clinical datasets, became the eminent choice to serve as a data repository
and analysis platform for the consortium. As the technology has been developed through the parent grant, the
aims proposed for this grant allows the AnVIL to provide scalable support for the Host Genetics Initiative in
their utilization of the AnVIL. The specific aims include:
• Aim 1: Scale the data Ingest processes for genotypic, phenotypic, clinical report data and metadata to
 support the expected influx of data from the Host Genetics Initiative
• Aim 2: Utilize the Data Use Oversight System (DUOS) to facilitate expeditious data governance and
 access requests
• Aim 3: Dissemination of data for researchers via featured workspaces done in partnership with the
 Host Genetics Initiative
• Aim 4: Provide support for new data generator and user communities
Through the implementation of these aims over the course of one year, we will enable the data that is
generated by the members of the Host Genetics Initiative to be broadly shared and analyzed with researchers
inside and outside of the consortium, which will serve to contribute to the global knowledge of the biology of
SARS-CoV-2 infection and disease.

## Key facts

- **NIH application ID:** 10166400
- **Project number:** 3U24HG010262-03S2
- **Recipient organization:** BROAD INSTITUTE, INC.
- **Principal Investigator:** Robert J Carroll
- **Activity code:** U24 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $217,232
- **Award type:** 3
- **Project period:** 2018-09-19 → 2023-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10166400

## Citation

> US National Institutes of Health, RePORTER application 10166400, The AnVIL Data Ecosystem (3U24HG010262-03S2). Retrieved via AI Analytics 2026-06-08 from https://api.ai-analytics.org/grant/nih/10166400. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
