# The WashU-UCSC-EBI Human Genome Reference Center

> **NIH NIH U41** · WASHINGTON UNIVERSITY · 2020 · $2,515,410

## Abstract

PROJECT SUMMARY (Overall: Human Genome Reference Center)
The human reference genome is the foundational resource upon which the framework of modern human
genetics and genomics has been constructed. It is the analytical substrate for nearly all human genomics
applications including read alignment, variant detection, variant interpretation, functional annotation,
population genetics, and epigenomic analysis. In a more basic sense, the reference genome also serves as
a coordinate system for systematically reporting and comparing results across studies, and for cataloging the
important genetic elements and variants that exist in humans. As genomic methods continue to march into
the clinical realm, the reference genome will become increasingly important for genetic screening and
precision medicine. Yet, there is a growing sense that the current reference genome has become obsolete.
The primary limitation is that the reference does not adequately represent genomic diversity in the human
population, and this leads to "reference biases" that adversely affect the accuracy of genetic analyses. To
solve this, it is necessary to build a reference pan-human genome – i.e., a "pan-genome" – that represents
the full complement of common variants, haplotypes and functional elements that exist in our collective
genomes. To accomplish this goal, we propose to form the WashU-UCSC-EBI Human Genome Reference
Center. Starting with the genome assemblies generated by the data production center, we will create a high
quality map of sequence alignments and variants, and use the genome graph methods that we have
pioneered to build a pan-genome resource that naturally represents genetic diversity. We will annotate the
pan-genome for genes and other elements, and share this resource broadly and openly for public use.
Working with the community, we will foster a new ecosystem of genome analysis tools that work with this
new reference. We will maintain and gradually improve the reference by soliciting user feedback and
establishing scalable bioinformatic methods and targeted sequenced protocols for resolving errors and
improving specific genomic regions. We further propose to form a logistical coordination center that efficiently
organizes communication and collaborative activities at the level of the entire consortium, ensuring that all
program components are working hand-in-hand. Finally, and perhaps most importantly from the standpoint
of user adoption, we have devised an integrated pan-genome transition plan that involves broad community
engagement via outreach and education at the level of tool developers and end users. Taken together, these
efforts will create a new human genome reference, software ecosystem, and expert user base to support the
next generation of human genetics and clinical practice.

## Key facts

- **NIH application ID:** 10020425
- **Project number:** 5U41HG010972-02
- **Recipient organization:** WASHINGTON UNIVERSITY
- **Principal Investigator:** Paul Flicek
- **Activity code:** U41 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $2,515,410
- **Award type:** 5
- **Project period:** 2019-09-18 → 2024-07-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10020425

## Citation

> US National Institutes of Health, RePORTER application 10020425, The WashU-UCSC-EBI Human Genome Reference Center (5U41HG010972-02). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10020425. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
