# Extending Tools for Visualization of Geographic Structure in Population Genomic Data

> **NIH R01** · UNIVERSITY OF CHICAGO · 2020 · $355,779

## Abstract

PROJECT SUMMARY

A major challenge of contemporary research in genetics and genomics is the vast quantity of data. Visualization tools and customized data portals help conquer this complexity and greatly aid researchers on the path from data to knowledge. An important source of structure in genomic data is geography. Understanding the geography of genetic variation is crucial for human genomics as well as for the study of other species that are deeply relevant to human health. It is especially important in precision medicine, which aims to develop effective treatments for individuals of all ancestries. Currently there is a well-documented bias in genome-wide association studies (GWAS) towards European-ancestry populations, though the relevance of this bias is unclear: some studies find that GWAS results are largely portable across populations, others suggest substantial errors will arise in applying GWAS results across populations, and yet others leverage population variation via trans-ethnic fine-mapping. Given the broad importance of population structure, multiple computational tools have been developed for revealing it, and some of them are among the most cited algorithms in computational biology.

Nevertheless, few existing computational genomic methods grapple explicitly with geography. Here, we propose to develop and improve multiple tools that will empower researchers to visualize and interpret geographic patterns in genomic data. In the first, we will build on our “Geography of Genetic Variants” browser, a web-based tool for accessing and displaying information on the geographic distribution of genetic variants in humans. In the second, we will expand the functionality of our software EEMS (Estimating Effective Migration Surfaces), a visualization tool that builds maps revealing the genetic connectivity among populations. In the third, we will develop a new variant-centric view for displaying patterns of population structure that has multiple applications. Overall, we expect to produce effective, important tools that will illuminate the relationships between genetic ancestry and geography.

Throughout the project we will pay special attention to building user-friendly software and interactive data displays such as those generated by the Data-Driven Documents (D3) JavaScript visualization libraries. We aim to use simple yet flexible Python backends and provide complementary R libraries to facilitate customization and integration with existing analysis pipelines. Finally, while population genetic applications motivate our work, the tools we are generating will be generally applicable to other forms of structured biomedical data.

## Key facts

- **NIH application ID:** 9904741
- **Project number:** 5R01GM132383-02
- **Recipient organization:** UNIVERSITY OF CHICAGO
- **Principal Investigator:** John Novembre
- **Activity code:** R01
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $355,779
- **Award type:** 5
- **Project period:** 2019-04-01 → 2023-03-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/9904741

## Citation

> US National Institutes of Health, RePORTER application 9904741, Extending Tools for Visualization of Geographic Structure in Population Genomic Data (5R01GM132383-02). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/9904741. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
