Center for Quantitative Biology: A focus on "omics", from organisms to single cells

NIH RePORTER · NIH · P20 · $267,626 · view on reporter.nih.gov ↗

Abstract

Recent advances in single cell analysis have enabled the genome-wide measurement of DNA sequence, RNA expression, methylation and protein abundance for thousands to hundreds-of-thousands of individual cells. Methods that capture or infer spatial or temporal information provide additional contextual information to create a detailed, cell-level picture of gene activity and function. Although genome-wide single cell profiling provides unprecedented insights into the biology of complex tissues, analyzing such data on a gene-by- gene basis is extraordinarily challenging due to the large number of tested hypotheses and consequent low statistical power and difficult interpretation. While a gene-level analysis presents similar problems for bulk tissue data, the issues are magnified for single cell date due to increased noise, inflated zero counts and multi-modal distributions. One promising approach or addressing these challenges is gene set testing, or pathway analysis. Originally developed for bulk tissue data, gene set testing is a hypothesis aggregation method that leverages prior knowledge regarding the functional relationships between genes to test a smaller number hypotheses and thereby improve interpretation, replication and statistical power. By combining the single cell measurements for all genes in a set or pathway, gene set testing can also decrease variance and mitigate the impact of scarcity and multi-model distributions. Unfortunately, statistical and biological differences between single cell and bulk genomic data make it testing methods developed for bulk tissue on single cell data. Despite the motivation for customized techniques, little work has been done to develop gene set testing methods or collections that are specialized for single cell data and , for some important applications like the analysis of dynamic processes, relevant methods do not yet exist. We will address the limitations of current support for gene set analysis of single celldata by developing a suite of gene set testing methods optimized for the characteristics of single cellgene expression data. To validate these methods, we will us with the development and function of tissue-resident memory CD8 T cells.

Key facts

NIH application ID
10212421
Project number
5P20GM130454-03
Recipient
DARTMOUTH COLLEGE
Principal Investigator
Hildreth Robert Frost
Activity code
P20
Funding institute
NIH
Fiscal year
2021
Award amount
$267,626
Award type
5
Project period
2019-08-01 → 2024-06-30