Single Cell Transcriptomic and Genetic Diversity by Single Molecule Long Read Sequencing

NIH RePORTER · NIH · R01 · $545,292 · view on reporter.nih.gov ↗

Abstract

PROJECT SUMMARY Defining the features of cellular mixtures, where diverse cell types with distinct genomic characteristics are physically intermingled together, is a central problem in biology. During the past decade, single cell sequencing technologies have enabled a new era of high throughput and high resolution interrogation of cell type diversity, vastly expanding our understanding of the role that cell types play in development and disease. Yet, current studies in single cell genomics rely on short-read sequencing and thus suffer from limitations, including: (1) Most studies rely on short read counting which limits the study of alternative splicing. (2) Cell states are reflected by static snapshots, and while population dynamics can be deduced through trajectory and RNA velocity estimation, robust estimation of these parameters remains a major challenge. (3) Despite advances in single-cell DNA sequencing, there is yet no cost-effective way to simultaneously characterize both the genetic variants and transcriptome-level changes in a cell, which is crucial for diseases such as cancer. This proposal is motivated by technological breakthroughs in single-molecule sequencing (SMS) and the recent adaptation of SMS to the massively parallel sequencing of single cell transcriptomes in our lab. We propose to develop computational methods to harness the power of SMS in single cell transcriptomics. In particular, we have developed a new genomic approach which allows one to repeatedly interrogate complete transcripts from single cells using SMS long reads, rather than 3' or 5' counting with short reads. This technology allows experimental designs where specific transcript subsets and/or cellular subsets can be repeatedly targeted for deeper joint short and long read analysis over many iterations, which we will exploit to conduct analyses that were previously intractable.

Key facts

NIH application ID
10440475
Project number
5R01HG006137-12
Recipient
UNIVERSITY OF PENNSYLVANIA
Principal Investigator
Hanlee P Ji
Activity code
R01
Funding institute
NIH
Fiscal year
2022
Award amount
$545,292
Award type
5
Project period
2011-07-06 → 2025-06-30