A kinetic framework to map the genetic determinants of alternative RNA isoform expression

NIH RePORTER · NIH · R01 · $740,345 · view on reporter.nih.gov ↗

Abstract

R01: A kinetic framework to map the genetic determinants of alternative RNA isoform expression Project Summary At the core of the central dogma is the transcription of RNA, which enables instructions from DNA to be translated into protein messages. The biogenesis of mRNA is a complex and highly regulated process, re- quiring coordination between transcriptional and RNA processing machinery that each comprise hundreds of regulatory RNAs and proteins. These interactions often result in many possible alternative isoforms ex- pressed from a single gene. While we can now extensively catalog the abundance and variability of mRNA isoforms across cellular states, we are still limited in our abilities to predict coordination between the mech- anistic steps that give rise to cellular diversity. Most catalogs of mRNA levels only profile steady-state mRNA and are blind to the spatiotemporal dynamics regulating isoform choice and expression. However, the trajectory and fate of an mRNA molecule likely depends on the efficiency of kinetic interactions at each step of mRNA biogenesis. Thus, we must directly track mRNA fluxes on timescales commensurate with the regulatory decisions being made to estimate the rates of each step of mRNA biogenesis. While high-throughput approaches to quantify steady-state RNA levels have matured in the past decade, it is still challenging to quantify nascent RNA and infer the rates of core mRNA biogenesis steps including transcription initiation, elongation, splicing, and 3’ end cleavage. Both technical and mechanistic features confound direct kinetic measurements from nascent RNA sequencing data. Existing approaches to quan- tify rates of these steps have found extensive variability in these rates across genes and cell types, but have been limited in their ability to estimate rates of alternative processing decisions, identify kinetic coordina- tion between splicing or 3’ end cleavage events, and map genetic elements that underlie kinetic variability. We propose to address these challenges by developing a joint experimental and statistical framework to characterize the molecular and genetic factors affecting RNA processing rates. First, we will build a statistical model to estimate RNA processing rates at individual sites, and generate time-resolved short-read nascent RNA-sequencing data to train this model. Second, we will characterize ki- netic competition between individual events that underlies alternative isoform expression using a constrained state space model and generate long-read RNA sequencing data to identify expressed isoform trajectories. Third, we will estimate RNA processing rates in a population of genotyped human cells to identify quan- titative trait loci (QTL) associated with variability in RNA processing rates. For all three aims, we identify rigorous validation metrics and calibrate the uncertainty in our rate predictions. Together, our work will lead to i) a better understanding of how molecular coordinatio...

Key facts

NIH application ID
10896926
Project number
5R01HG012967-02
Recipient
UNIV OF MASSACHUSETTS MED SCH WORCESTER
Principal Investigator
Barbara Engelhardt
Activity code
R01
Funding institute
NIH
Fiscal year
2024
Award amount
$740,345
Award type
5
Project period
2023-08-01 → 2027-05-31