DNA labeling for improved synthetic long reads

NIH RePORTER · NIH · R43 · $329,887 · view on reporter.nih.gov ↗

Abstract

Project Summary Long-read sequencing has the potential to greatly simplify sequencing, helping to accelerate scientists’ ability to perform de novo sequencing, haplotype phasing and transcriptomics. This project aims to develop a method to label DNA prior to next-generation sequencing, that maintains information about the proximity of fragments in the original strand, aiding in the downstream assembly of the sequencing data. To label the DNA, transposase will be loaded with specially-designed transposons containing barcode labels, and used to fragment the DNA, prior to next-generation sequencing. The first aim of this project is to construct the transposomes used to label the DNA. The second aim is to use these transposomes to tagment and sequence a model DNA system to demonstrate read lengths of ~50 kb using next-generation sequencing on less than a picogram of DNA. If successful, the sequencing approach developed in this grant will simplify synthetic long read sequencing, making high-accuracy and inexpensive long read sequencing more accessible to the genetics community.

Key facts

NIH application ID
10081920
Project number
1R43HG011217-01A1
Recipient
ESPER BIOSCIENCES, INC.
Principal Investigator
Jonathan Alden
Activity code
R43
Funding institute
NIH
Fiscal year
2020
Award amount
$329,887
Award type
1
Project period
2020-09-22 → 2023-08-31