III: Small: Improving de novo genome assembly using single-copy k-mers

NSF Award Search · 01002627DB NSF RESEARCH & RELATED ACTIVIT · $600,000 · view on nsf.gov ↗

Abstract

A genome is the complete set of deoxyribonucleic acid (DNA) in an organism and contains all of the instructions required for that organism to function. Obtaining access to the entire genomic sequence of a species, which is called genome sequencing, can enable life scientists and medical researchers to understand the causes of diseases, develop personalized medicine, identify genetic variations within populations, study evolution, and develop new treatments or cures for diseases. Due to technological limitations of DNA sequencing instruments, scientists can only read a limited amount of DNA and not entire chromosomes. As a consequence, scientists have to break the genome into fragments before reading each fragment using a sequencing instrument. Once fragments are sequenced, they need to be assembled together like a puzzle that contains billion of pieces. This problem is called genome assembly, which is the focus of this research project. Due to complexity of solving this problem, the organisms sequenced so far represent a minuscule proportion of those that inhabit our planet, many of which we depend on for our food or against which we must defend to sustain human society. The long-term impact of obtaining the genome of a new species can be profound and impactful once the sequence becomes a routine component of practical problem solving and downstream analyses for the scientific community. This project focuses on developing new methods that would allow a much more accurate and

Key facts

NSF award ID
2444456
Awardee
University of California-Riverside (CA)
SAM.gov UEI
MR5QC5FCAVH5
PI
Stefano Lonardi
Primary program
01002627DB NSF RESEARCH & RELATED ACTIVIT
All programs
INFO INTEGRATION & INFORMATICS, SMALL PROJECT
Estimated total
$600,000
Funds obligated
$600,000
Transaction type
Continuing Grant
Period
07/01/2025 → 06/30/2028