# Nanopore Direct Single-Molecule Protein Sequencing

> **NIH NIH R01** · UNIVERSITY OF CALIFORNIA, SAN DIEGO · 2021 · $345,755

## Abstract

PROJECT SUMMARY/ABSTRACT:
 Proteins are the workhorses of cells and organisms. At present, no technologies are available for the
routine proteome-scale sequencing and quantification of this physiologically important class of molecules. In
this project, we propose to develop a nanopore technology for direct de novo single-molecule protein
sequencing. Analogous to nanopore DNA sequencing, the sequences of protein molecules are determined by
electronically measuring the alteration of the ionic current flow by the residues along the linear polypeptides
while the fully denatured molecules are electrophoretically translocated one at a time through the nanopores.
To realize the technology, we need to overcome three major challenges: (1) engineering of nanopores with
dimensions (~1 nm diameter and ~0.5 nm thickness) required to distinguish 20 amino acid residues (~0.38 Å
spacing) along the linear polypeptide chain; (2) a method for controlled unidirectional translocation of unfolded
polypeptides through the nanopores; (3) algorithms for decoding sequences from current blockage profiles.
Through several years of conceptual, theoretical and experimental work, we have found potential solutions to
these challenges. First, we have invented a new hybrid solid-state/protein/cyclopeptide nanopore architecture
that will enable the engineering of nanopores capable of distinguishing the 20 different amino acid residues for
de novo protein sequencing. Second, we have also demonstrated a strategy to impart uniform charge density
along the polypeptide chain to enable unidirectional translocation of protein through nanopores. Third, we have
developed a strategy that has enabled us to model and compute the current blockages of all 207 (=1.28x109)
heptamer combinations of 20 amino acids, and thus the current blockage profiles of any proteins. We have
also developed the algorithms to decode amino acid sequences from the computed current blockage profiles.
Excitingly, with these recent breakthroughs, we have been able to demonstrate the theoretical feasibility of de
novo nanopore protein sequencing. We have shown that 12 amino acid residues can be sequenced with >90%
consensus accuracy, 2 residues with >85% accuracy, and the other 6 residue decoded as 3 pairs with >90%
accuracy. In this project, we propose to implement these innovative approaches aiming to lay the foundation
for the experimental realization of nanopore protein sequencing. The ability to sequence and enumerate
proteins will enable routine proteome-scale identification and digital quantification of proteins with the ultimate
single-molecule and single-cell sensitivity. The ability to sequence proteins at the single-molecule level will
enable routine proteome-scale identification and digital quantification of proteins with the ultimate single-
molecule sensitivity. If successful, such a disruptive technology will find broad general applications from basic
research, drug target identification and precision cli...

## Key facts

- **NIH application ID:** 10139055
- **Project number:** 5R01GM126013-04
- **Recipient organization:** UNIVERSITY OF CALIFORNIA, SAN DIEGO
- **Principal Investigator:** XIAOHUA HUANG
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2021
- **Award amount:** $345,755
- **Award type:** 5
- **Project period:** 2018-08-01 → 2024-04-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10139055

## Citation

> US National Institutes of Health, RePORTER application 10139055, Nanopore Direct Single-Molecule Protein Sequencing (5R01GM126013-04). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10139055. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
