# An ENCODE ChIP-seq pipeline using endogenously tagged human DNA-associated proteins

> **NIH NIH UM1** · HUDSON-ALPHA INSTITUTE FOR BIOTECHNOLOGY · 2021 · $943,115

## Abstract

Project Summary/Abstract
Almost a tenth of human genes code for proteins that interact with chromosomes in the nucleus. Most of these
DNA-associated proteins (referred to as DAPs) are involved in regulating gene expression, by serving as part
of the basic transcriptional machinery, as transcription factors that regulate the spatial and temporal levels of
transcription, or as chromatin state regulators. These proteins are key components in biology, as
transcriptional regulation underlies fundamental biological processes in organismal development, in
determining cell states during differentiation, and in directing physiological responses to the internal and
external environment. Thus, comprehensive and detailed assessment of the molecular actions of DAPs, that is,
where they interact throughout the human genome, is a fundamental long-term goal of both basic and clinical
research. In response to RFA-HG-16-002, “Expanding the Encyclopedia of DNA Elements (ENCODE) in the
Human and Mouse (UM1)”, this application proposes to use a recently-established "shovel ready" pipeline for
mapping DAPs in human cell lines that overcomes the very high failure rates of traditional ChIP-seq, a widely-
used approach that requires specific antibodies for each factor. The new approach, called CETCh-seq,
involves adding an epitope tag at the endogenous locus encoding each protein, and using chromatin
immunoprecipitation with a universal antibody against the epitope followed by high-throughput sequencing
(ChIP-seq) to identify DAP-DNA associations genome-wide. This production pipeline will be applied to each of
1,244 DAPs that are expressed in a set of human cell lines and have not yet been mapped by ENCODE.
During the four-year project, this pipeline will be used to test each of these factors in one human cell line, and
for 100 of the DAPs, in four human cell lines, allowing characterization of cell-type differences. The project will
also tag and assay multiple allelic versions of a small number of DAPs in which pathogenic or potentially
pathogenic mutations have been identified. The project will produce genome-wide DAP maps and identify
motifs for hundreds of human regulatory proteins, providing an important component for the next phase of the
ENCODE Project. All data, as well as useful materials in the form of gene editing plasmids and tagged human
cell lines, will be made freely available to the research community.

## Key facts

- **NIH application ID:** 10240992
- **Project number:** 3UM1HG009411-04S1
- **Recipient organization:** HUDSON-ALPHA INSTITUTE FOR BIOTECHNOLOGY
- **Principal Investigator:** Eric Mendenhall
- **Activity code:** UM1 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2021
- **Award amount:** $943,115
- **Award type:** 3
- **Project period:** 2017-02-01 → 2023-01-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10240992

## Citation

> US National Institutes of Health, RePORTER application 10240992, An ENCODE ChIP-seq pipeline using endogenously tagged human DNA-associated proteins (3UM1HG009411-04S1). Retrieved via AI Analytics 2026-05-22 from https://api.ai-analytics.org/grant/nih/10240992. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
