# Illuminating the Druggable Genome Data Coordinating Center - Engagement Plan with the CFDE

> **NIH NIH OT2** · UNIVERSITY OF NEW MEXICO HEALTH SCIS CTR · 2020 · $447,013

## Abstract

The Illuminating the Druggable Genome (IDG) consortium has two major goals: First, consolidate disparate
protein- and disease-centric data types from multiple sources, integrate and harmonize them, then make them
readily available to the public; Second, adapt and scale existing technologies to unveil the function of selected
understudied members of the G-protein coupled receptor, ion channel and protein kinase families. Within the
IDG, the Knowledge Management Center (IDG-KMC) integrates data from a wide range of chemical, biological
and clinical resources, and has developed platforms that can be used to navigate understudied proteins (the
“dark genome”), and their potential contribution to specific pathologies. Specifically, the IDG KMC is creating
automated workflows to capture relevant public data for the entire proteome including manual annotations for
the IDG list, covering five major areas: genotype, phenotype, expression, structure & function, and interactions
& pathways. The IDG KMC designs, develops, implements, and updates the Target Central Repository Database
(TCRD), a protein knowledgebase. The IDG KMC also expands, improves, and maintains Pharos, the TCRD
portal, with support for automated data summaries, and active community feedback. Both TCRD and Pharos
already integrate data from three Common Fund projects: GTEx, IMPC/KOMP and LINCS. The IDG KMC
consolidates all the data generated by the Data and Resource Generation Centers (DRGCs), improving these
data findability, accessibility, interoperability, reusability (FAIRness) and serving these data on the Pharos portal.
The IDG program interface with the CFDE will enable hypothesis generation about novel drug targets for complex
diseases. Many other Common Fund (CF) programs produce data about genetic variants and differentially
expressed genes and proteins in the context of many complex human diseases. These genes in many cases do
not have much information about them. For example, the CF program Undiagnosed Disease Network (UDN)
identifies mutations in genes associated with undiagnosed diseases. The IDG-KMC has information from
empirical evidence and from computational predictions about the function of these genes, which are commonly
under-studied. Hence, data from the IDG-KMC can enrich the CFDE users who examine datasets that list genes
and proteins. Several IDG resources provide gene landing pages that provide unique information about genes.
These landing pages can be improved regarding FAIRness and can become a resource for the CFDE. In
addition, data collected by the DRGCs and by the R03 IDG awardees can enrich the content of the CFDE portal. In
particular, results from the R03 projects (Fig. 1) are currently not evaluated or stored in one place and are at risk
of becoming lost. The CFDE engagement will ensure that data from this investment remains available long term.

## Key facts

- **NIH application ID:** 10217890
- **Project number:** 1OT2OD030546-01
- **Recipient organization:** UNIVERSITY OF NEW MEXICO HEALTH SCIS CTR
- **Principal Investigator:** Christophe G. Lambert
- **Activity code:** OT2 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $447,013
- **Award type:** 1
- **Project period:** 2020-09-23 → 2025-09-22

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10217890

## Citation

> US National Institutes of Health, RePORTER application 10217890, Illuminating the Druggable Genome Data Coordinating Center - Engagement Plan with the CFDE (1OT2OD030546-01). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10217890. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
