Curation

NIH RePORTER · NIH · P41 · $1,054,104 · view on reporter.nih.gov ↗

Abstract

PROJECT SUMMARY/ABSTRACT - CURATION COMPONENT The utility of the Gene Expression Database for Mouse Development (GXD) is proportional to its data content, the quality of data annotations, and the level of data integration that we provide. To foster GXD's mission to facilitate insights into molecular mechanisms of development, differentiation and disease, the Curation Component will continue and expand GXD's curation of expression data and anatomical ontologies and ensure high-quality data annotation and integration. We will (1) continue the curation of classical types of expression data. We will collect and integrate expression data from RNA in situ hybridization, in situ reporter knock-in, immunohistochemistry, RT-PCR, Northern blot and Western blot experiments. Data will be acquired from the literature, via electronic data submissions from laboratories, and by collaborations with projects that generate these data at a large scale. We will ensure high-quality data annotation and integration through our editorial processes and quality controls. We will (2) expand our data curation effort to High-Throughput Sequencing (HTS) expression data. Using the detailed controlled vocabularies and ontologies employed in GXD, we will annotate HTS expression data sets from GEO and ArrayExpress by attributes of the samples used, thereby generating a searchable index that will help researchers to effectively find data sets of interest. Our focus will be on data sets that report on endogenous gene expression in wild-type and mutant mice. We will identify selected high-quality HTS expression data sets for inclusion in GXD. We will (3) continue the curation of anatomical ontologies and incorporate new anatomical concepts and relationships. We will continue to enhance and refine our anatomical ontologies; add develops-from relationships between anatomical structures of successive developmental stages to enable the analysis of differentiation pathways; and integrate expression, biological process, and phenotypic data by referencing common anatomical concepts, thus enabling a direct anatomy- based comparison of expression, phenotype, and disease data. Further, we will establish cross-references to anatomy ontologies from other species to facilitate the comparative analysis of expression patterns.

Key facts

NIH application ID
9922138
Project number
5P41HD062499-10
Recipient
JACKSON LABORATORY
Principal Investigator
MARTIN RINGWALD
Activity code
P41
Funding institute
NIH
Fiscal year
2020
Award amount
$1,054,104
Award type
5
Project period
— → —