Data Science Core

NIH RePORTER · NIH · U19 · $291,433 · view on reporter.nih.gov ↗

Abstract

Project Summary: Data Science Core The main purposes of the Data Science Resource Core are to collect, manage and distribute experimental data, to visualize the analyzed results such as structure and activity, and to provide an interface for users to use our designed methods on both imaging data and omics data. This digital infrastructure will be implemented in the form of a multi-tier services-oriented architecture. The products of the other Research Projects will be collected via protocols that are amenable to a digital representation. These will then be unified via spatial registration in a common coordinate system. Ensuring that the data have appropriate landmarks and that measurements of the spatial extent of key images is a core operation for this endeavor. With data whose content is appropriate for unification, it will subsequently be determined which digital format representation is most suitable for each data type, with the goal of making them easy to present in an online interface for the consumption of the broader scientific community. Data files with the appropriate format will be made available on the public internet. From here, a user- friendly web-based interface will be designed and implemented on top of a platform for astrocytic data visualization, that itself reuses best-in-class open-source visualization software for the web. The interface will ensure that users can easily navigate between individual data sets and also can visualize relationships between data sets across the domains from the other Research Projects. The interface will enable the user to explore imaging data of astrocytes collected at different positions of the brain, as well as the omics expression pattern at the corresponding area. The interface will also provide the visualization of the astrocyte network. To encourage broad sharing of the data resources produced, high quality metadata will be created to include with the data that conveys its provenance. Data products will be enabled to have their own digital object identifiers (DOIs), incorporate compatibility with ORCID ids, and embed RRIDs into the system as necessary. Core data products will further be shared through key federally supported data sharing resources and by applying best practices to data format and dissemination techniques used. The data analysis method will also be made available for users as web-based computational services for users to analyze their own data.

Key facts

NIH application ID
10294802
Project number
1U19NS123719-01
Recipient
SALK INSTITUTE FOR BIOLOGICAL STUDIES
Principal Investigator
Guoqiang Yu
Activity code
U19
Funding institute
NIH
Fiscal year
2021
Award amount
$291,433
Award type
1
Project period
2021-08-15 → 2026-07-31