# Global proteomics mass spectrometry data sharing infrastructure

> **NIH R24** · UNIVERSITY OF CALIFORNIA, SAN DIEGO · 2024 · $702,441

## Abstract

Technological developments and the increased pace of data generation in mass spectrometry (MS) now enable
systematic probing of the human proteome, thus contributing to the characterization of biomolecular mechanisms
required for the development of therapeutic responses to disease. Recognizing the scientific necessity of open
data to enable new discoveries and to establish the reliability of published results, the proteomics community
embraced data sharing as a common practice. The authors of thousands of papers have already publicly released
the underlying MS data using the resources that we propose to support and extend: the MassIVE repository
of mass spectrometry data and the ProteomeCentral data portal for the global ProteomeXchange consortium
of MS data repositories. In this project, we propose to develop new proteomics MS data infrastructure,
standards, workflows and data indexes to substantially advance FAIR (Findable, Accessible, Interoperable and
Reusable) access to proteomics MS datasets. First, we will develop new community standards for representation
of dataset metadata and for the detailed description of proteomics identifications and abundances detected in
available datasets, including peptides, proteins, isoforms and post-translational modifications. We will also extend
workflows for dataset submission, processing and indexing to enable advanced queries by each dataset’s
detected proteomics identifications. Second, we will create new infrastructure for researchers to share controlled-access
proteomics datasets from studies of human subjects where there may exist a significant risk of the data
being identifiable. Third, we will extend the ProteomeCentral data portal to support the new dataset structures,
metadata and indexes allowing for the global integration of the new levels of information across all ProteomeXchange
repositories.

## Key facts

- **NIH application ID:** 10844352
- **Project number:** 5R24GM148372-02
- **Recipient organization:** UNIVERSITY OF CALIFORNIA, SAN DIEGO
- **Principal Investigator:** Nuno Bandeira
- **Activity code:** R24
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $702,441
- **Award type:** 5
- **Project period:** 2023-05-19 → 2028-04-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10844352

## Citation

> US National Institutes of Health, RePORTER application 10844352, Global proteomics mass spectrometry data sharing infrastructure (5R24GM148372-02). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10844352. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
