PROJECT SUMMARY
COLLABORATIVE DATA SERVICES CORE

The goal of the Collaborative Data Services Core (CDSC) is to facilitate research use of patient data, primarily by leveraging Moffitt Cancer Center’s Health & Research Informatics (HRI) platform, an enterprise-wide data warehouse that contains discrete data on more than 550,000 patients. CDSC is a unique Shared Resource that helps drive innovation across all CCSG Programs by serving as a gateway to Moffitt’s robust data assets, which span clinical, administrative, patient-reported, biospecimen, and molecular domains. CDSC supports Members with four primary services: study design consultation, provisioning of patient-level data, study-specific medical record abstraction, and training that enables Member self-service queries. Broadly, CDSC enables efficient and reproducible cohort identification, HRI data extraction, and manual chart abstraction for research purposes. The activities of the CDSC are organized into the following Specific Aims:

Aim 1: To assist Members in developing research projects with consultations on study design and feasibility, including identification of patient cohorts based on detailed inclusion and exclusion criteria.

Aim 2: To promote and facilitate cutting-edge translational research by providing Members with access to high-quality, discretely captured patient-level data from a variety of source systems, sometimes combined with manually abstracted data from patient medical records.

Aim 3: To provide access to de-identified data through individual or small-group training on self-service querying tools, data annotation, and querying best practices.

Impact: CDSC is the centralized data concierge hub for Moffitt Members, connecting them to the entirety of Moffitt’s HRI platform. In FY20, CDSC provided support for 70 Members across all five CCSG Programs (CBE 13%, CE 26%, MM 30%, HOB 12%, IO 19%), with 78% of those Members holding peer-reviewed funding.
Moreover, CDSC supported 131 peer-reviewed Members and 74 Member publications (12 high-impact), underscoring that rapid, cost-effective access to high-quality clinical, biospecimen, molecular, and demographic data drives impactful cancer research. In summary, CDSC unlocks the research potential of unique data sources that would otherwise be inaccessible to Members and provides a facile conduit from complex data source systems to research-ready data products.

Future CDSC priorities are to 1) scale up services for customized, project-specific data marts, including chart abstraction of complex EHR data and the linkage of medical images to clinical and molecular data; and 2) expand the size and scope of the Molecular Data Warehouse in order to identify patient cohorts with specific genomic profiles in near real time for scientific discovery and clinical trial matching, both at Moffitt and with our network of clinical partners across the catchment area and beyond.