TOPIC NUMBER: 464 SBIR PHASE ICLOUD-BASED MULTIMODAL DATA ANALYSIS SOFTWARE FOR THE CANCER RESEARCH DATA COMMONS

NIH RePORTER · NIH · N43 · $399,985 · view on reporter.nih.gov ↗

Abstract

Clinical Data (MAGIC) within the Cancer Research Data Commons (CRDC). It leverages extensive expertise in Natural Language Processing (NLP) for clinical narrative analysis and deep knowledge of ontology, common data models, and standards for the seamless integration of imaging and multi-omics data, to unlock the full potential of petabyte heterogeneous cancer data. In this Phase I SBIR project, the specific aims are 1) applying innovative NLP capabilities to extract essential pathology information from pathology report PDF scans in the TCGA repository; 2) constructing a cloud-based platform prototype that seamlessly integrates with CRDC, facilitating the access and integrative analysis of genomics, imaging, and clinical narrative data; 3) demonstrating the platform's utility by revealing insights related to precision diagnosis, risk stratification, and efficacious treatments for squamous cell carcinoma. In addition, the organization will define the design specifications for Phase II development and outline strategies for anticipated commercialization challenges. The ultimate goal of Polygon Health Analytics LLC is to create an all inclusive and user friendly tool for the integration and analysis of multimodal data, bridging the gap between cancer research and clinical practice.

Key facts

NIH application ID
11192437
Project number
75N91024C00070-0-9999-1
Recipient
POLYGON HEALTH ANALYTICS LLC
Principal Investigator
LIXIA YAO
Activity code
N43
Funding institute
NIH
Fiscal year
2024
Award amount
$399,985
Award type
Project period
2024-09-12 → 2025-09-11