# TOPIC NUMBER:  464 SBIR PHASE ICLOUD-BASED MULTIMODAL DATA ANALYSIS SOFTWARE FOR THE CANCER RESEARCH DATA COMMONS

> **NIH NIH N43** · POLYGON HEALTH ANALYTICS LLC · 2024 · $399,985

## Abstract

Clinical Data (MAGIC) within the Cancer Research Data Commons (CRDC). It leverages extensive expertise in Natural Language Processing (NLP) for clinical narrative analysis and deep knowledge of ontology, common data models, and standards for the seamless integration of imaging and multi-omics data, to unlock the full potential of petabyte heterogeneous cancer data. In this Phase I SBIR project, the specific aims are 1) applying innovative NLP capabilities to extract essential pathology information from pathology report PDF scans in the TCGA repository; 2) constructing a cloud-based platform prototype that seamlessly integrates with CRDC, facilitating the access and integrative analysis of genomics, imaging, and clinical narrative data; 3) demonstrating the platform's utility by revealing insights related to precision diagnosis, risk stratification, and efficacious treatments for squamous cell carcinoma. In addition, the organization will define the design specifications for Phase II development and outline strategies for anticipated commercialization challenges. The ultimate goal of Polygon Health Analytics LLC is to create an all inclusive and user friendly tool for the integration and analysis of multimodal data, bridging the gap between cancer research and clinical practice.

## Key facts

- **NIH application ID:** 11192437
- **Project number:** 75N91024C00070-0-9999-1
- **Recipient organization:** POLYGON HEALTH ANALYTICS LLC
- **Principal Investigator:** LIXIA YAO
- **Activity code:** N43 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $399,985
- **Award type:** —
- **Project period:** 2024-09-12 → 2025-09-11

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/11192437

## Citation

> US National Institutes of Health, RePORTER application 11192437, TOPIC NUMBER:  464 SBIR PHASE ICLOUD-BASED MULTIMODAL DATA ANALYSIS SOFTWARE FOR THE CANCER RESEARCH DATA COMMONS (75N91024C00070-0-9999-1). Retrieved via AI Analytics 2026-05-26 from https://api.ai-analytics.org/grant/nih/11192437. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
