# SciDAP: a next generation universal platform for collaborative data analysis

> **NIH NIH R41** · DATIRIUM, LLC · 2020 · $374,500

## Abstract

The recent proliferation of next-generation sequencing (NGS) - based methods for the analysis of gene expres-
sion, chromatin structure and protein-DNA interactions has created tremendous opportunities for gaining novel
insights into basic biology, health, and disease. However, analysis of the resulting data requires computational
expertise that many traditional biologists do not possess. Hence, when dealing with genomics data, majority of
biologists require the help of bioinformaticians even for simple tasks. This places these exciting methods beyond
the reach of the majority of life scientists.
 This proposal from DATIRIUM, LLC, a start-up company from Cincinnati, OH describes a plan to create
SciDAP (Scientific Data Analysis Platform), a novel multi-omics user-friendly data analysis platform to allow
biologists to analyze the data themselves and to enable collaboration between biologists and bioinformaticians.
Datirium was founded by this application’s PIs, Artem Barski, PhD and Andrey Kartashov, initially to assist users
with installation and support of BioWardrobe. BioWardrobe, a user-friendly open-source integrative genomics
analysis platform, was developed by the Barski laboratory at Cincinnati Children’s Hospital Medical Center
(CCHMC) in 2015. It has been used by more than 40 CCHMC laboratories to process more than 8000 experi-
ments and has been applied in more than 40 publications. In addition, Datirium has installed and continues to
maintain BioWardrobe servers at several external research centers.
 For Datirium, BioWardrobe served as a Minimum Viable Product (MVP) that allowed to confirm the need
and existence of market niche for such software, but also highlighted several design limitations. The key among
them was the difficulty in adding new or modifying existing pipelines: due to the tight coupling between pipeline
and user interface this required changes at all levels of software. Unfortunately, the same limitation exists for all
user-friendly bioinformatics tools. Given that there are more than 150 NGS-based methods and many ways to
process the data, this explains why a universal and user-friendly data analysis platform does not yet exist.
 We hypothesize that we can create a data analysis platform that is both universal and user-friendly by includ-
ing interface instructions into computational pipelines itself. Platform will use these instructions to create a
graphical interface for users. Specifically, we are using containerized pipelines developed using Common Work-
flow Language (CWL). CWL allows to describe tools, pipelines and computational environment making these
pipelines both portable and reproducible. On top of CWL, Datirium developed a system of CWL extensions that
allows to describe the inputs and outputs visualizations within the CWL workflows. Importantly, our platform
will increase the rigor of computational analysis by (i) making the analysis reproducible and auditable by bioin-
formaticians due to CWL pipeline ...

## Key facts

- **NIH application ID:** 10081764
- **Project number:** 1R41HG011219-01A1
- **Recipient organization:** DATIRIUM, LLC
- **Principal Investigator:** Artem Barski
- **Activity code:** R41 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $374,500
- **Award type:** 1
- **Project period:** 2020-09-14 → 2022-05-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10081764

## Citation

> US National Institutes of Health, RePORTER application 10081764, SciDAP: a next generation universal platform for collaborative data analysis (1R41HG011219-01A1). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10081764. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
