# Statistical Methods for MicroRNA-Seq Experiments

> **NIH NIH R01** · UNIVERSITY OF ROCHESTER · 2022 · $392,313

## Abstract

Project Summary/Abstract
MicroRNAs (miRNAs) are a class of small (18-24 nucleotide) RNAs that are essential regulators of gene expression,
which act within the RNA-induced silencing complex (RISC) to bind mRNAs and suppress translation. Alterations
in miRNA expression have been shown to disrupt entire cellular pathways, substantially contributing to a variety
of human diseases. Despite nearly 25 years of research, miRNAs remain dicult to measure due to their short
length, relatively small number, sequence similarity, and diculty to isolate from other small RNA fragments.
While qPCR- and microarray-based miRNA assays are still widely used, the majority of recent studies use small
RNA-seq (sRNA-seq) because it allows for the quanti cation of isomiRs (miRNA isoforms) and the possibility of
identifying novel miRNAs. The processing of reads generated from sRNA-seq data globally distinguish between
miRNA reads and those from other small RNAs, but do not necessarily capture the full spectrum of miRNA
variation. Subsequent statistical analyses of processed sRNA-seq data are still performed using methods developed
for mRNA-seq data despite the fact that sRNA-seq data violate several of the assumptions of these methods.
Speci cally, methods for mRNA-seq data assume approximate independence between feature counts; however,
the small total number of miRNAs and presence of a small number of very highly expressed miRNAs result in a
lack of independence between miRNA counts. Additionally, normalization methods for mRNA-seq data assume
either the overall level of transcription is constant across samples or an equal number of features are over- and
under-expressed when comparing any two samples, neither of which hold for sRNA-seq data. The development of
statistical methods that address the challenges of sRNA-seq data represents a critical need for miRNA research.
 Our long-term goal is to advance miRNA research by developing statistical methods that are tailored to
the speci c complexities of miRNA expression data. The overall objective of this application is to improve the
analysis of sRNA-seq data by developing statistical methods that account for challenges speci c to sRNA-seq data
and outperform methods designed for mRNA-seq data. This addresses an urgent need for statistical methods
to appropriately analyze sRNA-seq data, which are now routinely generated by large consortia such as TCGA
and FANTOM. The rationale that underlies the proposed research is that methods that explicitly address the
challenges inherent in measuring miRNAs are necessary to fully elucidate the role miRNAs play in many human
disease processes.

## Key facts

- **NIH application ID:** 10488660
- **Project number:** 5R01GM139928-03
- **Recipient organization:** UNIVERSITY OF ROCHESTER
- **Principal Investigator:** MATTHEW Nicholson MCCALL
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2022
- **Award amount:** $392,313
- **Award type:** 5
- **Project period:** 2020-09-11 → 2025-06-30

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10488660

## Citation

> US National Institutes of Health, RePORTER application 10488660, Statistical Methods for MicroRNA-Seq Experiments (5R01GM139928-03). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/10488660. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
