# Discovery of structural RNAs involved in human health and disease

> **NIH NIH R01** · HARVARD UNIVERSITY · 2024 · $348,575

## Abstract

Many fundamental cellular functions depend on a variety of RNA structures conserved through evolution, and
other functional RNA structures are expected to be discovered. A signature of a conserved RNA structure is
found in alignments where paired positions display correlated substitutions (covariation) that preserve the base
pair. This evolutionary signal can be used both to predict RNA structure and to identify new conserved RNAs.
 Recent publications and preliminary results have made three important advances: A statistical covariation
test that identiﬁes signiﬁcant covariation over background covariation due to phylogeny. This test, implemented
in a method called R-scape (RNA Structural Covariation Above Phylogenetic Expectation), provides information
and control over the rate of false positive predictions. A power of covariation calculation, recently published,
that identiﬁes “negative” pairs with power (variation) but insigniﬁcant covariation, unlikely to form RNA base pairs.
A new cascading folding algorithm, named CaCoFold (Cascade covariation/variation Constrained Folding)
also recently published, that combines all positive and negative evolutionary information into complex structures
including all types of pseudoknots and triplets. In human, the efﬁcacy of these advances has been tested by ac-
curately predicting the structures of the human non-coding RNAs MALAT1 and telomerase RNA, and by inferring
that the non-coding RNAs HOTAIR and XIST do not have a conserved structure.
 These three advances give us a competitive advantage to perform unbiased genome-wide screens for con-
served structural RNAs in vertebrates with accurate 3D structure prediction. Previous vertebrate screens
for structural RNAs have been hindered by thousand of false positive predictions. In contrast, our new covariation
statistical test allows for controlling the rate of false positives. R-scape has already been used to ﬁnd struc-
tural RNAs in bacteria and viruses. Our recent eukaryotic pilot screen in fungi has identiﬁed 17 novel structural
RNAs. We hypothesize that many structural RNAs with implications for human health and disease are still to be
discovered, and that we now have the tools to ﬁnd and characterize these RNAs.
 This proposal has three speciﬁc aims that will advance the study of structural RNA biology, and the discov-
ery of novel biological mechanisms involving RNA structures. The ﬁrst aim proposes systematic genome-wide
searches to ﬁnd novel conserved vertebrate RNA structures in human. The second aim proposes to combine
revolutionary 3D structure prediction methods in machine learning with the signals used by CaCoFold into a
state of the art RNA folding method for the accurate prediction of 3D RNA structures. The third aim introduces a
method to identify RNA structures in ultra conserved vertebrate UTRs where there is no covariation signal,
and our current method lacks power. We expect our work will unveil primate-speciﬁc novel regulatory mecha-
ni...

## Key facts

- **NIH application ID:** 10904917
- **Project number:** 5R01GM144423-03
- **Recipient organization:** HARVARD UNIVERSITY
- **Principal Investigator:** Elena Rivas
- **Activity code:** R01 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2024
- **Award amount:** $348,575
- **Award type:** 5
- **Project period:** 2022-09-15 → 2026-07-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/10904917

## Citation

> US National Institutes of Health, RePORTER application 10904917, Discovery of structural RNAs involved in human health and disease (5R01GM144423-03). Retrieved via AI Analytics 2026-05-24 from https://api.ai-analytics.org/grant/nih/10904917. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
