# Deciphering the Sequence Code for Compositional Control of Phase Separated Biomolecular Condensates

> **NIH NIH F32** · UT SOUTHWESTERN MEDICAL CENTER · 2020 · $64,926

## Abstract

Project Summary/Abstract
Biomolecular condensates formed via liquid-liquid phase separation (LLPS) are important organizers of
cellular biochemistry. Protein LLPS is often mediated by contacts between intrinsically disordered proteins or
regions (IDPRs) within larger proteins, frequently in cooperation with multivalent folded interaction domains.
The space of IDPR sequences that support LLPS has been only sparsely sampled, owing to the low-
throughput techniques currently used. Thus, a general framework describing the connections between IDPR
primary sequence, conformational ensemble, and LLPS remains elusive. By developing and applying novel
high-throughput methods, the work I propose aims to break the bottlenecks of classical biochemistry. I will
design a ~10,000-sequence DNA library encoding GFP-tagged IDPRs, varying amino acid composition and
residue patterning in the primary sequence. Library sequences will be constructed by high-throughput gene
synthesis and sequences will be individually expressed microfluidic droplets using in vitro
transcription/translation. I will then use fluorescence-activated droplet sorting to select droplets bearing
sequences that support LLPS, which will be identified by high-throughput sequencing of DNA from sorted
droplets. Computational simulations and biochemical/biophysical characterization will demonstrate how
primary sequence and conformation ensemble are connected in IDPR phase separation. Further applying this
approach, I will identify IDPR features that modulate LLPS by multivalent folded domains. Finally, I will identify
IDPR features that govern rates of condensate solidification, a process that is accelerated in disease-
associated mutants of condensate-forming IDPRs. The proposed work will provide a set of rules governing
phase separation by IDPRs, backed up by an empirical dataset of unprecedented scale. This work will be
carried out in the lab of Dr. Michael Rosen, a pioneer in the field of biomolecular condensates, at the
University of Texas Southwestern Medical Center, a leading biomedical research institute with excellent
facilities and investigators in a wide array of fields. As part of this research training plan, I will be take on a
robust series of career development and training activities, including coursework in the responsible conduct
of research, grant writing, laboratory management, and educational techniques. I will also have numerous
opportunities to attend seminars and conferences, where I will be able to present my work in poster sessions
and talks, and opportunities to publish my work in peer-reviewed journals. I also plan to develop a teaching
portfolio and gain mentorship experience by training graduate students, rotation students, and visiting
undergraduates. In summary, the research training plan proposed here will provide me with the skills,
knowledge, and experience necessary to fulfill my long-term goal of becoming an independent investigator.

## Key facts

- **NIH application ID:** 9909965
- **Project number:** 1F32GM136058-01
- **Recipient organization:** UT SOUTHWESTERN MEDICAL CENTER
- **Principal Investigator:** Andrew S Lyon
- **Activity code:** F32 (R01, R21, SBIR, etc.)
- **Funding institute:** NIH
- **Fiscal year:** 2020
- **Award amount:** $64,926
- **Award type:** 1
- **Project period:** 2020-02-01 → 2022-01-31

## Primary source

NIH RePORTER: https://reporter.nih.gov/project-details/9909965

## Citation

> US National Institutes of Health, RePORTER application 9909965, Deciphering the Sequence Code for Compositional Control of Phase Separated Biomolecular Condensates (1F32GM136058-01). Retrieved via AI Analytics 2026-05-23 from https://api.ai-analytics.org/grant/nih/9909965. Licensed CC0.

---

*[NIH grants dataset](/datasets/nih-grants) · CC0 1.0*
