Automated, optimized, intelligent data collection for cryo-EM

NIH RePORTER · NIH · R01 · $655,046 · view on reporter.nih.gov ↗

Abstract

Project Summary Cryo-electron microscopy (cryo-EM) is now a widely established and indispensable method for determining the high-resolution structures of biomedically important molecules. Given that thousands of images, often acquired over the course of several days, are required to obtain such structures, automation software has played a critical role in the large-scale adoption of this method by the scientific community. In just the past five years, cryo-EM has revolutionized our understanding of entire biological systems, and in 2020 provided the first molecular descriptions of SARS-CoV-2 interaction with neutralizing antibodies. The widespread adoption of cryo-EM recently prompted the NIH to invest in three National Centers through the Transformative High Resolution Cryo- Electron Microscopy Program, providing free, high-end electron microscope access to biologists across the country. The exponential increase in the popularity of cryo-EM has led to an astonishing number of developments in sample preparation methodologies and image processing algorithms, which have improved attainable resolution of single particle reconstructions. However, comparatively little progress has been made in optimizing the quality of the cryo-EM data being collected. The pioneering software packages Leginon and Appion demonstrated the power of automated data acquisition and real-time processing (respectively), and there are now numerous programs for automated data acquisition and real-time processing. Despite advances in automation, optimally extracting the highest quality data from an EM sample still requires manual involvement of an expert electron microscopist. User intervention and expertise is necessary to run the appropriate image analyses, interpret the results, and make informed decisions on how the processed results relate to the ongoing data collection. However, even experts must content with the fact that the “best grid regions” differ drastically from sample to sample, and there are no established tools for automatically and quickly assessing the quality of the specimen across the various microenvironments of an EM grid. Given the ever-increasing incorporation of cryo-EM into labs’ research programs, it is imperative that data collection and processing be streamlined to match the growing needs of the structural community. We propose to develop a second generation Leginon/Appion software package, “Magellon”, to overcome existing bottlenecks and provide an avenue toward fully automated data acquisition that bypasses need for user input during data collection. Importantly, this software will support the computational infrastructure to enable real-time image processing results to inform on and modify the ongoing data collection regime by learning where to acquire images in regions that will yield the highest resolution structures. We will develop and incorporate new, fast image assessment routines, while also providing an application programming interface to...

Key facts

NIH application ID
10317907
Project number
1R01GM143805-01
Recipient
SCRIPPS RESEARCH INSTITUTE, THE
Principal Investigator
Gabriel C Lander
Activity code
R01
Funding institute
NIH
Fiscal year
2021
Award amount
$655,046
Award type
1
Project period
2021-09-22 → 2025-06-30