PROJECT SUMMARY – CORE B: DATA MANAGEMENT The Data Management Core will create, operate, and maintain state-of-the-art computer systems to store and safeguard the diverse types of data generated by our Center, and develop protocols and interfaces to share those data between the institutions in our Center as well as with the ReVAMPP Coordination and Data Sharing Center (CDSC). We will collate and collect information from all institutions in the Center, including in silico computational results, high-throughput sequencing data, and experimental data. Our main in silico data are design models of new proteins, comprising models of the designed protein structures, their amino acid sequences, and associated scores and metrics. Experimental data include negative-stain and cryo electron microscopy images; x-ray diffraction images; atomic coordinates of determined protein structures; deep-sequencing reads and their interpretation; and various wet lab assay results in spreadsheets-like formats. We will follow the industry-standard and NIH-mandated security practices to manage access to the systems we operate and all data backups, and the NIH FAIR data sharing principles to annotate newly generated data and make it available to others. We will distribute our computational methods as free open-source software on GitHub to make our analyses reproducible and accelerate biological research worldwide. Furthermore, we will expand and maintain our on-site high-performance computing resources to add storage capacity for the data we generate and computational capacity for data processing. In addition, we will manage user access to those resources and provide training on best-use practices of the resources operated by this Core.