Harnessing Machine Learning & Object Detection for Automated Evaluation of Student-folded Protein Models

NIH RePORTER · NIH · R43 · $295,921 · view on reporter.nih.gov ↗

Abstract

Project Summary Harnessing Machine Learning & Object Detection for Automated Evaluation of Student-folded Protein Models This project will impact public health through improved bioscience education. Mini-toobers are one of 3D Molecular Designs’ (3DMD) most innovative, popular, and effective modeling tools. This free-form modeling material (foam- covered wire) typically represents the backbone of a folded protein, allowing students to model actual protein structures. The problem is that mini-toobers lack an immediate feedback system. As a free-form modeling tool, students can use mini-toobers to model biological structures correctly just as easily as incorrectly (right vs. left-handed helices). 3DMD’s other modeling tools have feedback features like magnets, arrows, post/hole connections, etc. that help students self-correct. Educators observe student’s use of models to see their understanding of a concept; this observation is a type of formative assessment. However, there is often too much to observe in a mini-toober model (shape, sidechains, secondary structures, etc.) for an educator to assess all the data in real-time effectively. Our solution is to apply machine learning and object detection to create digital applications that deliver immediate feedback and precise assessment of student-folded mini-toober protein models. Two applications will be created: 1) Student Training App. The app's augmented reality features will give students immediate feedback on their modeling of alpha helices, beta sheets, and other protein motifs to help them learn protein structure and function. 2) Assessment App. This password-protected app will assess the model’s accuracy, generate a numerical score, and identify inaccurate regions of the protein. This Phase I proposal focuses on two specific aims necessary to establish the feasibility of achieving this goal: 1. Develop AI technology to locate student-folded zinc finger protein models with an 80% average precision rating. 2. Create a data gathering plan to collect the 3D coordinates of student-folded protein models. We will focus our research and development efforts on feasibility testing machine learning frameworks, including TensorFlow, PyTorch, and Unity Barracuda; on computer vision libraries such as OpenCV; and on object detection and conversion systems, including YOLO and ONNX to achieve these specific aims. One of the most powerful aspects of object detection is the ability to identify various shapes and forms. This particularly applies to 3DMD’s Mini-Toober protein modeling materials, as they can be folded into various structures depending on how a student interacts with the material. We will collect student-folded zinc finger models from outreach programs run by other 3DMD staff and partnering organizations. To test different approaches, we will scan the model with the developed app while recording video for documentation. The team will continue to develop and refine the application until at least an...

Key facts

NIH application ID: 10918783
Project number: 1R43GM154526-01
Recipient: 3D MOLECULAR DESIGNS
Principal Investigator: Mark Andrew Hoelzer
Activity code: R43
Funding institute: NIH
Fiscal year: 2024
Award amount: $295,921
Award type: 1
Project period: 2024-08-01 → 2026-01-31