SEA-PHAGES Logo

The official website of the HHMI Science Education Alliance-Phage Hunters Advancing Genomics and Evolutionary Science program.

Abstract Summary

Below is a summary of the abstract you submitted. Presenting author(s) is shown in bold.

If any changes need to be made, you can modify the abstract or change the authors.

You can also download a .docx version of this abstract.

If there are any problems, please email Dan at dar78@pitt.edu and he'll take care of them!

This abstract was last modified on March 15, 2021 at 12:54 p.m..

Purdue University
Corresponding Faculty Member: Kari Clase, kclase@purdue.edu
This abstract will NOT be considered for a talk.
New Methods of Phage Protein Study: From Statistical Analysis to Simulations
Emily A Kerstiens, Kari Clase

Phages are being studied for their potential in a variety of applications from food safety to pest control to phage therapy. Every potential phage product must go through extensive testing to determine the safety of a phage product. This is especially true of phage therapy products which must go through the FDA clinical trials. Sending every phage through clinical trials, however, significantly cuts down the number of phages that can be used. A possible method of FDA regulation involves determining therapeutically equivalent phages based on their statistically significant proteins. Then a database of equivalent phages could be built without putting every individual phage through clinical trials. However, with ~70% NKF proteins, there is still too much unknown.

The current standards of phage genome annotation only identify roughly ~30% of protein functions. A new method of protein assignment is proposed here using different alignment programs, hidden Markov models, cellular location predictions, and GROMACS simulations. The phage PotatoSplit was used as a test case, and the NKF proteins dropped from 63% to 37%. Most new proteins are related to phage structure identified using TMHMM and HHBlits. After studying existing programs, a machine learning algorithm is being developed using Python. This algorithm will be trained on phage proteins from PhagesDB and is expected to predict functions or functional categories.