Standard Standard

Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features. / Thung, Tze Y; White, Murray E; Dai, Wei et al.
In: mSystems, Vol. 6, No. 3, e0024221, 29.06.2021.

Research output: Contribution to journalArticlepeer-review

HarvardHarvard

Thung, TY, White, ME, Dai, W, Wilksch, JJ, Bamert, RS, Rocker, A, Stubenrauch, CJ, Williams, D, Huang, C, Schittelhelm, R, Barr, JJ, Jameson, E, McGowan, S, Zhang, Y, Wang, J, Dunstan, RA & Lithgow, T 2021, 'Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features', mSystems, vol. 6, no. 3, e0024221. https://doi.org/10.1128/mSystems.00242-21

APA

Thung, T. Y., White, M. E., Dai, W., Wilksch, J. J., Bamert, R. S., Rocker, A., Stubenrauch, C. J., Williams, D., Huang, C., Schittelhelm, R., Barr, J. J., Jameson, E., McGowan, S., Zhang, Y., Wang, J., Dunstan, R. A., & Lithgow, T. (2021). Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features. mSystems, 6(3), Article e0024221. https://doi.org/10.1128/mSystems.00242-21

CBE

Thung TY, White ME, Dai W, Wilksch JJ, Bamert RS, Rocker A, Stubenrauch CJ, Williams D, Huang C, Schittelhelm R, et al. 2021. Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features. mSystems. 6(3):Article e0024221. https://doi.org/10.1128/mSystems.00242-21

MLA

VancouverVancouver

Thung TY, White ME, Dai W, Wilksch JJ, Bamert RS, Rocker A et al. Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features. mSystems. 2021 Jun 29;6(3):e0024221. Epub 2021 May 27. doi: 10.1128/mSystems.00242-21

Author

Thung, Tze Y ; White, Murray E ; Dai, Wei et al. / Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features. In: mSystems. 2021 ; Vol. 6, No. 3.

RIS

TY - JOUR

T1 - Component Parts of Bacteriophage Virions Accurately Defined by a Machine-Learning Approach Built on Evolutionary Features

AU - Thung, Tze Y

AU - White, Murray E

AU - Dai, Wei

AU - Wilksch, Jonathan J

AU - Bamert, Rebecca S

AU - Rocker, Andrea

AU - Stubenrauch, Christopher J

AU - Williams, Daniel

AU - Huang, Cheng

AU - Schittelhelm, Ralf

AU - Barr, Jeremy J

AU - Jameson, Eleanor

AU - McGowan, Sheena

AU - Zhang, Yanju

AU - Wang, Jiawei

AU - Dunstan, Rhys A

AU - Lithgow, Trevor

PY - 2021/6/29

Y1 - 2021/6/29

N2 - Antimicrobial resistance (AMR) continues to evolve as a major threat to human health, and new strategies are required for the treatment of AMR infections. Bacteriophages (phages) that kill bacterial pathogens are being identified for use in phage therapies, with the intention to apply these bactericidal viruses directly into the infection sites in bespoke phage cocktails. Despite the great unsampled phage diversity for this purpose, an issue hampering the roll out of phage therapy is the poor quality annotation of many of the phage genomes, particularly for those from infrequently sampled environmental sources. We developed a computational tool called STEP3 to use the "evolutionary features" that can be recognized in genome sequences of diverse phages. These features, when integrated into an ensemble framework, achieved a stable and robust prediction performance when benchmarked against other prediction tools using phages from diverse sources. Validation of the prediction accuracy of STEP3 was conducted with high-resolution mass spectrometry analysis of two novel phages, isolated from a watercourse in the Southern Hemisphere. STEP3 provides a robust computational approach to distinguish specific and universal features in phages to improve the quality of phage cocktails and is available for use at http://step3.erc.monash.edu/. IMPORTANCE In response to the global problem of antimicrobial resistance, there are moves to use bacteriophages (phages) as therapeutic agents. Selecting which phages will be effective therapeutics relies on interpreting features contributing to shelf-life and applicability to diagnosed infections. However, the protein components of the phage virions that dictate these properties vary so much in sequence that best estimates suggest failure to recognize up to 90% of them. We have utilized this diversity in evolutionary features as an advantage, to apply machine learning for prediction accuracy for diverse components in phage virions. We benchmark this new tool showing the accurate recognition and evaluation of phage component parts using genome sequence data of phages from undersampled environments, where the richest diversity of phage still lies.

AB - Antimicrobial resistance (AMR) continues to evolve as a major threat to human health, and new strategies are required for the treatment of AMR infections. Bacteriophages (phages) that kill bacterial pathogens are being identified for use in phage therapies, with the intention to apply these bactericidal viruses directly into the infection sites in bespoke phage cocktails. Despite the great unsampled phage diversity for this purpose, an issue hampering the roll out of phage therapy is the poor quality annotation of many of the phage genomes, particularly for those from infrequently sampled environmental sources. We developed a computational tool called STEP3 to use the "evolutionary features" that can be recognized in genome sequences of diverse phages. These features, when integrated into an ensemble framework, achieved a stable and robust prediction performance when benchmarked against other prediction tools using phages from diverse sources. Validation of the prediction accuracy of STEP3 was conducted with high-resolution mass spectrometry analysis of two novel phages, isolated from a watercourse in the Southern Hemisphere. STEP3 provides a robust computational approach to distinguish specific and universal features in phages to improve the quality of phage cocktails and is available for use at http://step3.erc.monash.edu/. IMPORTANCE In response to the global problem of antimicrobial resistance, there are moves to use bacteriophages (phages) as therapeutic agents. Selecting which phages will be effective therapeutics relies on interpreting features contributing to shelf-life and applicability to diagnosed infections. However, the protein components of the phage virions that dictate these properties vary so much in sequence that best estimates suggest failure to recognize up to 90% of them. We have utilized this diversity in evolutionary features as an advantage, to apply machine learning for prediction accuracy for diverse components in phage virions. We benchmark this new tool showing the accurate recognition and evaluation of phage component parts using genome sequence data of phages from undersampled environments, where the richest diversity of phage still lies.

U2 - 10.1128/mSystems.00242-21

DO - 10.1128/mSystems.00242-21

M3 - Article

C2 - 34042467

VL - 6

JO - mSystems

JF - mSystems

SN - 2379-5077

IS - 3

M1 - e0024221

ER -