Machine Learning Techniques for Accurate Prediction of Proteins Function

Authors

  • Antonio Lorenza Dipartimento di Malattie Infettive, Parassitarie ed Immunomediate, Istituto Superiore di Sanità, Rome, Italy Author
  • Gabriella Caterina Dipartimento di Malattie Infettive, Parassitarie ed Immunomediate, Istituto Superiore di Sanità, Rome, Italy Author
  • Bianca Isabella Dipartimento di Malattie Infettive, Parassitarie ed Immunomediate, Istituto Superiore di Sanità, Rome, Italy Author

DOI:

https://doi.org/10.63995/HIXN4599

Keywords:

Bioinformatics; Deep Learning; Drug Discovery; Genomic Sequences; Machine Learning; Protein Function

Abstract

Machine learning techniques are revolutionizing the prediction of protein functions, offering unprecedented accuracy and efficiency in understanding biological processes. By leveraging large datasets of protein sequences and structures, machine learning models can identify patterns and relationships that are often elusive to traditional methods. Techniques such as deep learning, support vector machines, and random forests have shown remarkable success in predicting protein functions, including enzymatic activities, binding sites, and interaction networks. These approaches utilize diverse data sources, including genomic sequences, structural annotations, and interaction data, to train models capable of making precise functional predictions. The integration of machine learning with bioinformatics tools accelerates the discovery of novel protein functions and facilitates the annotation of uncharacterized proteins, contributing significantly to fields like drug discovery, disease diagnosis, and synthetic biology. This abstract underscores the transformative impact of machine learning on protein function prediction, highlighting its potential to enhance our understanding of complex biological systems and drive advancements in biomedical research.

Downloads

Download data is not yet available.

References

John BO Mitchell. “Machine learning methods in chemoinformatics”. In: Wiley Interdisciplinary Reviews: Computational Molecular Science 4.5 (2014), pp. 468–481. DOI: https://doi.org/10.1002/wcms.1183

Zhiyong Lu, Duane Szafron, Russell Greiner, Paul Lu, David S Wishart, Brett Poulin, John Anvik, Cam Macdonell, and Roman Eisner. “Predicting subcellular localization of proteins using machine-learned classifiers”. In: Bioinformatics 20.4 (2004), pp. 547–556. DOI: https://doi.org/10.1093/bioinformatics/btg447

James C Whisstock and Arthur M Lesk. “Prediction of protein function from protein sequence and structure”. In: Quarterly reviews of biophysics 36.3 (2003), pp. 307–340. DOI: https://doi.org/10.1017/S0033583503003901

Predrag Radivojac, Wyatt T Clark, Tal Ronnen Oron, Alexandra M Schnoes, Tobias Wittkop, Artem Sokolov, Kiley Graim, Christopher Funk, Karin Verspoor, Asa Ben-Hur, et al. “A large-scale evaluation of computational protein function prediction”. In: Nature methods 10.3 (2013), pp. 221–227.

José Juan Almagro Armenteros, Casper Kaae Sønderby, Søren Kaae Sønderby, Henrik Nielsen, and Ole Winther. “DeepLoc: prediction of protein subcellular localization using deep learning”. In: Bioinformatics 33.21 (2017), pp. 3387–3395. DOI: https://doi.org/10.1093/bioinformatics/btx431

Balachandran Manavalan, Sathiyamoorthy Subramaniyam, Tae Hwan Shin, Myeong Ok Kim, and Gwang Lee. “Machine-learning-based prediction of cell-penetrating peptides and their uptake efficiency with improved accuracy”. In: Journal of proteome research 17.8 (2018), pp. 2715–2726. DOI: https://doi.org/10.1021/acs.jproteome.8b00148

Sheng Wang, Siqi Sun, Zhen Li, Renyu Zhang, and Jinbo Xu. “Accurate de novo prediction of protein contact map by ultra-deep learning model”. In: PLoS computational biology 13.1 (2017), e1005324. DOI: https://doi.org/10.1371/journal.pcbi.1005324

Lianyi Han, Juan Cui, Honghuang Lin, Zhiliang Ji, Zhiwei Cao, Yixue Li, and Yuzong Chen. “Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity”. In: Proteomics 6.14 (2006), pp. 4023–4037. DOI: https://doi.org/10.1002/pmic.200500938

Kevin K Yang, Zachary Wu, and Frances H Arnold. “Machine-learning-guided directed evolution for protein engineering”. In: Nature methods 16.8 (2019), pp. 687–694. DOI: https://doi.org/10.1038/s41592-019-0496-6

Pedro J Ballester and John BO Mitchell. “A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking”. In: Bioinformatics 26.9 (2010), pp. 1169–1175. DOI: https://doi.org/10.1093/bioinformatics/btq112

Karsten M Borgwardt, Cheng Soon Ong, Stefan Schönauer, SVN Vishwanathan, Alex J Smola, and Hans-Peter Kriegel. “Protein function prediction via graph kernels”. In: Bioinformatics 21.suppl_1 (2005), pp. i47–i56. DOI: https://doi.org/10.1093/bioinformatics/bti1007

Rong Liu and Jianjun Hu. “DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning-and template-based approaches”. In: PROTEINS: structure, Function, and Bioinformatics 81.11 (2013), pp. 1885–1899. DOI: https://doi.org/10.1002/prot.24330

Maurício Boff de Ávila, Mariana Morrone Xavier, Val Oliveira Pintro, and Walter Filgueira de Azevedo Jr. “Supervised machine learning techniques to predict binding affinity. A study for cyclin-dependent kinase 2”. In: Biochemical and biophysical research communications 494.1-2 (2017), pp. 305–310. DOI: https://doi.org/10.1016/j.bbrc.2017.10.035

Brian Kuhlman and Philip Bradley. “Advances in protein structure prediction and design”. In: Nature reviews molecular cell biology 20.11 (2019), pp. 681–697. DOI: https://doi.org/10.1038/s41580-019-0163-x

Roded Sharan, Igor Ulitsky, and Ron Shamir. “Network-based prediction of protein function”. In: Molecular systems biology 3.1 (2007), p. 88. DOI: https://doi.org/10.1038/msb4100129

Guy Yachdav, Edda Kloppmann, Laszlo Kajan, Maximilian Hecht, Tatyana Goldberg, Tobias Hamp, Peter Hönigschmid, Andrea Schafferhans, Manfred Roos, Michael Bernhofer, et al. “PredictProtein—an open resource for online prediction of protein structural and functional features”. In: Nucleic acids research 42.W1 (2014), W337–W343. DOI: https://doi.org/10.1093/nar/gku366

Michael Schantz Klausen, Martin Closter Jespersen, Henrik Nielsen, Kamilla Kjaergaard Jensen, Vanessa Isabell Jurtz, Casper Kaae Soenderby, Morten Otto Alexander Sommer, Ole Winther, Morten Nielsen, Bent Petersen, et al. “NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning”. In: Proteins: Structure, Function, and Bioinformatics 87.6 (2019), pp. 520–527. DOI: https://doi.org/10.1002/prot.25674

Downloads

Published

2020-07-20

How to Cite

Antonio Lorenza, Gabriella Caterina, & Bianca Isabella. (2020). Machine Learning Techniques for Accurate Prediction of Proteins Function. Fusion of Multidisciplinary Research, An International Journal, 1(2), 85-96. https://doi.org/10.63995/HIXN4599