Machine Learning Techniques for Accurate Prediction of Proteins Function
DOI:
https://doi.org/10.63995/HIXN4599Keywords:
Bioinformatics; Deep Learning; Drug Discovery; Genomic Sequences; Machine Learning; Protein FunctionAbstract
Machine learning techniques are revolutionizing the prediction of protein functions, offering unprecedented accuracy and efficiency in understanding biological processes. By leveraging large datasets of protein sequences and structures, machine learning models can identify patterns and relationships that are often elusive to traditional methods. Techniques such as deep learning, support vector machines, and random forests have shown remarkable success in predicting protein functions, including enzymatic activities, binding sites, and interaction networks. These approaches utilize diverse data sources, including genomic sequences, structural annotations, and interaction data, to train models capable of making precise functional predictions. The integration of machine learning with bioinformatics tools accelerates the discovery of novel protein functions and facilitates the annotation of uncharacterized proteins, contributing significantly to fields like drug discovery, disease diagnosis, and synthetic biology. This abstract underscores the transformative impact of machine learning on protein function prediction, highlighting its potential to enhance our understanding of complex biological systems and drive advancements in biomedical research.
Downloads
References
John BO Mitchell. “Machine learning methods in chemoinformatics”. In: Wiley Interdisciplinary Reviews: Computational Molecular Science 4.5 (2014), pp. 468–481. DOI: https://doi.org/10.1002/wcms.1183
Zhiyong Lu, Duane Szafron, Russell Greiner, Paul Lu, David S Wishart, Brett Poulin, John Anvik, Cam Macdonell, and Roman Eisner. “Predicting subcellular localization of proteins using machine-learned classifiers”. In: Bioinformatics 20.4 (2004), pp. 547–556. DOI: https://doi.org/10.1093/bioinformatics/btg447
James C Whisstock and Arthur M Lesk. “Prediction of protein function from protein sequence and structure”. In: Quarterly reviews of biophysics 36.3 (2003), pp. 307–340. DOI: https://doi.org/10.1017/S0033583503003901
Predrag Radivojac, Wyatt T Clark, Tal Ronnen Oron, Alexandra M Schnoes, Tobias Wittkop, Artem Sokolov, Kiley Graim, Christopher Funk, Karin Verspoor, Asa Ben-Hur, et al. “A large-scale evaluation of computational protein function prediction”. In: Nature methods 10.3 (2013), pp. 221–227.
José Juan Almagro Armenteros, Casper Kaae Sønderby, Søren Kaae Sønderby, Henrik Nielsen, and Ole Winther. “DeepLoc: prediction of protein subcellular localization using deep learning”. In: Bioinformatics 33.21 (2017), pp. 3387–3395. DOI: https://doi.org/10.1093/bioinformatics/btx431
Balachandran Manavalan, Sathiyamoorthy Subramaniyam, Tae Hwan Shin, Myeong Ok Kim, and Gwang Lee. “Machine-learning-based prediction of cell-penetrating peptides and their uptake efficiency with improved accuracy”. In: Journal of proteome research 17.8 (2018), pp. 2715–2726. DOI: https://doi.org/10.1021/acs.jproteome.8b00148
Sheng Wang, Siqi Sun, Zhen Li, Renyu Zhang, and Jinbo Xu. “Accurate de novo prediction of protein contact map by ultra-deep learning model”. In: PLoS computational biology 13.1 (2017), e1005324. DOI: https://doi.org/10.1371/journal.pcbi.1005324
Lianyi Han, Juan Cui, Honghuang Lin, Zhiliang Ji, Zhiwei Cao, Yixue Li, and Yuzong Chen. “Recent progresses in the application of machine learning approach for predicting protein functional class independent of sequence similarity”. In: Proteomics 6.14 (2006), pp. 4023–4037. DOI: https://doi.org/10.1002/pmic.200500938
Kevin K Yang, Zachary Wu, and Frances H Arnold. “Machine-learning-guided directed evolution for protein engineering”. In: Nature methods 16.8 (2019), pp. 687–694. DOI: https://doi.org/10.1038/s41592-019-0496-6
Pedro J Ballester and John BO Mitchell. “A machine learning approach to predicting protein–ligand binding affinity with applications to molecular docking”. In: Bioinformatics 26.9 (2010), pp. 1169–1175. DOI: https://doi.org/10.1093/bioinformatics/btq112
Karsten M Borgwardt, Cheng Soon Ong, Stefan Schönauer, SVN Vishwanathan, Alex J Smola, and Hans-Peter Kriegel. “Protein function prediction via graph kernels”. In: Bioinformatics 21.suppl_1 (2005), pp. i47–i56. DOI: https://doi.org/10.1093/bioinformatics/bti1007
Rong Liu and Jianjun Hu. “DNABind: A hybrid algorithm for structure-based prediction of DNA-binding residues by combining machine learning-and template-based approaches”. In: PROTEINS: structure, Function, and Bioinformatics 81.11 (2013), pp. 1885–1899. DOI: https://doi.org/10.1002/prot.24330
Maurício Boff de Ávila, Mariana Morrone Xavier, Val Oliveira Pintro, and Walter Filgueira de Azevedo Jr. “Supervised machine learning techniques to predict binding affinity. A study for cyclin-dependent kinase 2”. In: Biochemical and biophysical research communications 494.1-2 (2017), pp. 305–310. DOI: https://doi.org/10.1016/j.bbrc.2017.10.035
Brian Kuhlman and Philip Bradley. “Advances in protein structure prediction and design”. In: Nature reviews molecular cell biology 20.11 (2019), pp. 681–697. DOI: https://doi.org/10.1038/s41580-019-0163-x
Roded Sharan, Igor Ulitsky, and Ron Shamir. “Network-based prediction of protein function”. In: Molecular systems biology 3.1 (2007), p. 88. DOI: https://doi.org/10.1038/msb4100129
Guy Yachdav, Edda Kloppmann, Laszlo Kajan, Maximilian Hecht, Tatyana Goldberg, Tobias Hamp, Peter Hönigschmid, Andrea Schafferhans, Manfred Roos, Michael Bernhofer, et al. “PredictProtein—an open resource for online prediction of protein structural and functional features”. In: Nucleic acids research 42.W1 (2014), W337–W343. DOI: https://doi.org/10.1093/nar/gku366
Michael Schantz Klausen, Martin Closter Jespersen, Henrik Nielsen, Kamilla Kjaergaard Jensen, Vanessa Isabell Jurtz, Casper Kaae Soenderby, Morten Otto Alexander Sommer, Ole Winther, Morten Nielsen, Bent Petersen, et al. “NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning”. In: Proteins: Structure, Function, and Bioinformatics 87.6 (2019), pp. 520–527. DOI: https://doi.org/10.1002/prot.25674
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
© The Author(s). Published by Fusion of Multidisciplinary Research, An International Journal (FMR), Netherlands.
This is an open-access article distributed under the Creative Commons Attribution license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.