Applications of Deep learning models in Bioinformatics

Preeti Thareja

Research Scholar, Department of Computer Science and Applications, M.D. University, Rohtak, Haryana, India

Corresponding Author: preetithareja10@gmail.com

Rajender Singh Chhillar

Professor, Department of Computer Science and Applications, M.D. University, Rohtak, Haryana, India

Corresponding Author: chhillar02@gmail.com

Abstract :

Deep learning (DL) models have had an influence on machine learning-based in bioinformatics applications since they allow for the learning of complicated non-linear interactions between functionalities. Deep learning models also enable information utilized from large unlabeled data that is unrelated to the problem under investigation. Protein-protein interactions (PPIs) are important in a variety of biological functions, including cell signaling, immune function, and cellular organization. PPIs analysis is thus vital, as it may spotlight the detection of targeted proteins and their role in the disease and thus help in designing treatments for it. PPIs play critical roles in life processes, and abnormal interactions are linked to a variety of disorders. Identification of interaction sites is critical for understanding disease mechanisms and designing new drugs. Because of the overall cost of experimental methods, effective and efficient computational methods for PPI prediction are extremely valuable. Machine learning and deep learning techniques have produced remarkable results, but their efficacy is highly reliant on protein interpretation and feature extraction. This chapter will explain various deep learning models that can be used in Bioinformatics as well as the challenges they face.

Keywords:
  • Deep Learning,
  • Bioinformatics,
  • PPIs,
  • Convolutional Neural Network (CNN),
  • Recurrent Neural Network (RNN),
  • Discrete Wavelet Transform ,
  • Deep Belief Network (DBN)
Reference

[1] Y. Li, C. Huang, L. Ding, Z. Li, Y. Pan, and X. Gao, “Deep learning in bioinformatics: Introduction, application, and perspective in the big data era,” Methods, vol. 166, no. April 2019, pp. 4–21, 2019, doi: 10.1016/j.ymeth.2019.04.008.

[2] Z. Liao, G. Pan, C. Sun, and J. Tang, “Predicting subcellular location of protein with evolution information and sequence ‑ based deep learning,” pp. 1–22, 2021.

[3] P. Thareja and R. S. Chhillar, “A review of data mining optimization techniques for bioinformatics applications,” Int. J. Eng. Trends Technol., vol. 68, no. 10, pp. 58–62, 2020, doi: 10.14445/22315381/IJETT-V68I10P210.

[4] Aman and R. S. Chhillar, “Analyzing Predictive Algorithms in Data Mining for Cardiovascular Disease using WEKA Tool,” Int. J. Adv. Comput. Sci. Appl., vol. 12, no. 8, p. 2021, Oct. 2021.

[5] P. Thareja and R. S. Chhillar, “Comparative Analysis of Data Mining Algorithms for Cancer Gene Expression Data,” vol. 12, no. 10, pp. 322–328, 2021, doi: http://dx.doi.org/10.14569/IJACSA.2021.0121035.

[6] N. Sapoval et al., “deep learning across the biosciences,” pp. 1–12, 2022, doi: 10.1038/s41467-022-29268-7.

[7] Aman and R. S. Chhillar, “Disease predictive models for healthcare by using data mining techniques: State of the art,” Int. J. Eng. Trends Technol., vol. 68, no. 10, pp. 52–57, 2020, doi: 10.14445/22315381/IJETT-V68I10P209.

[8] C. Zhang, Y. Lu, and T. Zang, “CNN ‑ DDI : a learning ‑ based method for predicting drug – drug interactions using convolution neural networks,” pp. 1–11, 2022.

[9] E. Elbasani, S. N. Njimbouom, T. J. Oh, E. H. Kim, H. Lee, and J. D. Kim, “GCRNN : graph convolutional recurrent neural network for compound – protein interaction prediction,” pp. 1–13, 2021.

[10] D. Griffith and A. S. Holehouse, “PARROT is a flexible recurrent neural network framework for analysis of large protein datasets,” pp. 1–17, 2021.

[11] S. R. Stahlschmidt, B. Ulfenborg, and J. Synnergren, “Multimodal deep learning for biomedical data fusion : a review,” vol. 23, pp. 1–15, 2022.

[12] P. Supriya, B. Marudamuthu, S. K. Soam, and C. S. Rao, “Trends and Application of Data Science in Bioinformatics BT - Trends of Data Science and Applications: Theory and Practices,” S. S. Rautaray, P. Pemmaraju, and H. Mohanty, Eds. Singapore: Springer Singapore, 2021, pp. 227–244.

[13] P. Thareja and R. S. Chhillar, “A Detailed Survey on Data Mining based Optimization Schemes for Bioinformatics Applications,” 2021.

[14] B. Tang, Z. Pan, K. Yin, and A. Khateeb, “Recent advances of deep learning in bioinformatics and computational biology,” Front. Genet., vol. 10, no. MAR, pp. 1–10, 2019, doi: 10.3389/fgene.2019.00214.

[15] R. Zemouri, N. Zerhouni, and D. Racoceanu, “Deep learning in the biomedical applications: Recent and future status,” Appl. Sci., vol. 9, no. 8, Apr. 2019, doi: 10.3390/APP9081526.

[16] G. Market, “Genomics Market by Product & Service (System & Software, Consumables, Services), Technology (Sequencing, PCR), Application (Drug Discovery & Development, Diagnostic, Agriculture), End User (Hospital & Clinics, Research Centers) – Global Forecast to 2025.” https://www.marketsandmarkets.com/Market-Reports/genomics-market-613.html.

[17] A. Darolia and R. S. Chhillar, “Analyzing Three Predictive Algorithms for Diabetes Mellitus Against the Pima Indians Dataset,” ECS Trans., vol. 107, no. 1, pp. 2697–2704, 2022, doi: 10.1149/10701.2697ecst.

[18] B. Wen et al., “Deep Learning in Proteomics,” Proteomics, vol. 20, no. 21–22, Nov. 2020, doi: 10.1002/PMIC.201900335.

[19] H. S. Basavegowda and G. Dagnew, “Deep learning approach for microarray cancer data classification,” vol. 5, pp. 22–33, 2020, doi: 10.1049/trit.2019.0028.

[20] C. Simon, K. Davidsen, C. Hansen, E. Seymour, M. B. Barnkob, and L. R. Olsen, “BioReader : a text mining tool for performing classification of biomedical literature,” vol. 19, no. Suppl 13, 2019.

[21] A. R. Jamasb, B. Day, ˘ Ta ˘ Lina Cangea, P. Liò, and T. L. Blundell, “Chapter 16 Deep Learning for Protein-Protein Interaction Site Prediction,” doi: 10.1007/978-1-0716-1641-3_16.

[22] R. Syrlybaeva and E.-M. Strauch, “Deep learning of Protein Sequence Design of Protein-protein Interactions,” doi: 10.1101/2022.01.28.478262.

[23] S. Hashemifar, B. Neyshabur, A. A. Khan, and J. Xu, “Predicting protein-protein interactions through sequence-based deep learning,” Bioinformatics, vol. 34, no. 17, pp. i802–i810, 2018, doi: 10.1093/bioinformatics/bty573.

[24] X. Hu, C. Feng, Y. Zhou, A. Harrison, and M. Chen, “DeepTrio: a ternary prediction system for protein–protein interaction using mask multiple parallel convolutional neural networks,” Bioinformatics, vol. 38, no. 3, pp. 694–702, 2022, doi: 10.1093/bioinformatics/btab737.

[25] P. Wang, G. Zhang, Z. G. Yu, and G. Huang, “A Deep Learning and XGBoost-Based Method for Predicting Protein-Protein Interaction Sites,” Front. Genet., vol. 12, no. October, pp. 1–11, 2021, doi: 10.3389/fgene.2021.752732.

[26] J. Yang, N. Li, S. Fang, K. Yu, and Y. Chen, “Semantic Features Prediction for Pulmonary Nodule Diagnosis Based on Online Streaming Feature Selection,” IEEE Access, vol. 7, pp. 61121–61135, 2019, doi: 10.1109/ACCESS.2019.2903682.

© The Author(s), under exclusive license to Technoarete Publishers 2022
  • ISBN -
  • Instant PDF download
  • Readable on all devices
doi.org/10.36647/MLAIDA/2022.12.B1.Ch009