Exploring Machine Learning in Lung Cancer: Predictive Modelling, Gene Associations, and Challenges

Authors

  • K. Mary Sudha Rani Research Scholar,CSE dept., JNTUH Hyderabad, Assistant Professor,AIML Dept., Chaitanya Bharathi Institute of Technology, Hyderabad.
  • V. Kamakshi Prasad Professor,CSE dept., JNTUH Hyderabad

Keywords:

Lung cancer, prediction, gene association, machine learning, high-throughput genomic data, multi-omics data, support vector machines, random forests, deep understanding, network-based methodologies, predictive modeling, biomarker identification

Abstract

Lung cancer is a disease with a high mortality rate and widespread occurrence. Therefore, developing accurate prediction methods and practical gene association analyses is crucial. The utilization of high-throughput genomic data to reveal significant genetic factors has seen an increase in the application of machine-learning techniques. This document presents a thorough examination of machine learning methodologies that are presently utilized to forecast lung cancer and scrutinize gene correlations. The analysis examines different data types, such as gene expression profiles, genomic variants, and clinical data. The primary focus is on integrating multi-omics data for a more comprehensive understanding. Our study comprehensively examines a variety of machine learning algorithms, including traditional methods such as support vector machines and random forests, advanced deep learning architectures, and network-based methodologies. The following discourse explores the pragmatic utilization of the methods above in predictive modeling, biomarker identification, and drug discovery routes. The article addresses common obstacles in the field, such as interpretability and validation, and proposes potential avenues for future research, such as incorporating multi-omics data and implementing personalized medicine. This survey provides a detailed analysis of the recent advancements in machine learning techniques for lung cancer research. It aims to establish a strong basis for future improvements in diagnosis, prognosis, and treatment strategies.

Downloads

Download data is not yet available.

References

Knox, S. S. (2010, April 26). From "omics" to complex disease: a systems biology approach to gene-environment interactions in cancer. PubMed Central (PMC). https://doi.org/10.1186/1475-2867-10-11

Thandra, K. C., Barsouk, A., Saginala, K., Aluru, J. S., & Barsouk, A. (2021, February 23). Epidemiology of lung cancer. PubMed Central (PMC). https://doi.org/10.5114/wo.2021.103829

Molina, J. R., Yang, P., Cassivi, S. D., Schild, S. E., & Adjei, A. A. (n.d.). Non–Small Cell Lung Cancer: Epidemiology, Risk Factors, Treatment, and Survivorship. PubMed Central (PMC). https://doi.org/10.4065/83.5.584

Machine learning for multi-omics data integration in cancer. (2022, January 22). Machine Learning for Multi-omics Data Integration in Cancer - ScienceDirect. https://doi.org/10.1016/j.isci.2022.103798

Ruan, X., Ye, Y., Cheng, W., Xu, L., Huang, M., Chen, Y., Zhu, J., Lu, X., & Yan, F. (2022, May 6). Multi-Omics Integrative Analysis of Lung Adenocarcinoma: An in silico Profiling for Precise Medicine. Frontiers. https://doi.org/10.3389/fmed.2022.894338

Cano-Gamez, E., & Trynka, G. (2020, April 6). From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases. Frontiers. https://doi.org/10.3389/fgene.2020.00424

Pai, A. (2020, February 17). CNN vs. RNN vs. ANN - Analyzing 3 Types of Neural Networks in Deep Learning. Analytics Vidhya. https://www.analyticsvidhya.com/blog/2020/02/cnn-vs-rnn-vs-mlp-analyzing-3-types-of-neural-networks-in-deep-learning/

A novel candidate disease gene prioritization method using deep graph convolutional networks and semi-supervised learning - PubMed. (2022, October 14). PubMed. https://doi.org/10.1186/s12859-022-04954-x

Mathew, C. J., David, A. M., & Joy Mathew, C. M. (2020, December 11). Artificial Intelligence and its future potential in lung cancer screening. PubMed Central (PMC). https://doi.org/10.17179/excli2020-3095

Vamathevan, J., Clark, D., Czodrowski, P., Dunham, I., Ferran, E., Lee, G., Li, B., Madabhushi, A., Shah, P., Spitzer, M., & Zhao, S. (n.d.). Applications of machine learning in drug discovery and development. PubMed Central (PMC). https://doi.org/10.1038/s41573-019-0024-5

Vamathevan, J., Clark, D., Czodrowski, P., Dunham, I., Ferran, E., Lee, G., Li, B., Madabhushi, A., Shah, P., Spitzer, M., & Zhao, S. (n.d.). Applications of machine learning in drug discovery and development. PubMed Central (PMC). https://doi.org/10.1038/s41573-019-0024-5

Genome, transcriptome, and proteome: the rise of omics data and their integration in biomedical sciences - PubMed. (2018, March 1). PubMed. https://doi.org/10.1093/bib/bbw114

Krassowski, M., Das, V., Sahu, S. K., & Misra, B. B. (2020, November 20). State of the Field in Multi-Omics Research: From Computational Needs to Data Mining and Sharing. Frontiers. https://doi.org/10.3389/fgene.2020.610798

Translational bioinformatics and healthcare informatics: computational and ethical challenges - PubMed. (2009, September 16). PubMed. https://pubmed.ncbi.nlm.nih.gov/20169020/

https://academic.oup.com/nar/article/45/D1/D833/2290909. (n.d.). https://academic.oup.com/nar/article/45/D1/D833/2290909

Witte, J. S. (n.d.). Genome-Wide Association Studies and Beyond. PubMed Central (PMC). https://doi.org/10.1146/annurev.publhealth.012809.103723

Next Generation Sequencing - an overview | ScienceDirect Topics. (n.d.). Next Generation Sequencing - an Overview | ScienceDirect Topics. https://doi.org/10.1016/B978-0-12-801418-9.00002-0

Abass, Y. A., & Adeshina, S. A. (n.d.). Deep Learning Methodologies for Genomic Data Prediction: Review | Atlantis Press. Deep Learning Methodologies for Genomic Data Prediction: Review | Atlantis Press. https://doi.org/10.2991/jaims.d.210512.001

Muzio, G., O’Bray, L., & Borgwardt, K. (2021, March 1). Biological network analysis with deep learning. OUP Academic. https://doi.org/10.1093/bib/bbaa257

State-of-the-art in artificial neural network applications: A survey. (2018, November 23). State-of-the-art in Artificial Neural Network Applications: A Survey - ScienceDirect. https://doi.org/10.1016/j.heliyon.2018.e00938

Piñero, J., Bravo, L., Queralt-Rosinach, N., Gutiérrez-Sacristán, A., Deu-Pons, J., Centeno, E., García-García, J., Sanz, F., & Furlong, L. I. (2016, October 19). DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. PubMed Central (PMC). https://doi.org/10.1093/nar/gkw943

Downloads

Published

17.05.2023

How to Cite

Rani, K. M. S. ., & Prasad, V. K. . (2023). Exploring Machine Learning in Lung Cancer: Predictive Modelling, Gene Associations, and Challenges. International Journal of Intelligent Systems and Applications in Engineering, 11(6s), 490–499. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2874

Issue

Section

Research Article