Classification of DNA Microarray Gene Expression Leukaemia Data through ABC and CNN Method

Authors

  • Abdul Wahid Research Scholar, Department of Electronics & Inst. Technology.University of Kashmir, INDIA
  • M. Tariq Banday Professor, Department of Electronics & Inst. Technology,University of Kashmir, INDIA

Keywords:

Gene Expression, DNA Micro-array, Feature Selection, Swarm Intelligence, ABC Algorithm

Abstract

Biomedical and health-care informatics research is increasingly using big data technology. At an unprecedented velocity and scale, large volumes of biological and clinical data have been created and gathered. DNA microarray classification has been widely used in biological and medical research to study gene expression patterns, identify disease biomarkers, classify cancer subtypes, predict treatment responses, and discover novel gene functions. Predictive analytics is becoming more popular for its applications in healthcare and has a lot of potential. While the performance concerns are still in a way and optimization approaches are used to address these concerns. In the proposed methodology we have adopted hybrid approach of optimization algorithm (Artificial Bee colony) for feature selection and deep learning (Convolutional Neural Network) method for classification. ABC method helps in obtaining best features which also improves the accuracy of classification.  The accuracy and other performance characteristics of the proposed algorithm CNN are examined. To demonstrate the usefulness of the proposed model, it is compared to various algorithms such as decision tree, random forest, and KNN based on performance metrics and the proposed approach achieves 98% accuracy which is remarkable as compared to other approaches.

Downloads

Download data is not yet available.

References

Gambhir, S., Malik, S. K., & Kumar, Y. (2016). Role of soft computing approaches in healthcare domain: a mini review. Journal of medical systems, 40(12), 1-20.

Lundberg, S. M., Nair, B., Vavilala, M. S., Horibe, M., Eisses, M. J., Adams, T., ... & Lee, S. I. (2018). Explainable machine-learning predictions for the prevention of hypoxaemia during surgery. Nature biomedical engineering, 2(10), 749-760.

Khan, S., Khan, A., Maqsood, M., Aadil, F., & Ghazanfar, M. A. (2019). Optimized gabor feature extraction for mass classification using cuckoo search for big data e-healthcare. Journal of Grid Computing, 17(2), 239-254.

Chawda, B., & Patel, J. (2016). Natural Computing Algorithms–A Survey. International Journal of Emerging Technology and Advanced Engineering, 6(6).

Nayar, N., Ahuja, S., & Jain, S. (2019). Swarm intelligence for feature selection: a review of literature and reflection on future challenges. Advances in data and information sciences, 211-221.

Wang, L., Xi, Y., Sung, S., & Qiao, H. (2018). RNA-seq assistant: machine learning based methods to identify more transcriptional regulated genes. BMC genomics, 19(1), 1-13.

Dash, S., Shakyawar, S. K., Sharma, M., & Kaushik, S. (2019). Big data in healthcare: management, analysis and future prospects. Journal of Big Data, 6(1), 1-25.

Golub, T. R., et al. (1999). Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring. Science (80-.), 286(5439), 531–537.

Wang, J., Bø, T. H., Jonassen, I., Myklebost, O., & Hovig, E. (2003). Tumor classification and marker gene prediction by feature selection and fuzzy c-means clustering using microarray data. BMC Bioinformatics, 4(1), 60

Kim, J., et al. (2005). Identification of potential biomarkers of genotoxicity and carcinogenicity in L5178Y mouse lymphoma cells by cDNA microarray analysis. Environmental and Molecular Mutagenesis, 45(1), 80–89

Hambali, M., Saheed, Y., Oladele, T., & Gbolagade, M. (2019). ADABOOST ensemble algorithms for breast cancer classification. Journal of Advance Computer Research, 10(2), 31–52.

Hassan, M. K., El Desouky, A. I., Elghamrawy, S. M., & Sarhan, A. M. (2019). Big data challenges and opportunities in healthcare informatics and smart hospitals. In Security in smart cities: Models, applications, and challenges (pp. 3-26). Springer, Cham.

Klug, M., Barash, Y., Bechler, S., Resheff, Y. S., Tron, T., Ironi, A., ... & Klang, E. (2020). A gradient boosting machine learning model for predicting early mortality in the emergency department triage: devising a nine-point triage score. Journal of general internal medicine, 35(1), 220-227.

Soffer, S., Klang, E., Barash, Y., Grossman, E., & Zimlichman, E. (2021). Predicting In-Hospital Mortality at Admission to the Medical Ward: A Big-Data Machine Learning Model. The American journal of medicine, 134(2), 227-234.

Alonso-González, C.J., Moro-Sancho, Q.I., Simon-Hurtado, A. and Varela-Arrabal, R., 2012. Microarray gene expression classification with few genes: Criteria to combine attribute selection and classification methods. Expert Systems with Applications, 39(8), pp.7270-7280.

Alshamlan, Hala M., Ghada H. Badr, and Yousef A. Alohali. "Abc-svm: artificial bee colony and svm method for microarray gene selection and multi class cancer classification." Int. J. Mach. Learn. Comput 6, no. 3 (2016): 184.

Sheikhpour, R., and M. Aghaseram. "Diagnosis of acute myeloid and lymphoblastic leukemia using gene selection of microarray data and data mining algorithm." Scientific Journal of Iran Blood Transfus Organ 12, no. 4 (2016): 347-357.

Dwivedi, Ashok Kumar. "Artificial neural network model for effective cancer classification using microarray gene expression data." Neural Computing and Applications 29, no. 12 (2018): 1545-1554.

Arif, Muhammad Azharuddin, and Zuraini Ali Shah. "Implementation of Statistical Feature Selection and Feature Extraction on Cancer Classification." Academia of Intelligence Computing 1, no. 1 (2020): 21-29.

Remeseiro López, B., & Bolon Canedo, V. (2019). A review of feature selection methods in medical applications. Computers in Biology and Medicine, 112

Ashok Kumar, L. ., Jebarani, M. R. E. ., & Gokula Krishnan, V. . (2023). Optimized Deep Belief Neural Network for Semantic Change Detection in Multi-Temporal Image. International Journal on Recent and Innovation Trends in Computing and Communication, 11(2), 86–93. https://doi.org/10.17762/ijritcc.v11i2.6132

Guofeng Zhou et al, Employing artificial bee colony and particle swarm techniques for optimizing a neural network in prediction of heating and cooling loads of residential buildings, Journal of Cleaner ProductionVolume 254, 1 May 2020, 120082.

Doreswamy, & UmmeSalma, M. (2016). A binary bat inspired algorithm for the classification of breast cancer data. International Journal on Soft Computing Intelligence and Applications, 5(2/3), 1–21.

Alomari, O. A., et al. (2017). Mrmr ba: A hybrid gene selection algorithm for cancer classification. Journal of Theoretical and Applied Information Technology, 95(12), 15

Tabares-Soto, R.; Orozco-Arias, S.; Romero-Cano, V.; Segovia Bucheli, V.; Rodríguez-Sotelo, J.L.; Jiménez-Varón, C.F. A Comparative Study of Machine Learning and Deep Learning Algorithms to Classify Cancer Types Based on Microarray Gene Expression Data. PeerJ Comput. Sci. 2020, 6, e270.

Hira, S.; Bai, A. A Novel Map Reduced Based Parallel Feature Selection and Extreme Learning for Micro Array Cancer Data Classification. Wirel. Pers. Commun. 2022, 123, 1483–1505.

Vaiyapuri, T.; Liyakathunisa; Alaskar, H.; Aljohani, E.; Shridevi, S.; Hussain, A. Red Fox Optimizer with Data-Science-Enabled Microarray Gene Expression Classification Model. Appl. Sci. 2022, 12, 4172.

Deng, X.; Li, M.; Deng, S.; Wang, L. Hybrid Gene Selection Approach Using XGBoost and Multi-Objective Genetic Algorithm for Cancer Classification. Med. Biol. Eng. Comput. 2022, 60, 663–681.

Rostami, M.; Forouzandeh, S.; Berahmand, K.; Soltani, M.; Shahsavari, M.; Oussalah, M. Gene Selection for Microarray Data Classification via Multi-Objective Graph Theoretic-Based Method. Artif. Intell. Med. 2022, 123, 102228.

Xie, W.; Fang, Y.; Yu, K.; Min, X.; Li, W. MFRAG: Multi-Fitness RankAggreg Genetic Algorithm for Biomarker Selection from Microarray Data. Chemom. Intell. Lab. Syst. 2022, 226, 104573.

Zellar, P. I. . (2021). Business Security Design Improvement Using Digitization. International Journal of New Practices in Management and Engineering, 10(01), 19–21. https://doi.org/10.17762/ijnpme.v10i01.98

Ngiam, K. Y., & Khor, W. (2019). Big data and machine learning algorithms for health-care delivery. The Lancet Oncology, 20(5), e262-e273.

Yu, H., Yang, L. T., Zhang, Q., Armstrong, D., & Deen, M. J. (2021). Convolutional neural networks for medical image analysis: state-of-the-art, comparisons, improvement and perspectives. Neurocomputing, 444, 92-110.

Abiodun, O. I., Jantan, A., Omolara, A. E., Dada, K. V., Mohamed, N. A., & Arshad, H. (2018). State-of-the-art in artificial neural network applications: A survey. Heliyon, 4(11), e00938.

Talasila, V., Madhubabu, K., Madhubabu, K., Mahadasyam, M., Atchala, N., & Kande, L. (2020). The prediction of diseases using rough set theory with recurrent neural network in big data analytics. International Journal of Intelligent Engineering and Systems, 13(5), 10-18.

Jayasri, N. P., & Aruna, R. (2021). Big data analytics in health care by data mining and classification techniques. ICT Express.

Ayyad, S. M., Saleh, A. I., & Labib, L. M. (2019). Gene expression cancer classification using modified K-Nearest Neighbors technique. Biosystems, 176, 41-51.

Proposed Implementation Architecture

Downloads

Published

01.07.2023

How to Cite

Wahid, A. ., & Banday, M. T. . (2023). Classification of DNA Microarray Gene Expression Leukaemia Data through ABC and CNN Method. International Journal of Intelligent Systems and Applications in Engineering, 11(7s), 119–131. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2939