Multivariate Analysis on Personalized Cancer Data using a Hybrid Classification Model using Voting Classifier

Authors

  • Ashok Reddy Kandula Research Scholar, Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, Chidambaram, Tamil Nadu-608002, India
  • R. Sathya Assistant Professor, Department of Computer Science and Engineering, Annamalai University, Annamalai Nagar, Chidambaram, Tamil Nadu-608002, India
  • S. Narayana Professor, Department of Computer Science and Engineering, Seshadri Rao Gudlavalleru Engineering College, Gudlavalleru-521356, India

Keywords:

Artificial Intelligence, Medical data analysis, gene mutation, ensemble, Logistic Regression, Support Vector Machine, Random Forest

Abstract

Cancer was found to be a significant burdening type of problem in the medical system. An accurate diagnosis is considered to be a challenging task for physicians. Modern Artificial Intelligence (AI) research proves that cancers are easily identifiable and early detectable and can be diagnosed by classification with the help of gene mutations that occur in the cells. This paper presented a hybrid classification model using an ensemble machine learning technique to classify the type of cancer class from the given medical dataset. The proposed hybrid voting classifier utilizes guided WOA (Whale Optimization Algorithm) that aggregates different classification models' prediction results to choose the most voted class. This helps to increase the chance that individual classifiers, e.g., Logistic Regression (LR), Support Vector Machines (SVM), and Random Forest (RF), show significant discrepancies. The results were found to be better compared to the existing work.

Downloads

Download data is not yet available.

References

Yan, K., & Lu, H. (2019, January). Evaluating ensemble learning impact on gene selection for automated cancer diagnosis. In International Workshop on Health Intelligence (pp. 183-186). Springer, Cham.

Kandula, A. R. (2021). R, S., & S, N., Performing Uni-variate Analysis on Cancer Gene Mutation Data Using SGD Optimized Logistic Regression. International Journal of Engineering Trends and Technology, 69(2), 59-67.

KANDULA, A. R., SATHYA, R., & NARAYANA, S. (2022). COMPARATIVE ANALYSIS OF MACHINE LEARNING TECHNIQUES ON GENETIC MUTATION-BASED CANCER DIAGNOSIS DATA. Journal of Theoretical and Applied Information Technology, 100(6).

Christodoulou, E., Ma, J., Collins, G. S., Steyerberg, E. W., Verbakel, J. Y., & Van Calster, B. (2019). A systematic review shows no performance benefit of machine learning over logistic regression for clinical prediction models. Journal of clinical epidemiology, 110, 12-22.

Wei, L., Wan, S., Guo, J., & Wong, K. K. (2017). A novel hierarchical selective ensemble classifier with bioinformatics application. Artificial intelligence in medicine, 83, 82-90.

Zhou, Y., Cheng, G., Jiang, S., & Dai, M. (2020). Building an efficient intrusion detection system based on feature selection and ensemble classifier. Computer networks, 174, 107247.

Pratama, M., Pedrycz, W., & Lughofer, E. (2018). Evolving ensemble fuzzy classifier. IEEE Transactions on Fuzzy Systems, 26(5), 2552-2567.

Qiu, W. R., Xiao, X., Xu, Z. C., & Chou, K. C. (2016). iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier. Oncotarget, 7(32), 51270.

Yousaf, A., Umer, M., Sadiq, S., Ullah, S., Mirjalili, S., Rupapara, V., & Nappi, M. (2020). Emotion recognition by textual tweets classification using the voting classifier (LR-SGD). IEEE Access, 9, 6286-6295.

Atallah, R., & Al-Mousa, A. (2019, October). Heart disease detection using machine learning majority voting ensemble method. In 2019 2nd international conference on new trends in computing sciences (ictcs) (pp. 1-6). IEEE.

Assiri, A. S., Nazir, S., & Velastin, S. A. (2020). Breast tumor classification using an ensemble machine learning method. Journal of Imaging, 6(6), 39.

El-Kenawy, E. S. M., Ibrahim, A., Mirjalili, S., Eid, M. M., & Hussein, S. E. (2020). Novel feature selection and voting classifier algorithms for COVID-19 classification in CT images. IEEE Access, 8, 179317-179335.

Sreedhar P, Siva Satya & Kandula, Ashok & Tamilarasi, K & Maan, Srikrishna & Alsekait, Deema & Publishing House, International Scitech. (2022). Deep Neural Network for Image Recognition In Medical Diagnosis. Journal of Pharmaceutical Negative Results. 2022. 10.47750/pnr.2022.13.S09.47.

Goyal, A., & Sardana, N. (2019, August). Empirical analysis of ensemble machine learning techniques for bug triaging. In 2019 Twelfth International Conference on Contemporary Computing (IC3) (pp. 1-6). IEEE.

Jan, Z. M., & Verma, B. (2019, June). Ensemble classifier optimization by reducing input features and base classifiers. In 2019 IEEE Congress on Evolutionary Computation (CEC) (pp. 1580-1587). IEEE.

Dogan, A., & Birant, D. (2019, September). A weighted majority voting ensemble approach for classification. In 2019 4th International Conference on Computer Science and Engineering (UBMK) (pp. 1-6). IEEE.

Huang, S., Cai, N., Pacheco, P. P., Narrandes, S., Wang, Y., & Xu, W. (2018). Applications of support vector machine (SVM) learning in cancer genomics. Cancer genomics & proteomics, 15(1), 41-51

Sheykhmousa, M., Mahdianpari, M., Ghanbari, H., Mohammadimanesh, F., Ghamisi, P., & Homayouni, S. (2020). Support vector machine versus random forest for remote sensing image classification: A meta-analysis and systematic review. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 13, 6308-6325

Dogan, A., & Birant, D. (2019, September). A weighted majority voting ensemble approach for classification. In 2019 4th International Conference on Computer Science and Engineering (UBMK) (pp. 1-6). IEEE.

Swarajya lakshmi v papineni, A.Mallikarjuna Reddy, Sudeepti yarlagadda , Snigdha Yarlagadda, Haritha Akkineni "An Extensive Analytical Approach on Human Resources using Random Forest Algorithm" International Journal of Engineering Trends and Technology 69.5(2021):119-127.

A. Mallikarjuna Reddy, V. Venkata Krishna, L. Sumalatha, “Efficient Face Recognition by Compact Symmetric Elliptical Texture Matrix (CSETM)”, Jour of Adv Research in Dynamical & Control Systems, Vol. 10, 4-Regular Issue, 2018.

Mallikarjuna Reddy, A., Rupa Kinnera, G., Chandrasekhara Reddy, T., Vishnu Murthy, G., et al., (2019), “Generating cancelable fingerprint template using triangular structures”, Journal of Computational and Theoretical Nanoscience, Volume 16, Numbers 5-6, pp. 1951-1955(5), doi: https://doi.org/10.1166/jctn.2019.7830.

Mallikarjuna Reddy, A.,Venkata Krishna, V. and Sumalatha, L.” Face recognition approaches: A survey” International Journal of Engineering and Technology (UAE), 4.6 Special Issue 6, volume number 7 , 117-121,2018.

Mallikarjuna A. Reddy, Sudheer K. Reddy, Santhosh C.N. Kumar, Srinivasa K. Reddy, “Leveraging bio-maximum inverse rank method for iris and palm recognition”, International Journal of Biometrics, 2022 Vol.14 No.3/4, pp.421 - 438, DOI: 10.1504/IJBM.2022.10048978.

V. NavyaSree, Y. Surarchitha, A. M. Reddy, B. Devi Sree, A. Anuhya and H. Jabeen, "Predicting the Risk Factor of Kidney Disease using Meta Classifiers," 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon), 2022, pp. 1-6, doi: 10.1109/MysuruCon55714.2022.9972392.

B. H. Rao et al., "MTESSERACT: An Application for Form Recognition in Courier Services," 2022 3rd International Conference on Smart Electronics and Communication (ICOSEC), 2022, pp. 848-853, doi: 10.1109/ICOSEC54921.2022.9952031.

P. S. Silpa et al., "Designing of Augmented Breast Cancer Data using Enhanced Firefly Algorithm," 2022 3rd International Conference on Smart Electronics and Communication (ICOSEC), 2022, pp. 759-767, doi: 10.1109/ICOSEC54921.2022.9951883.

B. S. Reddy, A. Mallikarjuna Reddy, M. H. D. S. Sradda, T. Mounika, S. Mounika and M. K, "A Comparative Study on Object Detection Using Retinanet," 2022 IEEE 2nd Mysore Sub Section International Conference (MysuruCon), 2022, pp. 1-6, doi: 10.1109/MysuruCon55714.2022.9972742.

Model Architecture

Downloads

Published

04.02.2023

How to Cite

Kandula, A. R. ., Sathya, R. ., & Narayana, S. . (2023). Multivariate Analysis on Personalized Cancer Data using a Hybrid Classification Model using Voting Classifier. International Journal of Intelligent Systems and Applications in Engineering, 11(1), 354–362. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2546

Issue

Section

Research Article