NGBFA Feature Selection Algorithm-based Hybrid Ensemble Classifier to Predict Cervical Cancer

Authors

  • CH Bhavani, Ch. Sarada, A. Jyothi Babu, Gurrampally Kumar, Moligi Sangeetha, Patlegar Vijay Kumar

Keywords:

feature selection, Machine learning, cervical cancer, Hybrid Ensemble, classification

Abstract

Early diagnosis may cure cervical cancer. Researchers have struggled to prediction the disease's course because there are no early indications. Several machine learning methods have predicted CC in the past decade. Ensemble techniques generate and integrate several models for more accurate results. This contrasts with single-classifier prediction. During this research, we established "Robust Model Stacking: A Hybrid Ensemble." This classifier runs a homogeneous classifier-based classifications at the base level, then a heterogeneous ensemble that predicts additional data using majority voting (soft). This study included 858 patients, 32 risk indicator characteristics, and four CC diagnosis test targets. SMOTE oversampling solved the data imbalance problem. For each of the dataset's four goal variables, accuracy, recall, f1-score, precision, and AUC-ROC were used to assess the model. The proposed biopsy approach is 98% accurate, Hinselmann 97%, Schiller 96.09%, and Citology 93%. Ensemble learning improves prediction accuracy and reduces bias and variation in this study.

Downloads

Download data is not yet available.

References

Organization CDC, Centers for Disease Control and Prevention retrieved from https://www.cdc.gov/cancer/cervical/basic_info/index.htm#:~:text=When%20cancer%20starts%20in%20 the,at%20risk%20for%20cervical%20cancer. Last accessed on 06/04/2022.

W. H. O. R. Health, W. H. O. C. Diseases, and H. Promotion, Comprehensive Cervical Cancer Control: A Guide to Essential Practice, World Health Organization, Geneva, Switzerland, 2006.

Organization W H 2022 retrieved from https://www.who.int/health-topics/cervicalcancer#tab=tab_1 last accessed 06-04-2022.

Organization W H 2021 retrieved from https://www.who.int/news-room/fact-sheets/detail/human-papillomavirus-(hpv)-and-cervical-cancer

Bosch FX, de Sanjosé S. Chapter 1: Human papillomavirus and cervical cancer--burden and assessment of causality. J Natl Cancer Inst Monogr. 2003;(31):3-13. doi: 10.1093/oxfordjournals.jncimonographs.a003479. PMID: 12807939.

[6] Ch. Bhavani, Dr. A. Govardhan. Supervised Algorithms of Machine Learning in the Prediction of Cervical Cancer: A Comparative Analysis. Annals of the Romanian Society for Cell Biology, 1380–1393. 2021, Retrieved from https:// www.annalsofrscb.ro/index.php/journal/article/view/4580

Organization ASCO retrieved from https://www.cancer.net/cancer-types/cervical-cancer/symptoms-and-signs last accessed 06/04/2022.

Jaswinder Singh & Sandeep Shrama. (2019). Prediction of Cervical Cancer Using Machine Learning Techniques. International Journal of Applied Engineering Research ISSN 0973-4562 Volume 14, Number 11 (2019) pp. 2570-2577.

Akter, L., Ferdib-Al-Islam, Islam, M.M. et al. Prediction of Cervical Cancer from Behavior Risk Using Machine Learning Techniques. SN COMPUT. SCI. 2, 177 (2021). https://doi.org/10.1007/s42979-021-00551-6.

Mohamed Hosni, Ibtissam Abnane, Ali Idri, Juan M. Carrillo de Gea, José Luis Fernández Alemán,Reviewing ensemble classification methods in breast cancer, Computer Methods and Programs in Biomedicine, Volume 177, 2019, Pages 89-112, ISSN 0169-2607, https://doi.org/10.1016/j.cmpb.2019.05.019. (https://www.sciencedirect.com/science/article/pii/S0169260719301907)

A. Idri, M. Hosni, A. Abran, Systematic Mapping Study of Ensemble Effort Estimation, in: Proc. 11th Int. Conf. Eval. Nov. Softw. Approaches to Softw. Eng., 2016: pp. 132–139. doi:10.5220/0005822701320139.

R.E. Schapire, Measures of Diversity in Classifier Ensembles, Mach. Learn. 51 (2003) 181–207. doi:10.1049/ic:20010105.

Pintelas P, Livieris IE. Special Issue on Ensemble Learning and Applications. Algorithms. 2020; 13(6):140. https://doi.org/10.3390/a13060140.

Karlos, S.; Kostopoulos, G.; Kotsiantis, S. A Soft-Voting Ensemble Based Co-Training Scheme Using Static Selection for Binary Classification Problems. Algorithms 2020, 13, 26.

Sabzevari, M., Martínez-Muñoz, G. & Suárez, A. Building heterogeneous ensembles by pooling homogeneous ensembles. Int. J. Mach. Learn. & Cyber. 13, 551–558 (2022). https://doi.org/10.1007/s13042-021-01442-1.

J. L. Fernández-Alemán, J. M. Carrillo-de-Gea, M. Hosni, A. Idri and G. García-Mateos, "Homogeneous and heterogeneous ensemble classification methods in diabetes disease: a review," 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC), 2019, pp. 3956-3959, doi: 10.1109/EMBC.2019.8856341.

Esra’a Alshdaifat, Malak Al-hassan, Ahmad Aloqaily, Effective heterogeneous ensemble classification: An alternative approach for selecting base classifiers, ICT Express, Volume 7, Issue 3, 2021, Pages 342-349, ISSN 2405-9595, https://doi.org/10.1016/j.icte.2020.11.005. (https://www.sciencedirect.com/science/article/pii/S2405959520304872).

de Oliveira JM, dos Santos EM, Carvalho JRH, de Vasconcelos Marques LA (2013) Ensemble of heterogeneous classifiers applied to lithofacies classification using logs from different wells. International joint conference on neural networks. IJCNN, Dallas, TX, USA, pp 1–6.

Nanni L, Brahnam S, Ghidoni S, Lumini A (2015) Toward a general-purpose heterogeneous ensemble for pattern classifcation. Comput Intell Neurosci 2015:85.

Haque MN, Noman N, Berretta R, Moscato P (2016b) Heterogeneous ensemble combination search using genetic algorithm for class imbalanced data classifcation. PloS One 11(1):e0146116.

Alshdaifat E, Al-hassan M, Aloqaily A (2020) Effective heterogeneous ensemble classification: an alternative approach for selecting base classifiers. ICT Express, South Korea.

Luong, Anh Vu, Vu, Trung Hieu, Nguyen, Phuong Minh, Van Pham, Nang, McCall, John, Liew, Alan Wee-Chung, Nguyen, Tien Thanh, "A homogeneous-heterogeneous ensemble of classifiers.", , 2020.

Nanglia, S., Ahmad, M., Khan, F. A., & Jhanjhi, N. Z. (2022). An enhanced Predictive heterogeneous ensemble model for breast cancer prediction. Biomedical Signal Processing and Control, 72, 103279.

Feng Y, Wang X, Zhang J. A heterogeneous ensemble learning method for neuroblastoma survival prediction. IEEE J Biomed Health Inform. 2021 Apr 13;PP. doi: 10.1109/JBHI.2021.3073056. Epub ahead of print. PMID: 33848254.

Fernandes, K., Chicco, D. and Cardoso, J.S. (2018) Supervised deep learning embeddings for the prediction of cervical cancer diagnosis. PeerJ, 4, 1–20.

Mehmood M, Rizwan M, Gregus ml M and Abbas S (2021) Machine Learning Assisted Cervical Cancer Detection. Front. Public Health 9:788376. doi: 10.3389/fpubh.2021.788376.

Pinar Yildirim. 2018. Classification with incomplete data and ensemble learners for the prediction of cervical cancer risk. In Proceedings of the International Conference on Intelligent Science and Technology (ICIST '18). Association for Computing Machinery, New York, NY, USA, 1–5. DOI:https://doi.org/10.1145/3233740.3233741.

Barua, Lipika, M. Sharif, and Tania Akter. "Analyzing Cervical Cancer by using an Ensemble Learning Approach based on Meta Classifier." International Journal of Computer Applications 182 (2019): 29-33.

Gupta, S., & Gupta, M. K. (2021). Computational prediction of cervical cancer diagnosis using ensemble-based classification algorithm. The Computer Journal.

CH. Bhavani, A. Govardhan, Cervical cancer prediction using stacked ensemble algorithm with SMOTE and RFERF, Materials Today: Proceedings, Volume 80, Part 3, 2023, Pages 3451-3457, ISSN 2214-7853, https://doi.org/10.1016/j.matpr.2021.07.269. (https://www.sciencedirect.com/science/article/pii/S2214785321051282)

PALETI, L., NAGASRI, A., SUNITHA, P., SANDYA, V., SUMALLIKA, T., KANDUKURI, P., & KUMAR, K. K. (2023). SUGAR CANE LEAF DISEASE CLASSIFICATION AND IDENTIFICATION USING DEEP MACHINE LEARNING ALGORITHMS. Journal of Theoretical and Applied Information Technology, 101(20).

Madhavi, P., Supreethi, K.P. (2023). Multi-dimensional STAQR Indexing Algorithm for Drone Applications. In: Morusupalli, R., Dandibhotla, T.S., Atluri, V.V., Windridge, D., Lingras, P., Komati, V.R. (eds) Multi-disciplinary Trends in Artificial Intelligence. MIWAI 2023. Lecture Notes in Computer Science(), vol 14078. Springer, Cham. https://doi.org/10.1007/978-3-031-36402-0_57, Pages 611-619.

Madhavi, P., and S.K. P. (2023) A Metaphorical Analysis of Different Encoding Techniques for Spatial-Temporal Data. International Journal of Intelligent Systems and Applications in Engineering, vol. 11, no. 11s, pp. 302-308, https://ijisae.org/index.php/IJISAE/article/view/3473.

CH Bhavani, A. Govardhan, A Novel Genetic Inspired Binary Firefly Algorithm For Feature Selection in the Prediction of Cervical Cancer, Advances in Data Science and Adaptive Analysis, doi: 10.1142/S2424922X23500031.

Peng L, Zhang H, Zhang H, Yang B (2017) A fast feature weighting algorithm of data gravitation classification. Inf Sci 375:54–78. https://doi.org/10.1016/j.ins.2016.09.044

Hsu W-Y (2018) A decision-making mechanism for assessing risk factor significance in cardiovascular diseases. Decis Support Syst 115:64–77. https://doi.org/10.1016/j.dss.2018.09.004.

Downloads

Published

26.06.2024

How to Cite

CH Bhavani. (2024). NGBFA Feature Selection Algorithm-based Hybrid Ensemble Classifier to Predict Cervical Cancer. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 960–967. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6318

Issue

Section

Research Article