Binary Grey Wolf Optimizer in Diabetes Prediction

Authors

  • Andi Nugroho, Harco Leslie Hendric Spits Warnars, Sani Muhamad Isa, Widodo Budiharto

Keywords:

BGWO, Classification, Diabetes, KNN, SVM

Abstract

Diabetes is a disease that is feared today because it has claimed quite a lot of victims. Thus knowing diabetes early can prevent death from diabetes. However, for diabetes detection classification tools, there is still no one that uses z-score normalization in the classification of the PIMA diabetes dataset, The purpose of this research is to be able to find a classification model for predicting diabetes from an early age. we will take a public dataset from the Pima Indians Diabetes Database (PIDD). The method that will be used in this study uses an optimization approach with the Improve Binary Grey Wolf Optimizer (BGWO) and Z-Score normalization, for testing it will use classification algorithms such as Support Vector Machine (SVM), Decision Tree (DT), k-Nearest Neighbors (KNN). The BGWO optimization algorithm and Z-Score normalization is used to optimize the performance of the SVM, DT, and KNN classification algorithm in producing the best accuracy value. The BGWO optimization algorithm and Z-Score normalization can increase the accuracy value from KNN = 76.5%, DT = 74.7%, and SVM = 77.7% to KNN + BGWO = 78%, DT + BGWO = 76.2% and SVM + BGWO = 79.8% in PIDD dataset.

Downloads

Download data is not yet available.

References

N. Fazakis, O. Kocsis, E. Dritsas, S. Alexiou, N. Fakotakis, and K. Moustakas, “Machine Learning Tools for Long-Term Type 2 Diabetes Risk Prediction,” IEEE Access, vol. 9, pp. 103737–103757, 2021.

R. Rajni and A. Amandeep, “RB-Bayes algorithm for the prediction of diabetic in Pima Indian dataset,” Int. J. Electr. Comput. Eng., vol. 9, no. 6, p. 4866, Dec. 2019.

D. Sisodia and D. S. Sisodia, “Prediction of diabetes using classification algorithms,” Procedia Comput. Sci., 2018.

J. J. Khanam and S. Y. Foo, “A comparison of machine learning algorithms for diabetes prediction,” ICT Express, vol. 7, no. 4, pp. 432–439, 2021.

@data-society, “Pima Indians Diabetes Database,” 2016. [Online]. Available: https://data.world/data-society/pima-indians-diabetes-database/workspace/file?filename=diabetes.csv. [Accessed: 22-Nov-2022].

M. H. Nadimi-Shahraki, S. Taghian, and S. Mirjalili, “An improved grey wolf optimizer for solving engineering problems,” Expert Syst. Appl., vol. 166, p. 113917, 2021.

M. Shariati et al., “A novel hybrid extreme learning machine–grey wolf optimizer (ELM-GWO) model to predict compressive strength of concrete with partial replacements for cement,” Eng. Comput., vol. 38, no. 1, pp. 757–779, 2022.

R. Liu and C. S. Pun, “Machine-Learning-enhanced systemic risk measure: A Two-Step supervised learning approach,” J. Bank. Financ., vol. 136, 2022.

N. Abellán, E. Baquedano, and M. Domínguez-rodrigo, “marks using machine learning methods and computer vision algorithms ,” Geobios, 2022.

M. Abdel-Basset, K. M. Sallam, R. Mohamed, I. Elgendi, K. Munasinghe, and O. M. Elkomy, “An Improved Binary Grey-Wolf Optimizer with Simulated Annealing for Feature Selection,” IEEE Access, vol. 9, pp. 139792–139822, 2021.

A. M. Helmi, M. A. A. Al-Qaness, A. Dahou, R. Damaševičius, T. Krilavičius, and M. A. Elaziz, “A novel hybrid gradient-based optimizer and grey wolf optimizer feature selection method for human activity recognition using smartphone sensors,” Entropy, vol. 23, no. 8, 2021.

M. Roayaei, “On the binarization of Grey Wolf optimizer: a novel binary optimizer algorithm,” Soft Comput., vol. 25, no. 23, pp. 14715–14728, 2021.

Q. Al-Tashi, S. J. Abdul Kadir, H. M. Rais, S. Mirjalili, and H. Alhussian, “Binary Optimization Using Hybrid Grey Wolf Optimization for Feature Selection,” IEEE Access, vol. 7, pp. 39496–39508, 2019.

T. Thaher, M. Awad, M. Aldasht, A. Sheta, H. Turabieh, and H. Chantar, An Enhanced Evolutionary Based Feature Selection Approach Using Grey Wolf Optimizer for the Classification of High-dimensional Biological Data, vol. 28, no. 5. 2022.

X. Sun, Z. Jin, Y. Cai, Z. Yang, and L. Chen, “Grey Wolf Optimization Algorithm Based State Feedback Control for a Bearingless Permanent Magnet Synchronous Machine,” IEEE Trans. Power Electron., vol. 35, no. 12, pp. 13631–13640, 2020.

E. Momanyi and D. Segera, “A Master-Slave Binary Grey Wolf Optimizer for Optimal Feature Selection in Biomedical Data Classification,” Biomed Res. Int., vol. 2021, 2021.

Q. Al-Tashi et al., “Binary Multi-Objective Grey Wolf Optimizer for Feature Selection in Classification,” IEEE Access, vol. 8, pp. 106247–106263, 2020.

D. Segera, M. Mbuthia, and A. Nyete, “An excited binary grey Wolf optimizer for feature selection in highly dimensional datasets,” ICINCO 2020 - Proc. 17th Int. Conf. Informatics Control. Autom. Robot., no. Icinco, pp. 125–133, 2020.

X. Li and K. M. Luk, “The Grey Wolf Optimizer and Its Applications in Electromagnetics,” IEEE Trans. Antennas Propag., vol. 68, no. 3, pp. 2186–2197, 2020.

J. Hu et al., “Orthogonal learning covariance matrix for defects of grey wolf optimizer: Insights, balance, diversity, and feature selection,” Knowledge-Based Syst., vol. 213, p. 106684, 2021.

R. Al-Wajih, S. J. Abdulkadir, N. Aziz, Q. Al-Tashi, and N. Talpur, “Hybrid binary grey Wolf with Harris hawks optimizer for feature selection,” IEEE Access, vol. 9, pp. 31662–31677, 2021.

P. Hu, J. S. Pan, and S. C. Chu, “Improved Binary Grey Wolf Optimizer and Its application for feature selection,” Knowledge-Based Syst., vol. 195, p. 105746, 2020.

P. Vijayaragavan, R. Ponnusamy, and M. Aramudhan, “An optimal support vector machine based classification model for sentimental analysis of online product reviews,” Futur. Gener. Comput. Syst., vol. 111, pp. 234–240, 2020.

J. Cervantes, F. Garcia-Lamont, L. Rodríguez-Mazahua, and A. Lopez, “A comprehensive survey on support vector machine classification: Applications, challenges and trends,” Neurocomputing, no. xxxx, 2020.

C. Yuan, L. Yang, and P. Sun, “Correntropy-based metric for robust twin support vector machine,” Inf. Sci. (Ny)., vol. 545, pp. 82–101, 2021.

C. Jimenez-Castaño, A. Alvarez-Meza, and A. Orozco-Gutierrez, “Enhanced automatic twin support vector machine for imbalanced data classification,” Pattern Recognit., vol. 107, 2020.

Z. Che, B. Liu, Y. Xiao, and H. Cai, “Twin support vector machines with privileged information,” Inf. Sci. (Ny)., vol. 573, pp. 141–153, 2021.

R. B. Lukmanto, Suharjito, A. Nugroho, and H. Akbar, “Early detection of diabetes mellitus using feature selection and fuzzy support vector machine,” Procedia Comput. Sci., vol. 157, pp. 46–54, 2019.

I. G. Lee, Q. Zhang, S. W. Yoon, and D. Won, “A mixed integer linear programming support vector machine for cost-effective feature selection,” Knowledge-Based Syst., vol. 203, p. 106145, 2020.

A. Gejji, S. Shukla, S. Pimparkar, T. Pattharwala, and A. Bewoor, “Using a support vector machine for building a quality prediction model for center-less honing process,” Procedia Manuf., vol. 46, no. 2019, pp. 600–607, 2020.

C. Mallika and S. Selvamuthukumaran, “A Hybrid Crow Search and Grey Wolf Optimization Technique for Enhanced Medical Data Classification in Diabetes Diagnosis System,” Int. J. Comput. Intell. Syst., vol. 14, no. 1, 2021.

I. Stancic, L. Veic, J. Music, and T. Grujic, “Classification of Low-Resolution Flying Objects in Videos Using the Machine Learning Approach,” Adv. Electr. Comput. Eng., vol. 22, no. 2, pp. 45–52, 2022.

A. Plaia, S. Buscemi, J. Fürnkranz, and E. L. Mencía, “Comparing Boosting and Bagging for Decision Trees of Rankings,” J. Classif., vol. 39, no. 1, pp. 78–99, 2022.

L. Vanfretti and V. S. N. Arava, “Decision tree-based classification of multiple operating conditions for power system voltage stability assessment,” Int. J. Electr. Power Energy Syst., vol. 123, no. October 2019, p. 106251, 2020.

B. Charbuty and A. Abdulazeez, “Classification based on decision tree algorithm for machine learning,” Journal of Applied Science and Technology …. jastt.org, 2021.

N. Maleki, Y. Zeinali, and S. T. A. Niaki, “A k-NN method for lung cancer prognosis with the use of a genetic algorithm for feature selection,” Expert Syst. Appl., vol. 164, no. July 2019, p. 113981, 2021.

S. Li, K. Zhang, Q. Chen, S. Wang, and S. Zhang, “Feature Selection for High Dimensional Data Using Weighted K-Nearest Neighbors and Genetic Algorithm,” IEEE Access, vol. 8, pp. 139512–139528, 2020.

S. Zhang, “Challenges in KNN Classification,” IEEE Trans. Knowl. Data Eng., pp. 1–13, 2021.

S. Li, K. Zhang, Y. Li, S. Wang, and S. Zhang, “Online streaming feature selection based on neighborhood rough set,” Appl. Soft Comput., vol. 113, p. 108025, 2021.

V. Aggarwal, V. Gupta, P. Singh, K. Sharma, and N. Sharma, “Detection of spatial outlier by using improved Z-score test,” Proc. Int. Conf. Trends Electron. Informatics, ICOEI 2019, vol. 2019-April, no. Icoei, pp. 788–790, 2019.

R. Patra and B. khuntia, “Analysis and Prediction Of Pima Indian Diabetes Dataset Using SDKNN Classifier Technique,” IOP Conf. Ser. Mater. Sci. Eng., vol. 1070, no. 1, p. 012059, 2021.

Downloads

Published

24.03.2024

How to Cite

Andi Nugroho. (2024). Binary Grey Wolf Optimizer in Diabetes Prediction. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 3659–3664. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6026

Issue

Section

Research Article