An Intelligent Harris Hawks Optimization (IHHO) based Pivotal Decision Tree (PDT) Machine Learning Model for Diabetes Prediction

Authors

  • Roma Fayaz Lecturer, Department of Computer Science, College of Computer Science and Information Technology, Jazan University, Jazan, Kingdom of Saudi Arabia
  • G.Vinoda Reddy Professor, Department of Computer Science and Engineering (AI&ML), CMR Technical Campus, Kandlakoya, Medchal (M), Hyderabad, Telangana-501401
  • M. Sujaritha Professor,Sri Krishna College of Engineering and Technology, Kuniyamuthur, Tamil Nadu 641008, India
  • N. Soundiraraj Assistant Professor Department of Electronics and Communication Engineering, PSNA College of Engineering and Technology, Dindigul, Tamil Nadu, 624622, India
  • W.Gracy Theresa Associate Professor, Department of Computer and Science Engineering, Panimalar Engineering College, Chennai, Tamil Nadu 600123, India
  • Dharmendra Kumar Roy Assistant Professor, Department of Computer Science and Engineering, Hyderabad Institute of Technology and Management (HITAM), Hyderabad, Telangana, 501401,India
  • J.Jeffin Gracewell Assistant Professor, Department of Electronics and Communication, Saveetha Engineering College, Chennai, Tamil Nadu 602105,India
  • S. Gopalakrishnan Associate Professor, Department of Electronics and Communication Engineering, Vel Tech Rangarajan Dr.Sagunthala R&D Institute of Science and Technology (Deemed To be University),Chennai-600062, Tamil Nadu, India

Keywords:

Diabetes Prediction, Machine Learning, Inherent Coefficient Normalization (ICN), Intelligent Harris Hawks Optimization (IHHO), Pivotal Decision Tree (PDT), PIMA Indian Dataset

Abstract

In ancient times, an accurate diabetes prediction and type of classification are the most important and demanding tasks in the medical field for providing proper diagnosis to the patients. For this purpose, various machine learning based detection systems are developed in the conventional works to predict the diabetes from the given dataset. Still, it has some limitations with the factors of difficult to understand, high time requirement for training and testing, over fitting, and error outputs. Therefore, the proposed research work objects to implement a group of data mining techniques for developing an automated and efficient diabetes detection system.  In this framework, an Inherent Coefficient Normalization (ICN) technique is implemented at first for preprocessing the PIMA Indian dataset obtained from the repository, which highly improves the quality of data for processing. Then, an Intelligent Harris Hawks Optimization (IHHO) technique is utilized to optimally select the features for training the classifier. Finally, the Pivotal Decision Tree (PDT) based classification technique is deployed to predict the data as whether diabetes or non-diabetes with reduced computational complexity and time consumption. During analysis, the performance and results of the proposed IHHO-PDT technique is validated and compared using various measures.

Downloads

Download data is not yet available.

References

A. Paleczek, D. Grochala, and A. Rydosz, "Artificial breath classification using XGBoost algorithm for diabetes detection," Sensors, vol. 21, p. 4187, 2021.

G. Kalyani, B. Janakiramaiah, A. Karuna, and L. Prasad, "Diabetic retinopathy detection and classification using capsule networks," Complex & Intelligent Systems, pp. 1-14, 2021.

S. Kumari, D. Kumar, and M. Mittal, "An ensemble approach for classification and prediction of diabetes mellitus using soft voting classifier," International Journal of Cognitive Computing in Engineering, vol. 2, pp. 40-46, 2021.

K. M. Aamir, L. Sarfraz, M. Ramzan, M. Bilal, J. Shafi, and M. Attique, "A Fuzzy Rule-Based System for Classification of Diabetes," Sensors, vol. 21, p. 8095, 2021.

D. Rahhal, R. Alhamouri, I. Albataineh, and R. Duwairi, "Detection and Classification of Diabetic Retinopathy Using Artificial Intelligence Algorithms," in 2022 13th International Conference on Information and Communication Systems (ICICS), 2022, pp. 15-21.

A. Sharma, K. Guleria, and N. Goyal, "Prediction of diabetes disease using machine learning model," in International Conference on Communication, Computing and Electronics Systems, 2021, pp. 683-692.

S. Shafi and G. A. Ansari, "Early prediction of diabetes disease & classification of algorithms using machine learning approach," in Proceedings of the International Conference on Smart Data Intelligence (ICSMDI 2021), 2021.

Kumari, S. S. ., and K. S. . Rani. “Big Data Classification of Ultrasound Doppler Scan Images Using a Decision Tree Classifier Based on Maximally Stable Region Feature Points”. International Journal on Recent and Innovation Trends in Computing and Communication, vol. 10, no. 8, Aug. 2022, pp. 76-87, doi:10.17762/ijritcc.v10i8.5679.

D. K. Choubey, S. Tripathi, P. Kumar, V. Shukla, and V. K. Dhandhania, "Classification of Diabetes by Kernel based SVM with PSO," Recent Advances in Computer Science and Communications (Formerly: Recent Patents on Computer Science), vol. 14, pp. 1242-1255, 2021.

R. Saxena, S. K. Sharma, M. Gupta, and G. Sampada, "A Novel Approach for Feature Selection and Classification of Diabetes Mellitus: Machine Learning Methods," Computational Intelligence and Neuroscience, vol. 2022, 2022.

P. Tiwari and V. Singh, "Diabetes disease prediction using significant attribute selection and classification approach," in Journal of Physics: Conference Series, 2021, p. 012013.

K. Thaiyalnayaki, "Classification of diabetes using deep learning and svm techniques," International Journal of Current Research and Review, vol. 13, p. 146, 2021.

H. N. K. Al-Behadili and K. R. Ku-Mahamud, "Fuzzy unordered rule using greedy hill climbing feature selection method: An application to diabetes classification," Journal of Information and Communication Technology, vol. 20, pp. 391-422, 2021.

A. Bilal, G. Sun, Y. Li, S. Mazhar, and A. Q. Khan, "Diabetic retinopathy detection and classification using mixed models for a disease grading database," IEEE Access, vol. 9, pp. 23544-23553, 2021.

A. Prabha, J. Yadav, A. Rani, and V. Singh, "Design of intelligent diabetes mellitus detection system using hybrid feature selection based XGBoost classifier," Computers in Biology and Medicine, vol. 136, p. 104664, 2021.

X. Wang, M. Zhai, Z. Ren, H. Ren, M. Li, D. Quan, et al., "Exploratory study on classification of diabetes mellitus through a combined Random Forest Classifier," BMC medical informatics and decision making, vol. 21, pp. 1-14, 2021.

J. B. Awotunde, F. E. Ayo, R. G. Jimoh, R. O. Ogundokun, O. E. Matiluko, I. D. Oladipo, et al., "Prediction and classification of diabetes mellitus using genomic data," in Intelligent IoT systems in personalized health care, ed: Elsevier, 2021, pp. 235-292.

S. Singh, B. Rathore, H. Das, S. Agrawal, D. Bhutia, V. Maan, et al., "An Improved Convolutional Neural Network for Classification of Type-2 Diabetes Mellitus," in Proceedings of the Third International Conference on Information Management and Machine Intelligence, 2023, pp. 417-424.

J. Chaki, S. T. Ganesh, S. Cidham, and S. A. Theertan, "Machine learning and artificial intelligence based diabetes mellitus detection and self-management: a systematic review," Journal of King Saud University-Computer and Information Sciences, 2020.

R. Kamalraj, S. Neelakandan, M. R. Kumar, V. C. S. Rao, R. Anand, and H. Singh, "Interpretable filter based convolutional neural network (IF-CNN) for glucose prediction and classification using PD-SS algorithm," Measurement, vol. 183, p. 109804, 2021.

T. Chauhan, S. Rawat, S. Malik, and P. Singh, "Supervised and unsupervised machine learning based review on diabetes care," in 2021 7th International Conference on Advanced Computing and Communication Systems (ICACCS), 2021, pp. 581-585.

Patil, V. N., & Ingle, D. R. (2022). A Novel Approach for ABO Blood Group Prediction using Fingerprint through Optimized Convolutional Neural Network. International Journal of Intelligent Systems and Applications in Engineering, 10(1), 60–68. https://doi.org/10.18201/ijisae.2022.268

S. S. Reddy, N. Sethi, R. Rajender, and V. Vetukuri, "Non-invasive Diagnosis of Diabetes Using Chaotic Features and Genetic Learning," in International Conference on Image Processing and Capsule Networks, 2022, pp. 161-170.

H. Thakkar, V. Shah, H. Yagnik, and M. Shah, "Comparative anatomization of data mining and fuzzy logic techniques used in diabetes prognosis," Clinical eHealth, vol. 4, pp. 12-23, 2021.

N. A. Libre. (2021). A Discussion Platform for Enhancing Students Interaction in the Online Education. Journal of Online Engineering Education, 12(2), 07–12. Retrieved from http://onlineengineeringeducation.com/index.php/joee/article/view/49

A. Mujumdar and V. Vaidehi, "Diabetes prediction using machine learning algorithms," Procedia Computer Science, vol. 165, pp. 292-299, 2019.

T. N. Joshi and P. Chawan, "Diabetes prediction using machine learning techniques," Ijera, vol. 8, pp. 9-13, 2018.

M. Alehegn, R. Joshi, and P. Mulay, "Analysis and prediction of diabetes mellitus using machine learning algorithm," International Journal of Pure and Applied Mathematics, vol. 118, pp. 871-878, 2018.

Agarwal, D. A. . (2022). Advancing Privacy and Security of Internet of Things to Find Integrated Solutions. International Journal on Future Revolution in Computer Science &Amp; Communication Engineering, 8(2), 05–08. https://doi.org/10.17762/ijfrcsce.v8i2.2067

H. R. Abdulqadir, A. M. Abdulazeez, and D. A. Zebari, "Data mining classification techniques for diabetes prediction," Qubahan Academic Journal, vol. 1, pp. 125-133, 2021.

G. Battineni, G. G. Sagaro, C. Nalini, F. Amenta, and S. K. Tayebati, "Comparative machine-learning approach: a follow-up study on type 2 diabetes predictions by cross-validation methods," Machines, vol. 7, p. 74, 2019.

K. Arumugam, M. Naved, P. P. Shinde, O. Leiva-Chauca, A. Huaman-Osorio, and T. Gonzales-Yanac, "Multiple disease prediction using Machine learning algorithms," Materials Today: Proceedings, 2021.

Q. Zou, K. Qu, Y. Luo, D. Yin, Y. Ju, and H. Tang, "Predicting diabetes mellitus with machine learning techniques," Frontiers in genetics, vol. 9, p. 515, 2018.

M. T. García-Ordás, C. Benavides, J. A. Benítez-Andrades, H. Alaiz-Moretón, and I. García-Rodríguez, "Diabetes detection using deep learning techniques with oversampling and feature augmentation," Computer Methods and Programs in Biomedicine, vol. 202, p. 105968, 2021.

J. Ramesh, R. Aburukba, and A. Sagahyroon, "A remote healthcare monitoring framework for diabetes prediction using machine learning," Healthcare Technology Letters, vol. 8, pp. 45-57, 2021.

J. J. Boutilier, T. C. Chan, M. Ranjan, and S. Deo, "Risk stratification for early detection of diabetes and hypertension in resource-limited settings: machine learning analysis," Journal of medical Internet research, vol. 23, p. e20123, 2021.

A. A. Abaker and F. A. Saeed, "A comparative analysis of machine learning algorithms to build a predictive model for detecting diabetes complications," Informatica, vol. 45, 2021.

M. H. Mahmoud, S. Alamery, H. Fouad, A. Altinawi, and A. E. Youssef, "An automatic detection system of diabetic retinopathy using a hybrid inductive machine learning algorithm," Personal and Ubiquitous Computing, pp. 1-15, 2021.

H. Naz and S. Ahuja, "SMOTE-SMO-based expert system for type II diabetes detection using PIMA dataset," International Journal of Diabetes in Developing Countries, pp. 1-9, 2021.

Types of data mining techniques used for diabetes prediction

Downloads

Published

16.12.2022

How to Cite

Fayaz, R. ., Reddy, G. ., Sujaritha, M., Soundiraraj, N., Theresa, W. ., Roy, D. K. ., Gracewell, J. ., & Gopalakrishnan, S. (2022). An Intelligent Harris Hawks Optimization (IHHO) based Pivotal Decision Tree (PDT) Machine Learning Model for Diabetes Prediction. International Journal of Intelligent Systems and Applications in Engineering, 10(4), 415–423. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2277

Issue

Section

Research Article

Most read articles by the same author(s)