An Approach to Predict Early Diabetes Mellitus with An Unsupervised Clustering Technique

Authors

  • Rita Ganguly Dr. B. C. Roy Engineering College, West Bengal – 713206, INDIA
  • Dharmpal Singh JIS University,West Bengal – 700109, INDIA

Keywords:

Clustering, Diabetes, K-Means Clustering, Factor Analysis, Disease

Abstract

Hyperglycemia which constitutes a considerable imminence to human health.  Diabetes may lead to an anomalous rise in glucose levels. Preliminary detection of diabetes reduces the risk of fatality and agony.  In our country around 30 million peoples are recognized with this fatal disease. It is tremendously complicated to develop a virtuous and precise diabetes forecasting. The ICMR with diabetic people have taken inventiveness and emerged with various solution but regrettably they endured like leftovers. Clustering is an important technique for the prediction of diabetes. In machine learning the clustering technique contingent on unsupervised learning and classification techniques contingent on supervised learning. In this research work, the factor analysis concept has been solicited to genesis of total effect on the PIMA Indian Diabetic Dataset and designate the prime factors that repercussion on it. K-Means algorithm conviction has been on the total effect data to acquire the cluster in superlative mode and for the quantification of distance the Euclidean distance function has been used. The numbers of clusters have been pronounced on the base of output of the dataset and it causes formation of knowledge based. To predict diabetics various machine learning accession have been solicited on cluster-based dataset. K-Means clustering algorithm used for early diabetic identification containing the data of 165 diabetic patients. The maximum precision, recall and F1-score1.00 obtained by K-Means and accuracy obtained by logistic regression 0.7662, decision tree 0.7269, SVM 0.7835 and random forest 0.7922 respectively. All anticipated outcomes are displayed in a comparison table and pointed out the aspect of research.

Downloads

Download data is not yet available.

References

Li, L., Tang, H., Wu, Z., Gong, J., Gruidl, M., Zou, J., Tockman, M., Clark, R.A. (2004). Data mining techniques for cancer detection using serum proteomic profiling. Artificial Intelligence in Medicine, 32(2): 71- 83. doi.org/10.1016/j.artmed.2004.03.006

Panzarasa, S. (2010). Data mining techniques for analyzing stroke care processes. Proceedings of the 13th World Congress on Medical Informatics, pp. 939-943. https://doi.org/10.3233/978-1-60750-588-4-939

Porter, T., Green, B. (2009). Identifying diabetic patients: a data mining approach. AMCIS 2009 Proceedings, 500.

Kuzuya, T., Nakagawa, S., Satoh, J., Kanazawa, Y., Iwamoto, Y., Kobayashi, M., Kashiwagi, A., Araki, E., Ito, C., Inagaki, N., Iwamoto, Y., Kasuga, M., Hanafusa, T., Haneda, M., Ueki, K., Committee of the Japan Diabetes Society on the Diagnostic Criteria of Diabetes Mellitus. (2002). Report of the Committee on the classification and diagnostic criteria of diabetes mellitus. Diabetes Research and Clinical Practice, 55(1): 65-85. https://doi.org/10.1016/s01688227(01)00365-5

Matheus, C.J., Piatetsky-Shapiro, G., McNeill, D. (1996). 20 selecting and reporting what is interesting: The kefir application to healthcare data. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1. 1.200.7107.

V.Chang, J.Bailey, Q.A.Xu. Z.Sun, Pima Indians diabetes Mellitus Classification Based on Machine Learning (ML) Algorithms, Neural Computer Applications,(2022) 1-17.

M.K.Hasan, M.A.Alam, D.Das, E.Hossain, M.Hasan, Diabetes Prediction Using Ensembling of Different Machine Learning Classifiers, IEEE Access 8(2020) 76516-76531.

J.J. Khanam , S.Y.Foo, A Comparison of Machine Learning Algorithms for Diabetes Prediction, ICT Express (2021).

M.A.Sarwar, N.Kamal, W.Hamid, M.A.Shah, Prediction of Diabetes Using Machine learning Algorithms in Healthcare in:2018 24th International Conference on Automation and Computing, ICAC,IEEE,2018,pp1-6.

L.Alturki, K.Aloraini, A.Aldughayshim, S.Albahli, Predictions of Readmissions and Length of Stay of Diabetes Related Patients, in: 2019 IEEE/ACS 16th International Conference on Computer Systems and Applicattions, AICCSA,IEEE,2019,pp.1-8

V. Anuja and R.Chitra., “Classification Of Diabetes Disease Using Support Vector Machine”, International Journal of Engineering Research and Applications (IJERA), vol.3,Issue 2, pp. 1797-1801, 2013.

Aiswarya I., S. Jeyalatha and Ronak S., “Diagnosis Of Diabetes Using Classification Mining Techniques”, International Journal of Data Mining & Knowledge Management Process (IJDKP), vol.5, ,No. 1, pp. 1-14, 2015.

K.Rajesh and V.Sangeetha,”Application of Data Mining Methods and Techniques for Diabetes Diagnosis,” in proceedings of International journal of Engineering and Innovative Technology, vol.2, Issue 3, pp. 43-46, 2012.

Harleen and Dr. Pankaj B.,”A Prediction Technique in Data Mining for Diabetes Mellitus,” Journal of Management Sciences and Technology, vol. 4, Issue 1, pp. 1-12, 2016.

Ravi S. and Smt T., ”Prognosis of Diabetes Using Data mining Approach-Fuzzy C Means Clustering and Support Vector Machine,” International Journal of Computer Trends and Technology (IJCTT), vol. 11, No. 2, pp. 94-98, 2014.

G. Krishnaveni*, T. Sudha,” A Novel Technique To Predict Diabetic Disease Using Data Mining Classification Techniques” in International Conference on Innovative Applications in Engineering and Information Technology (ICIAEIT2017), vol. 3, Issue 1, pp. 5-11, 2017.

Raj A., Vishnu P., and Kavita B.,”K-Fold Cross Validation and Classification Accuracy of PIMA Indian Diabetes Data Set Using Higher Order Neural Network and PCA”, International Journal of Soft Computing and Engineering (IJSCE), Volume-2, Issue-6, pp. 436-438, January 2013.

Vrushali B., and Rakhi W., “Review on Prediction of Diabetes using Data Mining Technique”, International Journal of Research and Scientific Innovation (IJRSI), Volume IV, Issue IA, pp. 43-46, January 2017.

Thirumal P., and Nagarajan N.,” Utilization of Data Mining Techniques for Diagnosis of Diabetes Mellitus - A Case Study”, ARPN Journal of Engineering and Applied Sciences, Vol. 10, No. 1, pp. 8-13, January 2015.

Huamaní, E. L. ., Leon-Ayala, R. ., Alva-Mantari, A. ., & Meneses-Claudio, B. . (2023). Prototype of a Mobile Application for the Detection of Car Accidents on the Roads of Peru . International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 37–42. https://doi.org/10.17762/ijritcc.v11i3.6198

Jang Bahadur Saini, D. . (2022). Pre-Processing Based Wavelets Neural Network for Removing Artifacts in EEG Data. Research Journal of Computer Systems and Engineering, 3(1), 43–47. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/40

Downloads

Published

16.07.2023

How to Cite

Ganguly, R. ., & Singh, D. (2023). An Approach to Predict Early Diabetes Mellitus with An Unsupervised Clustering Technique. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 45–55. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3141

Issue

Section

Research Article