An Approach to Predict Early Diabetes Mellitus with An Unsupervised Clustering Technique
Keywords:
Clustering, Diabetes, K-Means Clustering, Factor Analysis, DiseaseAbstract
Hyperglycemia which constitutes a considerable imminence to human health. Diabetes may lead to an anomalous rise in glucose levels. Preliminary detection of diabetes reduces the risk of fatality and agony. In our country around 30 million peoples are recognized with this fatal disease. It is tremendously complicated to develop a virtuous and precise diabetes forecasting. The ICMR with diabetic people have taken inventiveness and emerged with various solution but regrettably they endured like leftovers. Clustering is an important technique for the prediction of diabetes. In machine learning the clustering technique contingent on unsupervised learning and classification techniques contingent on supervised learning. In this research work, the factor analysis concept has been solicited to genesis of total effect on the PIMA Indian Diabetic Dataset and designate the prime factors that repercussion on it. K-Means algorithm conviction has been on the total effect data to acquire the cluster in superlative mode and for the quantification of distance the Euclidean distance function has been used. The numbers of clusters have been pronounced on the base of output of the dataset and it causes formation of knowledge based. To predict diabetics various machine learning accession have been solicited on cluster-based dataset. K-Means clustering algorithm used for early diabetic identification containing the data of 165 diabetic patients. The maximum precision, recall and F1-score1.00 obtained by K-Means and accuracy obtained by logistic regression 0.7662, decision tree 0.7269, SVM 0.7835 and random forest 0.7922 respectively. All anticipated outcomes are displayed in a comparison table and pointed out the aspect of research.
Downloads
References
Li, L., Tang, H., Wu, Z., Gong, J., Gruidl, M., Zou, J., Tockman, M., Clark, R.A. (2004). Data mining techniques for cancer detection using serum proteomic profiling. Artificial Intelligence in Medicine, 32(2): 71- 83. doi.org/10.1016/j.artmed.2004.03.006
Panzarasa, S. (2010). Data mining techniques for analyzing stroke care processes. Proceedings of the 13th World Congress on Medical Informatics, pp. 939-943. https://doi.org/10.3233/978-1-60750-588-4-939
Porter, T., Green, B. (2009). Identifying diabetic patients: a data mining approach. AMCIS 2009 Proceedings, 500.
Kuzuya, T., Nakagawa, S., Satoh, J., Kanazawa, Y., Iwamoto, Y., Kobayashi, M., Kashiwagi, A., Araki, E., Ito, C., Inagaki, N., Iwamoto, Y., Kasuga, M., Hanafusa, T., Haneda, M., Ueki, K., Committee of the Japan Diabetes Society on the Diagnostic Criteria of Diabetes Mellitus. (2002). Report of the Committee on the classification and diagnostic criteria of diabetes mellitus. Diabetes Research and Clinical Practice, 55(1): 65-85. https://doi.org/10.1016/s01688227(01)00365-5
Matheus, C.J., Piatetsky-Shapiro, G., McNeill, D. (1996). 20 selecting and reporting what is interesting: The kefir application to healthcare data. http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1. 1.200.7107.
V.Chang, J.Bailey, Q.A.Xu. Z.Sun, Pima Indians diabetes Mellitus Classification Based on Machine Learning (ML) Algorithms, Neural Computer Applications,(2022) 1-17.
M.K.Hasan, M.A.Alam, D.Das, E.Hossain, M.Hasan, Diabetes Prediction Using Ensembling of Different Machine Learning Classifiers, IEEE Access 8(2020) 76516-76531.
J.J. Khanam , S.Y.Foo, A Comparison of Machine Learning Algorithms for Diabetes Prediction, ICT Express (2021).
M.A.Sarwar, N.Kamal, W.Hamid, M.A.Shah, Prediction of Diabetes Using Machine learning Algorithms in Healthcare in:2018 24th International Conference on Automation and Computing, ICAC,IEEE,2018,pp1-6.
L.Alturki, K.Aloraini, A.Aldughayshim, S.Albahli, Predictions of Readmissions and Length of Stay of Diabetes Related Patients, in: 2019 IEEE/ACS 16th International Conference on Computer Systems and Applicattions, AICCSA,IEEE,2019,pp.1-8
V. Anuja and R.Chitra., “Classification Of Diabetes Disease Using Support Vector Machine”, International Journal of Engineering Research and Applications (IJERA), vol.3,Issue 2, pp. 1797-1801, 2013.
Aiswarya I., S. Jeyalatha and Ronak S., “Diagnosis Of Diabetes Using Classification Mining Techniques”, International Journal of Data Mining & Knowledge Management Process (IJDKP), vol.5, ,No. 1, pp. 1-14, 2015.
K.Rajesh and V.Sangeetha,”Application of Data Mining Methods and Techniques for Diabetes Diagnosis,” in proceedings of International journal of Engineering and Innovative Technology, vol.2, Issue 3, pp. 43-46, 2012.
Harleen and Dr. Pankaj B.,”A Prediction Technique in Data Mining for Diabetes Mellitus,” Journal of Management Sciences and Technology, vol. 4, Issue 1, pp. 1-12, 2016.
Ravi S. and Smt T., ”Prognosis of Diabetes Using Data mining Approach-Fuzzy C Means Clustering and Support Vector Machine,” International Journal of Computer Trends and Technology (IJCTT), vol. 11, No. 2, pp. 94-98, 2014.
G. Krishnaveni*, T. Sudha,” A Novel Technique To Predict Diabetic Disease Using Data Mining Classification Techniques” in International Conference on Innovative Applications in Engineering and Information Technology (ICIAEIT2017), vol. 3, Issue 1, pp. 5-11, 2017.
Raj A., Vishnu P., and Kavita B.,”K-Fold Cross Validation and Classification Accuracy of PIMA Indian Diabetes Data Set Using Higher Order Neural Network and PCA”, International Journal of Soft Computing and Engineering (IJSCE), Volume-2, Issue-6, pp. 436-438, January 2013.
Vrushali B., and Rakhi W., “Review on Prediction of Diabetes using Data Mining Technique”, International Journal of Research and Scientific Innovation (IJRSI), Volume IV, Issue IA, pp. 43-46, January 2017.
Thirumal P., and Nagarajan N.,” Utilization of Data Mining Techniques for Diagnosis of Diabetes Mellitus - A Case Study”, ARPN Journal of Engineering and Applied Sciences, Vol. 10, No. 1, pp. 8-13, January 2015.
Huamaní, E. L. ., Leon-Ayala, R. ., Alva-Mantari, A. ., & Meneses-Claudio, B. . (2023). Prototype of a Mobile Application for the Detection of Car Accidents on the Roads of Peru . International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 37–42. https://doi.org/10.17762/ijritcc.v11i3.6198
Jang Bahadur Saini, D. . (2022). Pre-Processing Based Wavelets Neural Network for Removing Artifacts in EEG Data. Research Journal of Computer Systems and Engineering, 3(1), 43–47. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/40
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.