PCOScare: Conventional Machine Learning Classifiers for Diagnosing and Prevention

Authors

  • Vaibhav C. Gandhi, Khyati R. Nirmal, Uma Maheswari, Sudha Rajesh, P. Tharcis, Dhruvi Thakkar

Keywords:

Polycystic Ovary Syndrome, Random Forest Classifier, Machine Learning, Early Detection, Prevention

Abstract

PCOS is a common endocrine disorder that affects women of reproductive age. Timely identification and diagnosis of PCOS are essential for the successful treatment and prevention of related health complications. ML techniques have shown promise in automating PCOS diagnosis using various clinical and biochemical features. Though, the presentation of these mockups seriously relies on the selection of relevant features, as including irrelevant or redundant features can lead to overfitting and decreased generalization performance. In this study, we propose an optimized feature selection approach for PCOS detection using ML algorithms. We first compile a comprehensive dataset comprising clinical and biochemical features commonly associated with PCOS, including hormone levels, menstrual irregularities, and anthropometric measurements. For the purpose of handling missing data and scaling features properly, feature preprocessing techniques like normalization and imputation are used. We next investigate several feature selection strategies, such as filter, wrapper, and embedding approaches, to find the most relevant characteristics for PCOS identification. We use cross-validation and presentation needles with accuracy, sensitivity, specificity, and area under the receiver operating characteristic curve (AUC-ROC) in a methodical review approach to maximize feature selection. The presentation of respectively feature assortment method is compared to assess its effectiveness in identifying discriminative features for PCOS diagnosis. Furthermore, we investigate the impact of feature selection on different ML algorithms, including support vector classifier, random forests, and Xg-boosting classifiers. Our results demonstrate that feature selection significantly improves the performance of PCOS detection models by reducing dimensionality and focusing on the most relevant features. Moreover, we identify key features that contribute most to the discriminative power of the models, providing insights into the underlying characteristics of PCOS. The optimized feature selection approach proposed in this study offers a promising strategy for developing accurate and interpretable ML models for PCOS detection, ultimately aiding clinicians in early diagnosis and personalized treatment planning.

Downloads

Download data is not yet available.

References

“A Comparative Study for Detecting Polycystic Ovary Syndrome Using A Machine Learning Framework,” IEEE Conference Publication | IEEE Xplore, Dec. 13, 2023. [Online]. Available: https://ieeexplore.ieee.org/document/10441348

“Accessible Polycystic Ovarian Syndrome Diagnosis Using Machine Learning,” IEEE Conference Publication | IEEE Xplore, May 27, 2022. [Online]. Available: https://ieeexplore.ieee.org/document/9824049

“Implementation of Various Machine Learning Algorithms to Predict Polycystic Ovary Syndrome,” IEEE Conference Publication | IEEE Xplore, May 26, 2023. [Online]. Available: https://ieeexplore.ieee.org/document/10170497

“A Comparative Study on Different Machine Learning Algorithms to Detect PCOS,” IEEE Conference Publication | IEEE Xplore, Jul. 06, 2023. [Online]. Available: https://ieeexplore.ieee.org/document/10307174

Majumdar, Sounak & Biswas, Saroj & Purkayastha, Biswajit. (2023). Diagnosis of Polycystic Ovary Syndrome Using Machine Learning Models: A Comparative Analysis. 1-5. 10.1109/i-PACT58649.2023.10434903.

“A Comparative Study for Detecting Polycystic Ovary Syndrome Using A Machine Learning Framework,” IEEE Conference Publication | IEEE Xplore, Dec. 13, 2023. https://ieeexplore.ieee.org/document/10441348

“Hybrid Machine Learning Algorithms for Polycystic Ovary Syndrome Detection,” IEEE Conference Publication | IEEE Xplore, Oct. 25, 2022. https://ieeexplore.ieee.org/document/10041525

Elmannai H, El-Rashidy N, Mashal I, Alohali MA, Farag S, El-Sappagh S, Saleh H. Polycystic Ovary Syndrome Detection Machine Learning Model Based on Optimized Feature Selection and Explainable Artificial Intelligence. Diagnostics (Basel). 2023 Apr 21;13(8):1506. doi: 10.3390/diagnostics13081506. PMID: 37189606; PMCID: PMC10137609.

Laganà AS, Vitale SG, Noventa M, Vitagliano A. Current Management of Polycystic Ovary Syndrome: From Bench to Bedside. Int J Endocrinol. 2018 Nov 14;2018:7234543. doi: 10.1155/2018/7234543. PMID: 30538744; PMCID: PMC6261402.

D. Hdaib, N. Almajali, H. Alquran, W. A. Mustafa, W. Al-Azzawi and A. Alkhayyat, "Detection of Polycystic Ovary Syndrome (PCOS) Using Machine Learning Algorithms," 2022 5th International Conference on Engineering Technology and its Applications (IICETA), Al-Najaf, Iraq, 2022, pp. 532-536, doi: 10.1109/IICETA54559.2022.9888677.

Vedpathak, Shreyas & Thakre, Vaidehi. (2020). PCOcare: PCOS Detection and Prediction using Machine Learning Algorithms. Bioscience Biotechnology Research Communications. 13. 240-244. 10.21786/bbrc/13.14/56.

Prasher S, Nelson L, Sharma A. Evaluation of Machine Learning Techniques to Diagnose Polycystic Ovary Syndrome Using Kaggle Dataset. InInternational Conference on Emerging Trends in Expert Applications & Security 2023 Feb 17 (pp. 279-287). Singapore: Springer Nature Singapore.

Subha R, Nayana BR, Radhakrishnan R, Sumalatha P. Computerized diagnosis of polycystic ovary syndrome using machine learning and swarm intelligence techniques.

S. Ahmed et al., "A Review on the Detection Techniques of Polycystic Ovary Syndrome Using Machine Learning," in IEEE Access, vol. 11, pp. 86522-86543, 2023, doi: 10.1109/ACCESS.2023.3304536.

P. Chitra, M. Sumathi, K. Srilatha, F. V. Jayasudha and S. Amudha, "Review of Artificial Intelligent based Algorithm for Prediction of Polycystic Ovary Syndrome(PCOS) from Blood Samples," 2022 4th International Conference on Inventive Research in Computing Applications (ICIRCA), Coimbatore, India, 2022, pp. 1172-1176, doi: 10.1109/ICIRCA54612.2022.9985699.

Khanna, Varada Vivek, Krishnaraj Chadaga, Niranajana Sampathila, Srikanth Prabhu, Venkatesh Bhandage, and Govardhan K. Hegde. 2023. "A Distinctive Explainable Machine Learning Framework for Detection of Polycystic Ovary Syndrome" Applied System Innovation 6, no. 2: 32. https://doi.org/10.3390/asi6020032.

Darji, M., Dave, J.A., Rathod, D.B. (2023). Review of Deep Learning: A New Era. In: Tuba, M., Akashe, S., Joshi, A. (eds) ICT Infrastructure and Computing. Lecture Notes in Networks and Systems, vol 520. Springer, Singapore. https://doi.org/10.1007/978-981-19-5331-6_33

V. C. Gandhi and P. P. Gandhi, “A Survey-Insights of ML and DL in Health Domain,” in In2022 International Conference on Sustainable Computing and Data Communication Systems (ICSCDS), IEEE, 2022, pp. 239–246.

Jain, A., Sharma, P.C., Vishwakarma, S.K., Gupta, N.K., Gandhi, V.C. (2022). Metaheuristic Techniques for Automated Cryptanalysis of Classical Transposition Cipher: A Review. In: Somani, A.K., Mundra, A., Doss, R., Bhattacharya, S. (eds) Smart Systems: Innovations in Computing. Smart Innovation, Systems and Technologies, vol 235. Springer, Singapore. https://doi.org/10.1007/978-981-16-2877-1_43

A Vigorous Chaotic Function Based Image Authentication Structure AK Singh, VC Gandhi, MM Subramanyam, S Kumar, S Aggarwal, S Tiwari, Journal of Physics: Conference Series 1854 (1), 012039.

Downloads

Published

07.05.2024

How to Cite

Vaibhav C. Gandhi. (2024). PCOScare: Conventional Machine Learning Classifiers for Diagnosing and Prevention . International Journal of Intelligent Systems and Applications in Engineering, 12(3), 3247–3255. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5930

Issue

Section

Research Article