Leveraging Machine Learning Techniques for Improving Heart Disease Prediction Systems Using Feature Selection

Authors

Keywords:

Relief Algorithm, Genetic algorithm, healthcare infrastructure, Machine-learning method, Chi-square

Abstract

Although there have been advancements in the Indian healthcare system over the past few decades, there is still a long way to go before we can claim to have reached world standards. Despite being the second most populated nation, India ranks 143rd out of 195 nations in terms of healthcare infrastructure. Even now, seven decades after India's declaration of independence, the country's healthcare system remains unable to guarantee universal access to care. Access to affordable, high-quality healthcare is still a pipe dream, especially for those who live in rural areas. Not everyone can afford medical services. Private organizations charge a high price for their therapies. No considerable financial assistance is allowed. This study suggests a novel hybrid feature selection method for identifying the most important qualities. Standard feature selection approaches such as Maximum relevance and minimum redundancy (mRMR), Relief, a genetic algorithm, and Least absolute shrinkage and selection operator (LASSO) were compared. A variety of classifiers were used to create a cardiovascular disease prediction system, including logistic regression, Naive Bayes, Random Forest, and support vector machine. This study made use of data from the Cleveland heart disease dataset. According to the results of this research, a Random forest based prediction model trained using characteristics discovered via a new hybrid feature selection may provide the best accuracy and sensitivity. According to the results of the research, applying feature selection algorithms enhances the performance of the prediction system in terms of accuracy, sensitivity, specificity, and throughput.

Downloads

Download data is not yet available.

References

The Lancet, "Health in India, 2017", The Lancet, vol. 389, no. 10065, p. 127, 2017. Available: 10.1016/s0140-6736(17)30075-2

Who.int, 2021. [Online]. Available: https://www.who.int/hrh/resources/16058health_workforce_India.pdf. [Accessed: 20- March- 2021].

Niti.gov.in, 2021. [Online]. Available: https://www.niti.gov.in/sites/default/files/2020- 12/PHS_13_dec_web.pdf. [Accessed: 28- January- 2021].

Kasthuri A. “Challenges to Healthcare in India - The Five A's. Indian journal of community medicine “: official publication of Indian Association of Preventive & Social Medicine, 43(3), 141–143. https://doi.org/10.4103/ijcm.IJCM_194_18

V. Bajpai, "The Challenges Confronting Public Hospitals in India, Their Origins, and Possible Solutions", Advances in Public Health, vol. 2014, pp. 1-27, 2014. Available: 10.1155/2014/898502

Agarwal R, Mittal M. Inventory classification using multi-level association rule mining. Int J Dec Supp Syst Technol. (IJDSST), 2019;11(2):1–12.

Agrawal R, Srikant R. Fast algorithms for mining association rules. In: Proceedings of 20th international conference very large data bases, VLDB. Vol. 1215, pp. 487–499; 1994.

Akbaş KE, Kivrak M, Arslan AK, Çolak C. Assessment of association rules based on certainty factor: an application on heart data set, in 2019 International artificial intelligence and data processing symposium (IDAP) (pp. 1–5). IEEE; 2019.

Altaf W, Shahbaz M, Guergachi A. Applications of association rule mining in health informatics: a survey. Artif Intell Rev. 2017;47(3):313–40.

Alwidian J, Hammo BH, Obeid N. WCBA: weighted classification based on association rules algorithm for breast cancer disease. Appl Soft Comput. 2018;62:536–49.

American Heart Association. Heart disease and stroke statistics 2017 at-a-glance. Geraadpleegd van: https://healthmetrics.heart.org/wp-content/uploads/2017/06/Heart-Disease-and-Stroke-Statistics-2017-ucm_491265.pdf.

Amin MS. Identifying significant features and data mining techniques in predicting cardiovascular disease; 2018.

Amin MS, Chiam YK, Varathan KD Identification of significant features and data mining techniques in predicting heart disease. Telem Inform. 2019;36;82–93.

Repository.upenn.edu, 2021. [Online]. Available: https://repository.upenn.edu/cgi/viewcontent.cgi?article=1176&context=hcmg_papers.

P. Arokiasamy, "India's escalating burden of non-communicable diseases", The Lancet Global Health, vol. 6, no. 12, pp. e1262-e1263, 2018. Available: 10.1016/s2214- 109x(18)304480

"Lifestyle diseases in India", Pib.gov.in, 2021. [Online]. Available: https://pib.gov.in/PressReleaseIframePage.aspx?PRID=1540840. [Accessed: 28- May- 2021]. [10]"Non-communicable diseases", WHO. int, 2021. [Online]. Available: https://www.who.int/news-room/fact-sheets/detail/noncommunicable-diseases.

Bashir, S., Khan, Z. S., Khan, F. H., Anjum, A., & Bashir, K. (2019). Improving heart disease prediction using feature selection approaches. In 2019 16th International Bhurban Conference on Applied Sciences and Technology (IBCAST) (pp. 619–623). IEEE.

Cengiz AB, Birant KU, Birant D. Analysis of pre-weighted and post-weighted association rule mining, in 2019 Innovations in Intelligent Systems and Applications Conference (ASYU) (pp. 1–5). IEEE.

Chauhan A, Jain A, Sharma P, Deep V. Heart disease prediction using evolutionary rule learning, in 2018 4th International conference on computational intelligence & communication technology (CICT) (pp. 1–4). IEEE; 2018.

Dey L, Mukhopadhyay A. Biclustering-based association rule mining approach for predicting cancer-associated protein interactions. IET Syst Biol. 2019;13(5):234–42.

"India: Health of the Nation’s States", Institute for Health Metrics and Evaluation, 2021. [Online]. Available: http://www.healthdata.org/policy- report/India-health- nation%E2%80%99s-states.

"National Program for Prevention and Control of Cancer, Diabetes, CVD and Stroke( NPCDCS) | National Health Portal Of India", Nhp.gov.in, 2021. [Online]. Available: https://www.nhp.gov.in/national-programme-for-prevention-and-control- of-c_pg. [Accessed: 28- May- 2021].

"Ayushman Bharat - National Health Protection Mission | National Portal of India", India.gov.in, 2021. [Online]. Available: https://www.india.gov.in/spotlight/ayushman- bharat-national-health-protection-mission.

Fitriyani NL, Syafrudin M, Alfian G, Rhee J. HDPM: an effective heart disease prediction model for a clinical decision support system. IEEE Access. 2020;8:133034–50.

Kevin Harris, Lee Green, Juan Garcia, Juan Castro, Juan González. Intelligent Personal Assistants in Education: Applications and Challenges. Kuwait Journal of Machine Learning, 2(2). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/185

Nayak, R. ., & Samanta, S. . (2023). Prediction of Factors Influencing Social Performance of Indian MFIs using Machine Learning Approach. International Journal on Recent and Innovation Trends in Computing and Communication, 11(1), 77–87. https://doi.org/10.17762/ijritcc.v11i1.6053

Downloads

Published

11.07.2023

How to Cite

Vinta, S. R. ., Anbalagan, E., Basavaraddi, C. C. S. ., B., A. P. ., T., R. G. ., & Mazumdar, N. . (2023). Leveraging Machine Learning Techniques for Improving Heart Disease Prediction Systems Using Feature Selection. International Journal of Intelligent Systems and Applications in Engineering, 11(8s), 560–567. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3087