The Application of Data Mining Techniques to the Detection of Cancer

Authors

  • Raghavendra R. Assistant Professor, Department of Computer Science and IT, Jain(Deemed-to-be University), Bangalore-27, India
  • Neeraj Kumari Assistant Professor, College of Computing Science and Information Technology, Teerthanker Mahaveer University, Moradabad, Uttar Pradesh, India
  • Surendra Yadav Professor, Department of Computer Science & Application, Vivekananda Global University, Jaipur, India
  • Prabha Shreeraj Nair Associate Professor. & Dy. HoD, Department of Information Technology and M.Tech Integrated, Noida Institute of Engineering and Technology, Greater Noida, Uttar Pradesh, India

Keywords:

Cancer detection, data mining, histogram equalization (HE), linear discriminant analysis (LDA), Elephant herding optimized logistic regression (EHOLR)

Abstract

Cancer is one of the leading causes of mortality worldwide. In 2018, there were approximately 1,735,350 new instances of cancer identified in the United States alone, and 609,640 individuals passed away as a direct result of the disease. Cancers include skin melanoma, lung bronchus cancer, breast cancer, prostate cancer, colon and rectum cancer, bladder cancer, kidney and renal pelvis cancer, and others. Cancer has risen to prominence in the scientific community due to the wide variety of cancers and the enormous number of people it affects. There is still active research on cancer prevention and diagnostic strategies. Using data mining methods, we sought to create a reliable and workable system for cancer diagnosis. Machine learning techniques may assist professionals in creating tools that enable early cancer detection. To improve cancer diagnosis rates, this research aims to introduce a novel machine learning method called the Elephant herding optimized logistic regression (EHOLR) strategy. Histogram equalization (HE) was used for preprocessing the acquired cancer data, and linear discriminant analysis (LDA) was used to extract the data's features. Finally, cancer detection is accomplished using our recommended strategy. The effectiveness of the suggested strategy is then assessed using the performance matrix, namely accuracy, recall, and precision..

Downloads

Download data is not yet available.

References

Eltalhi, S. and Kutrani, H., 2019. Breast cancer diagnosis and prediction using machine learning and data mining techniques: a review. IOSR Journal of Dental and Medical Sciences, 18(4), pp.85-94.

AbdElNabi, M.L.R., Wajeeh Jasim, M., El-Bakry, H.M., Hamed N. Taha, M. and Khalifa, N.E.M., 2020. Breast and colon cancer classification from gene expression profiles using data mining techniques. Symmetry, 12(3), p.408.

Jatain, R. ., & Jailia, M. . (2023). Automatic Human Face Detection and Recognition Based On Facial Features Using Deep Learning Approach. International Journal on Recent and Innovation Trends in Computing and Communication, 11(2s), 268–277. https://doi.org/10.17762/ijritcc.v11i2s.6146

Abdar, M., Zomorodi-Moghadam, M., Zhou, X., Gururajan, R., Tao, X., Barua, P.D. and Gururajan, R., 2020. A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recognition Letters, 132, pp.123-131.

Mohammed, S.A., Darrab, S., Noaman, S.A. and Saake, G., 2020. Analysis of breast cancer detection using different machine learning techniques. In Data Mining and Big Data: 5th International Conference, DMBD 2020, Belgrade, Serbia, July 14–20, 2020, Proceedings 5 (pp. 108-117). Springer Singapore.

Simsek, S., Kursuncu, U., Kibis, E., AnisAbdellatif, M. and Dag, A., 2020. A hybrid data mining approach for identifying the temporal effects of variables associated with breast cancer survival. Expert Systems with Applications, 139, p.112863.

Kaur, I., Doja, M.N. and Ahmad, T., 2022. Data mining and machine learning in cancer survival research: an overview and future recommendations. Journal of Biomedical Informatics, p.104026

Razali, N., Mostafa, S.A., Mustapha, A., Abd Wahab, M.H. and Ibrahim, N.A., 2020, April. Risk factors of cervical cancer using classification in data mining. In Journal of Physics: Conference Series (Vol. 1529, No. 2, p. 022102). IOP Publishing.

Fatima, N., Liu, L., Hong, S. and Ahmed, H., 2020. Prediction of breast cancer, comparative review of machine learning techniques, and their analysis. IEEE Access, 8, pp.150360-150376.

Sohail, M.N., Jiadong, R., Uba, M.M. and Irshad, M., 2019. A comprehensive looks at data mining techniques contributing to medical data growth: a survey of researcher reviews. Recent Developments in Intelligent Computing, Communication and Devices: Proceedings of ICCD 2017, pp.21-26.

Alam, T.M., Khan, M.M.A., Iqbal, M.A., Abdul, W. and Mushtaq, M., 2019. Cervical cancer prediction through different screening methods using data mining. IJACSA) International Journal of Advanced Computer Science and Applications, 10(2).

Kumar, V., Mishra, B.K., Mazzara, M., Thanh, D.N. and Verma, A., 2020. Prediction of malignant and benign breast cancer: A data mining approach in healthcare applications. In Advances in Data Science and Management: Proceedings of ICDSM 2019 (pp. 435-442). Springer Singapore.

Singh, M. ., Angurala, D. M. ., & Bala, D. M. . (2020). Bone Tumour detection Using Feature Extraction with Classification by Deep Learning Techniques. Research Journal of Computer Systems and Engineering, 1(1), 23–27. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/21

Neto, C., Brito, M., Lopes, V., Peixoto, H., Abelha, A. and Machado, J., 2019. Application of data mining for the prediction of mortality and occurrence of complications for gastric cancer patients. Entropy, 21(12), p.1163.

AbdElNabi, M.L.R., Wajeeh Jasim, M., El-Bakry, H.M., Hamed N. Taha, M. and Khalifa, N.E.M., 2020. Breast and colon cancer classification from gene expression profiles using data mining techniques. Symmetry, 12(3), p.408

Ghorbani, R. and Ghousi, R., 2019. Predictive data mining approaches in medical diagnosis: A review of some diseases prediction. International Journal of Data and Network Science, 3(2), pp.47-70.

Yang, J., Li, Y., Liu, Q., Li, L., Feng, A., Wang, T., Zheng, S., Xu, A. and Lyu, J., 2020. Brief introduction of medical database and data mining technology in big data era. Journal of Evidence‐Based Medicine, 13(1), pp.57-69.

Downloads

Published

11.07.2023

How to Cite

R., R. ., Kumari, N. ., Yadav, S. ., & Nair, P. S. . (2023). The Application of Data Mining Techniques to the Detection of Cancer. International Journal of Intelligent Systems and Applications in Engineering, 11(8s), 27–34. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3017