Deep Learning and Machine Learning Approach to Breast Cancer Classification with Random Search Hyperparameter Tuning

Authors

  • Inutu Kawina Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram 522502, Andhra Pradesh, India.
  • Amarendra K. Professor, Department of Computer Science and Engineering, Koneru Lakshmaiah Education Foundation, Vaddeswaram 522502, Andhra Pradesh, India.
  • Bhaskar Marapelli Associate Professor, Department of Computer Science and Information Technology, Koneru Lakshmaiah Education Foundation, Vaddeswaram 522502, Andhra Pradesh, India.

Keywords:

Breast cancer, Convolution neural network, Deep learning, Deep neural network, Machine learning, Random search hyperparameter tuning

Abstract

Breast cancer is one of the main causes of death and a threat to women, artificial intelligence has grown in importance in the field of health over time. With the aid of random search hyperparameter tuning, this study proposed a machine learning and deep learning approach to breast cancer classification. Two datasets related to breast cancer were used in this study, and findings proved that optimizing random search hyperparameters enhances the models performance, in addition it is seen that machine learning classifiers are not as effective in classifying breast cancer as compared to deep learning. Among the deep learning approaches Convolutional neural networks showed the highest accuracy over deep neural networks on both datasets. It was further observed that random search hyperparameter tuning performed better on the Breakhis_400x dataset than on the breast histopathology dataset which could be attributed to the notion that hyperparameter tuning using random search performs better on small data than large data.

Downloads

Download data is not yet available.

References

R.L. Siegel, K.D. Miller, N.S v. and A. Jemal, "Cancer Statistics," CA: A Cancer Journal for Clinicians, vol. 73, no. 1, pp. 17-48, 2023. https://doi.org/10.3322/caac.21763

World health organization. “Breast cancer”. 2023. Available: https://www.who.int/news-room/fact-sheets/detail/breast-cancer

World Health organization. “Cancer”. 2022. Available: https://www.who.int/news-room/fact-sheets/detail/cancer

National Cancer Institute. “Common Cancer Types”. (n.d). Available at https://www.cancer.gov/types/common-cancers

G.W. Prager, S Braga, B. Bystricky, C. Qvortrup, C. Criscitiello, E. Esin, G.S. Sonke, G.A. Martínez, J.S. Frenel, M. Karamouzis , M. Strijbos , O. Yazici , P. Bossi , S. Banerjee, T. Troiani, A. Eniu, F. Ciardiello, J. Tabernero, C.C. Zielinski, P.G. Casali, F. Cardoso, J.Y. Douillard, S. Jezdic , K. McGregor, G. Bricalli, M.Vyas, A. Ilbawi. “Global cancer control: responding to the growing burden, rising costs and inequalities in access”. ESMO Open. 2018 Feb 2;3(2):e000285. doi: 10.1136/esmoopen-2017-000285. PMID: 29464109; PMCID: PMC5812392.

Y. Xu, X. Liu, X. Cao, C. Huang, E. Liu, S. Qian,X. Liu, Y. Wu, F. Dong, C. Qiu, K.Hua, W.Su, J. Wu, H. Xu , Y. Han , C. Fu, Z. Yin, M. Liu, R. Roepman., S. Dietmann, F. Kengara, Z. Zhang , L. Zhang, T. Zhao, J. Dai, J. Yang, L. Lan, M. Luo, Z. Liu, T. An, B. Zhang, X. He, S. Cong, X. Liu, W. Zhang, J.P. Lewis J. P. Tiedje, Q. Wang, Z. An, F. Wang, L. Zhang, T. Huang, C. Lu, Z. Cai, F. Wang and J. Zhang. “Artificial intelligence: A powerful paradigm for scientific research”. The Innovation. 2021, Vol 2, issue 4. https://doi.org/10.1016/j.xinn.2021.100179

L. Yang. “Image classification of MNIST dataset by using machine learning techniques”. UC Merced. 2021. https://escholarship.org/uc/item/81b0z2hh

Assegie A.T. (2021). An optimized K-Nearest Neighbor based breast cancer detection. Journal of Robotics and Control (JRC) Vol 2, Issue 3, pp 115-118. https://doi: 10.18196/jrc.2363

H.H. Farag, L.A.A. Said, M.R.M. Rizk and M.B.E. Ahmed. “Hyperparameters Optimization for ResNet and Xception in the Purpose of Diagnosing COVID-19”.IOS Press. 2021, vol. 41, no. 2, pp. 3555-3571. https://doi:10.3233/JIFS-2109255

A.K. Nugroho and H. Suhartanto. “Hyper-Parameter Tuning based on Random Search for DenseNet Optimization”. 2020 7th International Conference on Information Technology, Computer, and Electrical Engineering (ICITACEE). 2020, pp. 96-99.

M. Karthik, K. Thangavel and K. Sasirekha. “Novel Deep CNN Model based Breast Cancer Classification.” 2023 7th International Conference on Computing Methodologies and Communication (ICCMC). 2023, pp. 524-529.

K. Sreekala and J. Sahoo. "Hyper Parameter Optimization of Convolutional Neural Networks for Breast Cancer Classification," 2021 International Conference on Advances in Computing and Communications (ICACC), Kochi, Kakkanad, India. 2021, pp. 1-6, https://doi: 10.1109/ICACC-202152719.2021.9708331.

G. Jayandhi, L.J.S. Jasmine and M.S. Jones. X. “Mammogram image classification system using deep learning for breast cancer diagnosis”. AIP Conf. Proc. 2519, 030066 2022. https://doi.org/10.1063/5.0109640

C. Maklin. “Synthetic Minority Over-sampling Technique (SMOTE)”. 2022. Available at https://medium.com/@corymaklin/synthetic-minority-over-sampling-technique-smote-7d419696b88c

A. Özdemir, K. Polat and A. Alhudhaif. “Classification of imbalanced hyperspectral images using SMOTE-based deep learning methods”. Expert Systems with Applications. 2021, Vol 178, https://doi.org/10.1016/j.eswa.2021.114986

I.H. Sarker. “Machine Learning: Algorithms, Real-World Applications and Research Directions”. SN comput. Sci. 2021 Vol 2, article 160. https://doi.org/10.1007/s42979-021-00592-x

V. Jain. “Introduction to KNN Algorithms". 2022. Available at https://www.analyticsvidhya.com/blog/2022/01/introduction-to-knn-algorithms/

S.F. Khorshid and A.M.Abdulazeez. “Breast cancer diagnosis based on k-nearest neighbors: a Review”. Palarch’s Journal of Archaeology of Egypt/Egyptology 18(4), 2021 pp.1927-1951. ISSN 1567-214x

C. Martins. “Gaussian Naive Bayes Explained with Scikit-Learn”. 2023. Available at https://builtin.com/artificial-intelligence/gaussian-naive-bayes

K. Jain. “How to Improve Naive Bayes? Section 3: Tuning the Model in Python”. 2021. Available at https://medium.com/analytics-vidhya/how-to-improve-naive-bayes-9fa698e14cba

V. Viswanatha, A.C. Ramachandra, B. Avinash and S. Shashank. “Breast cancer classification using logistic regression”. High Technology Letters. 2023, Vol 29, Issue 8. https://gjstx-e.cn/

G.G. Pekhimenko (n.d). Penalized Logistic Regression for Classification.

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel., P. Prettenhofer, r. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay. “Scikit-learn: Machine Learning in Python”. Journal of Machine Learning Research. 2021, Vol 12 pp. 2825—2830. Available: https://scikit-learn.org/stable/modules/generated/sklearn.linear_model.LogisticRegression.html

I.H. Sarker. “Deep Learning: A Comprehensive Overview on Techniques, Taxonomy, Applications and Research Directions”. SN COMPUT. SCI. 2, article 420,2021. https://doi.org/10.1007/s42979-021-00815-1

L. Alzubaidi, J. Zhang, A.J. Humaidi, A. Al-Dujaili, Y. Duan, O. Al-Shamma, J. Santamaría, A. Mohammed, F.M. Al-Amidie and L. Farhan. (2021). “Review of deep learning: concepts, CNN architectures, challenges, applications, future directions”. J Big Data 8, article 53, 2021. https://doi.org/10.1186/s40537-021-00444-8

Simplilearn. “What is epoch in Machine Learning?”.2023 Available: https://www.simplilearn.com/tutorials/machine-learning-tutorial/what-is-epoch-in-machine-learning#:~:text=Batch%20size%20is%20a%20hyperparameter,more%20samples%20and%20making%20predictions.

K. Shen. “Effect of batch size on training dynamics”.2018. Available: https://medium.com/mini-distill/effect-of-batch-size-on-training-dynamics-21c14f7a716e

J Brownlee. “Difference Between a Batch and an Epoch in a Neural Network”.2022. Available: https://machinelearningmastery.com/difference-between-a-batch-and-an-epoch/

J. Lim, S. Jeong, S. Lim, H. Cho, J.Y. Shim, S. Hong S.C. Kwon, H. Lee, I. Moon, J. Kim, Y. Amashita. and M. Kano. “Development of Dye Exhaustion Behavior Prediction Model using Deep Neural Network”. Computer Aided Chemical Engineering. 2022, Vol 49, pp. 1825-1830. https://doi.org/10.1016/B978-0-323-85159-6.50304-3

W. Jia, M. Sun, J. Lian and S. Hou. “Feature dimensionality reduction: a review”. Complex Intell. Syst. 8, 2022, pp 2663–2693. https://doi.org/10.1007/s40747-021-00637-x

F. Nazari and W. Yan. “Convolutional versus Dense Neural Networks: Comparing the Two Neural Networks’ Performance in Predicting Building Operational Energy Use Based on the Building Shape”. n.d. Available: https://arxiv.org/ftp/arxiv/papers/2108/2108.12929.pdf

Downloads

Published

23.02.2024

How to Cite

Kawina, I. ., K. , A. ., & Marapelli , B. . (2024). Deep Learning and Machine Learning Approach to Breast Cancer Classification with Random Search Hyperparameter Tuning. International Journal of Intelligent Systems and Applications in Engineering, 12(16s), 264–275. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4818

Issue

Section

Research Article

Most read articles by the same author(s)