Machine Learning-based Classification and Analysis of Breast Cancer Pathological Images


  • Prabha Ravi Department of Medical Electronics Ramaiah Institute of Technology Bengaluru, Karnataka
  • Neha Sara John Department of Medical Electronics Ramaiah Institute of Technology Bengaluru, Karnataka
  • Prithvi B. S. Department of CSE Jyothy Institute of Technology Bengaluru, Karnataka
  • Supriya B. S. Department of Medical Electronics Ramaiah Institute of Technology Bengaluru, Karnataka
  • Elavaar Kuzhali S. Department of Electronics & Instrumentation Engineering Ramaiah Institute of Technology Bengaluru, Karnataka
  • Varsha M. S. Department of Medical Electronics Ramaiah Institute of Technology Bengaluru, Karnataka


breast cancer, histopathology, nuclear segmentation, machine learning, CNN


Histopathology is the microscopic examination of diseased cells and tissues present in the body. Determining the degree of severity of illness is very essential in the diagnosis process. Assessing the variations that take place during the illness requires an in-depth understanding of the fundamental structures and functions of various tissues. This study aimed to develop a diagnostic tool to aid clinicians in the preliminary digital examination of histopathological breast tissue slides obtained after surgical biopsy. The designed study was   carried out on the open-source Breakhis dataset. The breast tumor microscopic tissue images were divided into 5 categories, including benign breast tumors and 4 different types of malignant breast tumors (lobular carcinoma, ductal carcinoma, papillary carcinoma, and mucinous carcinoma). The images were prepared for classification by performing preprocessing steps such as color normalization and nuclei segmentation using the K-means algorithm.  A comparative analysis was performed using four image classification algorithms: AlexNet, Convolutional Neural Networks (CNN), Support Vector Machine (SVM), and K-nearest Neighbors (KNN). The AlexNet performed at a significantly higher caliber with an accuracy of 98%. A GUI, or graphic user interface, was created as an assist platform for physicians to utilize the designed breast tumor tissue image analysis pipeline.


Download data is not yet available.


Spanhol, Fabio A., Luiz S. Oliveira, Caroline Petitjean, and Laurent Heutte. “A dataset for breast cancer histopathological image classifi- cation.” Ieee transactions on biomedical engineering 63, no. 7 (2015): 1455-1462.

Roy, Santanu, Alok kumar Jain, Shyam Lal, and Jyoti Kini. ”A study about color normalization methods for histopathology images.” Micron 114 (2018): 42-61.

Bradley, Derek, and Gerhard Roth. “Adaptive thresholding using the integral image.” Journal of graphics tools 12, no. 2 (2007): 13-21.

Chan, Tony F., and Luminita A. Vese. ”Active contours without edges.”

IEEE Transactions on image processing 10, no. 2 (2001): 266-277.

Whitaker, Ross T. “A level-set approach to 3D reconstruction from range data.” International journal of computer vision 29, no. 3 (1998): 203- 231.

Abdolhoseini, Mahmoud, Murielle G. Kluge, Frederick R. Walker, and Sarah J. Johnson. “Segmentation of heavily clustered nuclei from histopathological images.” Scientific reports 9, no. 1 (2019): 1-13.

Sarrafzadeh, Omid, and Alireza Mehri Dehnavi. “Nucleus and cytoplasm segmentation in microscopic images using K-means clustering and region growing.” Advanced biomedical research 4 (2015).

Win, Khin Yadanar, Somsak Choomchuay, and Kazuhiko Hamamoto. “K mean clustering based automated segmentation of overlapping cell nuclei in pleural effusion cytology images.” In 2017 International Conference on Advanced Technologies for Communications (ATC), pp. 265-269. IEEE, 2017.

Bakthavachalam, M. D., and Dr S. Raj. “A Study Of Breast Cancer Analysis Using K-Nearest Neighbor With Different Distance Measures And Classification Rules Using Machine Learning.” European Journal of Molecular & Clinical Medicine 7, no. 3 (2020): 4842-4851.

Sewak, Mihir, Priyanka Vaidya, Chien-Chung Chan, and Zhong-Hui Duan. “SVM approach to breast cancer classification.” In Second Inter- national Multi-Symposiums on Computer and Computational Sciences (IMSCCS 2007), pp. 32-37. IEEE, 2007.

Kumar, Kundan, and Annavarapu Chandra Sekhara Rao. “Breast cancer classification of image using convolutional neural network.” In 2018 4th International Conference on Recent Advances in Information Technology (RAIT), pp. 1-6. IEEE, 2018.

Feng, Yixiao, Mia Spezia, Shifeng Huang, Chengfu Yuan, Zongyue Zeng, Linghuan Zhang, Xiaojuan Ji et al. “Breast cancer development and progression: Risk factors, cancer stem cells, signaling pathways, genomics, and molecular pathogenesis.” Genes & diseases 5, no. 2 (2018): 77-106.

Nahid, Abdullah-Al, and Yinan Kong. “Involvement of machine learning for breast cancer image classification: a survey.” Computational and mathematical methods in medicine 2017 (2017).

Omondiagbe, David A., Shanmugam Veeramani, and Amandeep S. Sidhu. “Machine learning classification techniques for breast cancer di- agnosis.” In IOP Conference Series: Materials Science and Engineering, vol. 495, no. 1, p. 012033. IOP Publishing, 2019.

Agrawal, Utkarsh, Daniele Soria, Christian Wagner, Jonathan Garibaldi, Ian O. Ellis, John MS Bartlett, David Cameron, Emad A. Rakha, and Andrew R. Green. “Combining clustering and classification ensembles: A novel pipeline to identify breast cancer profiles.” Artificial intelligence in medicine 97 (2019): 27-37.

Jelen´, Łukasz, Thomas Fevens, and Adam Krzyz˙ak. “Classification of breast cancer malignancy using cytological images of fine needle aspiration biopsies.” International Journal of Applied Mathematics and Computer Science 18, no. 1 (2008): 75-83.

Avramov, Todor K., and Dong Si. “Comparison of Feature Reduction Methods and Machine Learning Models for Breast Cancer Diagnosis.” In Proceedings of the International Conference on Compute and Data Analysis, pp. 69-74. 2017.

Obaid, O. Ibrahim, Mazin Abed Mohammed, M. K. A. Ghani, A. Mostafa, and Fahad Taha. “Evaluating the performance of machine learning techniques in the classification of Wisconsin Breast Cancer.” International Journal of Engineering & Technology 7, no. 4.36 (2018): 160-166.

Irshad, Humayun, Antoine Veillard, Ludovic Roux, and Daniel Raco- ceanu. “Methods for nuclei detection, segmentation, and classification in digital histopathology: a review—current status and future potential.” IEEE reviews in biomedical engineering 7 (2013): 97-114.

Christopher Thomas, Stephen Wright, Juan Martinez, Alejandro Perez, Luis Rodriguez. Integrating Machine Learning and Decision Science for Intelligent Automation. Kuwait Journal of Machine Learning, 2(4). Retrieved from

Uppal, A. ., Naruka, M. S. ., & Tewari, G. . (2023). Image Processing based Plant Disease Detection and Classification . International Journal on Recent and Innovation Trends in Computing and Communication, 11(1s), 52–56.

Reddy, B.R.S., Saxena, A.K., Pandey, B.K., Gupta, S., Gurpur, S., Dari, S.S., Dhabliya, D. Machine learning application for evidence image enhancement (2023) Handbook of Research on Thrust Technologies? Effect on Image Processing, pp. 25-38.




How to Cite

Ravi, P. ., John, N. S. ., S., P. B. ., B. S., S. ., Kuzhali S., E. ., & M. S., V. . (2023). Machine Learning-based Classification and Analysis of Breast Cancer Pathological Images . International Journal of Intelligent Systems and Applications in Engineering, 12(4s), 216–222. Retrieved from



Research Article

Most read articles by the same author(s)