Optimizing K-Means Clustering using the Artificial Firefly Algorithm

Authors

  • LNC. Prakash K. Computer Science & Engineering (DS), CVR College of Engineering, Hyderabad, India
  • G. Suryanarayana Department of Information Technology, Vardhaman College of Engineering, Hyderabad, India.
  • N. Swapna Computer Science & Engineering, Vijay Rural Engineering College, Nizamabad, Telangana, India
  • T. Bhaskar Computer Science & Engineering, CMR College of Engineering and Technology, Hyderabad, Talangana, India
  • Ajmeera Kiran Computer Science & Engineering, MLR Institute of Technology, Dundigal, Hyderabad.

Keywords:

Artificial firefly algorithm, K-Means, Optimization

Abstract

Data clustering is a typical data analysis approach that is utilised in a variety of domains, namely machine learning, pattern matching, and visual analytics. K-means clustering is a popular and straightforward solution to data clustering, although it has important shortcomings, including local optimum convergence and initial point sensitivities. To attend the challenge of local convergence of optimal clusters in this article a swam-based optimization technique is proposed. Firefly method is a swarm-based technique used for optimizing challenges. This research proposes a tale approach for clustering data using the firefly algorithm. It is demonstrated how the K-Means technique may be applied to locate the centroids for the known initial cluster centres. The approach was later enhanced to improve centroids and clusters using firefly optimization. This novel algorithm is known as AFA. The experimental findings demonstrated the suggested method's efficiency and capabilities for data clustering and the conclusions show that the suggested model outperform traditional K-means clustering.

Downloads

Download data is not yet available.

References

C. Pizzuti and D. Talia, ‘‘P-AutoClass: scalable parallel clustering for mining large data sets’’, in IEEE transaction on Knowledge and data engineering, Vol. 15, pp. 629-641, May 2003.

K. C. Wong and G. C. L. Li, “Simultaneous Pattern and Data Clustering for Pattern Cluster Analysis’’, in IEEE Transaction on Knowledge and Data Engineering, Vol. 20, pp. 911-923, Los Angeles, USA, June 2008.

J. Marr, ‘‘Comparison Of Several Clustering Algorithms for Data Rate Compression of LPC Parameters’’, in IEEE International Conference on Acoustics Speech, and Signal Processing, Vol. 6, pp. 964-966, January 2003.

X. L. Yang, Q. Song and W. B. Zhang, ‘‘Kernel-based Deterministic Annealing Algorithm For Data Clustering’’, in IEEE Proceedings on Vision, Image and Signal Processing, Vol. 153, pp. 557-568, March 2007.

J. Grabmeier, A. Rudolph, “Techniques of cluster algorithms in data mining”, Data Mining and Knowledge Discovery, vol. 6(4), 2002, pp. 303–360. doi: 10.1023/A: 1016308404627.

H. Yu, Z.G. Liu, G.Y. Wang, “Automatically determining the number of clusters using decision-theoretic rough set”, in: J. Yao et al. (Eds.), Proceedings of the 6th International Conference on Rough Sets and Knowledge Technology, RSKT’11, LNCS, vol. 6954, 2011, pp. 504–513, https://doi.org/10.1016/j.ijar.2013.03.018

Narayana G. Surya, and D. Vasumathi. “Clustering for high dimensional categorical data based on text similarity.” In Proceedings othe 2nd International Conference on Communication and Information Processing (ICCIP-2016) pp. 17–21. 2016.

LNC. Prakash K, Dr. K. Anuradha “Optimal Feature Selection for multi valued Attributes using Transaction Weights as Utility Scale” proceedings of second international conference on computational intelligence and informatics (ICCII-2017). https://doi.org/10.1007/978-981-10-822 8-3.

Rajasekhar A, et al. Computing with the collective intelligence of honey bees–a survey. Swarm Evolution Comput. 2017;32:25–48.

Dorigo M, Stützle T. The ant colony optimization metaheuristic: algorithms, applications, and advances. Handbook of metaheuristics. Boston: Springer; 2003. p. 250–85.

Kennedy J, Eberhart RC. Particle swarm optimization proceeding IEEE international conference of neural network IV. In: IEEE Service centre, Piscataway. 1995.

Zhao S-Z, Ponnuthurai NS, Swagatam D. Dynamic multi-swarm particle swarm optimizer with sub-regional harmony search. In: IEEE Congress on Evolutionary Computation. IEEE, 2010.

Yang X-S, Deb S. Cuckoo search: recent advances and applications. Neural Comput Appl. 2014;24(1):169–74.

Yang X-S. Firefy algorithm, stochastic test functions and design optimisation. Int J Bio-inspired Comput. 2010;2(2):78–84.

MacQueen, J. (1967) Some methods for classification and analysis of multivariate observations, Proceedings of the Fifth Berkeley Symposium On Mathematical Statistics and Probabilities, 1, 281-296.

Morissette L, Chartier S. The k-means clustering technique: General considerations and implementation in Mathematica. Tutorials in Quantitative Methods for Psychology. 2013 Feb;9(1):15-24.

A. Ito, "Successive Binary Partition K-means Method for Clustering with Less Cluster Size Bias," 2022 7th International Conference on Signal and Image Processing (ICSIP), Suzhou, China, 2022, pp. 772-776, doi: 10.1109/ICSIP55141.2022.9886452.

Y. Li, Y. Zhang, Q. Tang, W. Huang, Y. Jiang and S. -T. Xia, "t-k-means: A ROBUST AND STABLE k-means VARIANT," ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, ON, Canada, 2021, pp. 3120-3124, doi: 10.1109/ICASSP39728.2021.9414687.

T. N. Reddy and K. P. Supreethi, "Optimization of K-means algorithm: Ant colony optimization," 2017 International Conference on Computing Methodologies and Communication (ICCMC), Erode, India, 2017, pp. 530-535, doi: 10.1109/ICCMC.2017.8282522.

S. Kapil, M. Chawla and M. D. Ansari, "On K-means data clustering algorithm with genetic algorithm," 2016 Fourth International Conference on Parallel, Distributed and Grid Computing (PDGC), Waknaghat, India, 2016, pp. 202-206, doi: 10.1109/PDGC.2016.7913145.

Prakash K., L.N.C., Surya Narayana, G., Ansari, M.D., Gunjan, V.K. (2022). Optimization of K-Means Clustering with Modified Spiral Phenomena. In: Kumar, A., Mozar, S. (eds) ICCCE 2021. Lecture Notes in Electrical Engineering, vol 828. Springer, Singapore. https://doi.org/10.1007/978-981-16-7985-8_126.

Pasi Fränti, Sami Sieranoja, “How much can k-means be improved by using better initialization and repeats?”, Pattern Recognition, Volume 93, 2019, Pages 95-112, ISSN 0031-3203, https://doi.org/10.1016/j.patcog.2019.04.014.

Rani, R. U. ., Rao, P. S. ., Lavanaya, K. ., Satyanarayana, N. ., Lallitha, S. ., & Prasad J, P. . (2023). Optimization of Energy-Efficient Cluster Head Selection Algorithm for Internet of Things in Wireless Sensor Networks. International Journal on Recent and Innovation Trends in Computing and Communication, 11(4), 238–248. https://doi.org/10.17762/ijritcc.v11i4.6445.

H. Wang, W. Wang, X. Zhou, H. Sun, J. Zhao, X. Yu, Z. Cui, Firefly algorithm with neighbourhood attraction, Inform. Sci. 382–383 (2017) 374–387.

I. Fister, I. Fister Jr, X.S. Yang, J. Brest, A comprehensive review of firefly algorithms, Swarm Evol. Comput. 13 (2013) 34–46.

A. Baykasoğlu, F.B. Ozsoydan, An improved firefly algorithm for solving dynamic multidimensional knapsack problems, Expert Syst. Appl. 41 (8) (2014) 3712–3725.

A.H. Gandomi, X.-S. Yang, A.H. Alavi, Mixed variable structural optimization using firefly algorithm, Comput. Struct. 89 (23) (2011) 2325–2336.

C.S. Sung, H.W. Jin, A tabu-search-based heuristic for clustering, Pattern Recognit. 33 (5) (2000) 849–858.

S.Z. Selim, K. Alsultan, A simulated annealing algorithm for the clustering problem, Pattern Recognit. 24 (10) (1991) 1003–1008.

U. Maulik, S. Bandyopadhyay, Genetic algorithm-based clustering technique, Pattern Recognit. 33 (9) (2000) 1455–1465.

P. Das, D.K. Das, S. Dey, A modified bee colony optimization (MBCO) and its hybridization with k-means for an application to data clustering, Appl. Soft Comput. 70 (2018) 590–603.

27 T. Niknam, B. Amiri, An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis, Appl. Soft Comput. 10 (1) (2010) 183–197.

30 S.I. Boushaki, N. Kamel, O. Bendjeghaba, A new quantum chaotic cuckoo search algorithm for data clustering, Expert Syst. Appl. 96 (2018) 358–372.

36 A. Hatamlou, S. Abdullah, M. Hatamlou, Data clustering using big bang–big crunch algorithm, in: Innovative Computing Technology, Springer Berlin Heidelberg, Berlin, Heidelberg, 2011.

X.H. Han, L. Quan, X.Y. Xiong, M. Almeter, J. Xiang, Y. Lan, A novel data clustering algorithm based on modified gravitational search algorithm, Eng. Appl. Artif. Intell. 61 (2017) 1–7.

S.J. Nanda, G. Panda, A survey on nature inspired metaheuristic algorithms for partitional clustering, Swarm Evol. Comput. 16 (2014) 1–18.

L. H.N., A. S. Reddy and K. Naidu, "Analysis of Diabetic Prediction Using Machine Learning Algorithms on BRFSS Dataset," 2023 7th International Conference on Trends in Electronics and Informatics (ICOEI), Tirunelveli, India, 2023, pp. 1024-1028, doi: 10.1109/ICOEI56765.2023.10125804.

Hu Peng, Wenhui Xiao, Yupeng Han, Aiwen Jiang, Zhenzhen Xu, Mengmeng Li, Zhijian Wu, Multi-strategy firefly algorithm with selective ensemble for complex engineering optimization problems, Applied Soft Computing, Volume 120, 2022, 108634, ISSN 1568-4946, https://doi.org/10.1016/j.asoc.2022.108634.

http://archive.ics.uci.edu/ml/

Goldberg D. E., Genetic Algorithms in Search, Optimisation and Machine Learning, Reading, Mass.: Addison Wesley (1989).

Shilane D., Martikainen J., Dudoit S., Ovaska S. J., A general framework for statistical performance comparison of evolutionary computation algorithms, Information Sciences: an Int. Journal, 178, 2870-2879 (2008).

Yang, L. ., & Daimin, G. . (2023). Children’s Perspective on Digital Picure Book: A Brief Analysis . International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 166–177. https://doi.org/10.17762/ijritcc.v11i3.6336

Thomas, C., Wright, S., Hernandez, M., Flores, A., & García, M. Enhancing Student Engagement in Engineering Education with Machine Learning. Kuwait Journal of Machine Learning, 1(2). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/123

Downloads

Published

12.07.2023

How to Cite

K., L. P. ., Suryanarayana, G. ., Swapna, N. ., Bhaskar, T. ., & Kiran, A. . (2023). Optimizing K-Means Clustering using the Artificial Firefly Algorithm. International Journal of Intelligent Systems and Applications in Engineering, 11(9s), 461–468. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3154

Issue

Section

Research Article