A Grid and Density Based Adaptive Clustering Algorithm for Spatio-Temporal Data Mining

Authors

  • Swati Meshram, Kishor P. Wagh

Keywords:

Clustering, core points, seismology, spatio-temporal data, pattern mining

Abstract

The Indian subcontinent experiences seismic activities which are visualized in India’s Seismic map. These seismic spatio-temporal characteristics need to analyze to understand the evolution. Clustering is a machine learning technique to highlight the patterns of grouping similar objects in the spatio-temporal dimensional. Our research work in this paper proposes a novel algorithm to analyse the  spatio-temporal data for patterns through clustering. This is a hybrid method based on grid and density clustering. We have devised a method to find the required total number of core points for density clustering. The efficiency of our algorithm is higher due to appropriate selection of core points with respect to the density in the region. In addition, proposed algorithm requires minimal user defined parameters and minimizes Euclidean distance computation to the neighboring core points in the current region and not with all of the core points. The algorithm has been experimentally tested for correctness of results and performance. It is observed from the results, the Earthquake spatio-temporal data has clustering tendency and the events indicate higher correlation with respect to frequency and time. The quality of clustering is effective and efficient with the silhouette index 0.93.

Downloads

Download data is not yet available.

References

S. Meshram and K. P. Wagh, “Mining Intelligent Spatial Clustering Patterns: A Comparative Analysis of Different Approaches,” in 2021 8th International Conference on Computing for Sustainable Global Development (INDIACom), Mar. 2021, pp. 325–330

Pelleg, Dan; Moore, Andrew (1999). "Accelerating exact k -means algorithms with geometric reasoning". Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining. San Diego, California, United States: ACM Press. pp. 277–281. doi:10.1145/312129.312248. ISBN 9781581131437. S2CID 13907420

Ester, Martin; Kriegel, Hans-Peter; Sander, Jörg; Xu, Xiaowei (1996). Simoudis, Evangelos; Han, Jiawei; Fayyad, Usama M. (eds.). A density-based algorithm for discovering clusters in large spatial databases with noise (PDF). Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96). AAAI Press. pp. 226–231. CiteSeerX 10.1.1.121.9220. ISBN 1-57735-004-9.

Palla, Gergely; Derényi, Imre; Vicsek, Tamás (2006). "The Critical Point of k-Clique Percolation in the Erdős–Rényi Graph". Journal of Statistical Physics. 128 (1–2): 219–227.

S. Goil, H. Nagesh, and A. Choudhary, “Mafia: efficient and scalable subspace clustering for very large data sets,” Technical Report CPDC TR-9906-010, Northwestern University, 1999

G. Karypis, E. Han, and V. Kumar, “Chameleon: Hierarchical clustering using dynamic modeling,” Computer, vol. 32, no. 8 pp. 68– 75, 1999

T. Barton, T. Bruna, and P. Kordik, “Chameleon 2: An Improved Graph-Based Clustering Algorithm,” ACM Transactions on Knowledge Discovery from Data, vol. 13, no. 1, 2019

Jae-Gil Lee, Jiawei Han, Kyu-Young Whang,” Trajectory Clustering: A Partition-and-Group Framework∗”, SIGMOD’07, June 11–14, 2007, Beijing, China.

Kisilevich, S., Mansmann, F., Nanni, M. and Rinzivillo, S., 2010. Spatio-temporal clustering (pp. 855-874). Springer US.

Shi Z, Pun-Cheng LSC. Spatiotemporal Data Clustering: A Survey of Methods. ISPRS International Journal of Geo-Information. 2019; 8(3):112. https://doi.org/10.3390/ijgi8030112

.M. Siljander, R. Uusitalo, P. Pellikka, S. Isosomppi, O. VapalahtiSpatiotemporal clustering patterns and sociodemographic determinants of COVID-19 (SARS-CoV-2) infections in Helsinki, Finland”Spat. Spatio-Tempor. Epidemiol., 41 (2022), Article 100493.

S.-Q. Yang, Z.-G. Fang, C.-X. Lv, S.-Y. An, P. Guan, D.-S. Huang, W. Wu,“Spatiotemporal cluster analysis of COVID-19 and its relationship with environmental factors at the city level in mainland China”, Environ. Sci. Pollut. Res., 29 (9) (2022), pp. 13386-13395.

Jaya,I.G.N.M. and Folmer, H. (2021), Identifying Spatiotemporal Clusters by Means of Agglomerative Hierarchical Clustering and Bayesian Regression Analysis with Spatiotemporally Varying Coefficients: Methodology and Application to Dengue Disease in Bandung, Indonesia. Geogr Anal, 53: 767-817. https://doi.org/10.1111/gean.12264

D. Zhang, K. Lee, I. Lee,“Hierarchical trajectory clustering for spatio-temporal periodic pattern mining”,Expert Syst. Appl., 92 (2018), pp. 1-11

Fotheringham, S., C. Brunsdon, and M. Charlton. (2002). Geographically Weighted Regression, The Analysis of Spatially Varying Relationships. New York, NY: Wiley.

Ndiath, M., B. Cisse, J. L. Ndiaye, J. Gomis, O. Bathiery, A. Dia, and B. Faye. (2015). “Application of GeographicallyWeighted Regression Analysis to Assess Risk Factors for Malaria Hotspots in Keur Soce Health and Demographic Surveillance Site.” Malaria Journal 14(463), 1–11.

Gelfand, A., H. J. Kim, C. Sirmans, and S. Banerjee. (2003). “Spatial Modeling With Spatially Varying Coefficient Processes.” Journal of the American Statistical Association 98(462), 387–96.

Scott, D.W. Multivariate Density Estimation: Theory, Practice, and Visualization; John Wiley & Sons: Hoboken, NJ, USA, 2015

Silverman, B.W. Density Estimation for Statistics and Data Analysis; CRC Press: Boca Raton, FL, USA, 1986; Volume 26.

Naoufal Rouky, Abdellah Bousouf, Othmane Benmoussa, MouhseneFri,” A spatio temporal analysis of traffic congestion patterns using clustering algorithms: A case study of Casablanca” Decision Analytics Journal24 January 2024

Downloads

Published

24.03.2024

How to Cite

Kishor P. Wagh, S. M. . (2024). A Grid and Density Based Adaptive Clustering Algorithm for Spatio-Temporal Data Mining . International Journal of Intelligent Systems and Applications in Engineering, 12(3), 2485–2491. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5720

Issue

Section

Research Article