A Grid and Density Based Adaptive Clustering Algorithm for Spatio-Temporal Data Mining
Keywords:
Clustering, core points, seismology, spatio-temporal data, pattern miningAbstract
The Indian subcontinent experiences seismic activities which are visualized in India’s Seismic map. These seismic spatio-temporal characteristics need to analyze to understand the evolution. Clustering is a machine learning technique to highlight the patterns of grouping similar objects in the spatio-temporal dimensional. Our research work in this paper proposes a novel algorithm to analyse the spatio-temporal data for patterns through clustering. This is a hybrid method based on grid and density clustering. We have devised a method to find the required total number of core points for density clustering. The efficiency of our algorithm is higher due to appropriate selection of core points with respect to the density in the region. In addition, proposed algorithm requires minimal user defined parameters and minimizes Euclidean distance computation to the neighboring core points in the current region and not with all of the core points. The algorithm has been experimentally tested for correctness of results and performance. It is observed from the results, the Earthquake spatio-temporal data has clustering tendency and the events indicate higher correlation with respect to frequency and time. The quality of clustering is effective and efficient with the silhouette index 0.93.
Downloads
References
S. Meshram and K. P. Wagh, “Mining Intelligent Spatial Clustering Patterns: A Comparative Analysis of Different Approaches,” in 2021 8th International Conference on Computing for Sustainable Global Development (INDIACom), Mar. 2021, pp. 325–330
Pelleg, Dan; Moore, Andrew (1999). "Accelerating exact k -means algorithms with geometric reasoning". Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining. San Diego, California, United States: ACM Press. pp. 277–281. doi:10.1145/312129.312248. ISBN 9781581131437. S2CID 13907420
Ester, Martin; Kriegel, Hans-Peter; Sander, Jörg; Xu, Xiaowei (1996). Simoudis, Evangelos; Han, Jiawei; Fayyad, Usama M. (eds.). A density-based algorithm for discovering clusters in large spatial databases with noise (PDF). Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96). AAAI Press. pp. 226–231. CiteSeerX 10.1.1.121.9220. ISBN 1-57735-004-9.
Palla, Gergely; Derényi, Imre; Vicsek, Tamás (2006). "The Critical Point of k-Clique Percolation in the Erdős–Rényi Graph". Journal of Statistical Physics. 128 (1–2): 219–227.
S. Goil, H. Nagesh, and A. Choudhary, “Mafia: efficient and scalable subspace clustering for very large data sets,” Technical Report CPDC TR-9906-010, Northwestern University, 1999
G. Karypis, E. Han, and V. Kumar, “Chameleon: Hierarchical clustering using dynamic modeling,” Computer, vol. 32, no. 8 pp. 68– 75, 1999
T. Barton, T. Bruna, and P. Kordik, “Chameleon 2: An Improved Graph-Based Clustering Algorithm,” ACM Transactions on Knowledge Discovery from Data, vol. 13, no. 1, 2019
Jae-Gil Lee, Jiawei Han, Kyu-Young Whang,” Trajectory Clustering: A Partition-and-Group Framework∗”, SIGMOD’07, June 11–14, 2007, Beijing, China.
Kisilevich, S., Mansmann, F., Nanni, M. and Rinzivillo, S., 2010. Spatio-temporal clustering (pp. 855-874). Springer US.
Shi Z, Pun-Cheng LSC. Spatiotemporal Data Clustering: A Survey of Methods. ISPRS International Journal of Geo-Information. 2019; 8(3):112. https://doi.org/10.3390/ijgi8030112
.M. Siljander, R. Uusitalo, P. Pellikka, S. Isosomppi, O. VapalahtiSpatiotemporal clustering patterns and sociodemographic determinants of COVID-19 (SARS-CoV-2) infections in Helsinki, Finland”Spat. Spatio-Tempor. Epidemiol., 41 (2022), Article 100493.
S.-Q. Yang, Z.-G. Fang, C.-X. Lv, S.-Y. An, P. Guan, D.-S. Huang, W. Wu,“Spatiotemporal cluster analysis of COVID-19 and its relationship with environmental factors at the city level in mainland China”, Environ. Sci. Pollut. Res., 29 (9) (2022), pp. 13386-13395.
Jaya,I.G.N.M. and Folmer, H. (2021), Identifying Spatiotemporal Clusters by Means of Agglomerative Hierarchical Clustering and Bayesian Regression Analysis with Spatiotemporally Varying Coefficients: Methodology and Application to Dengue Disease in Bandung, Indonesia. Geogr Anal, 53: 767-817. https://doi.org/10.1111/gean.12264
D. Zhang, K. Lee, I. Lee,“Hierarchical trajectory clustering for spatio-temporal periodic pattern mining”,Expert Syst. Appl., 92 (2018), pp. 1-11
Fotheringham, S., C. Brunsdon, and M. Charlton. (2002). Geographically Weighted Regression, The Analysis of Spatially Varying Relationships. New York, NY: Wiley.
Ndiath, M., B. Cisse, J. L. Ndiaye, J. Gomis, O. Bathiery, A. Dia, and B. Faye. (2015). “Application of GeographicallyWeighted Regression Analysis to Assess Risk Factors for Malaria Hotspots in Keur Soce Health and Demographic Surveillance Site.” Malaria Journal 14(463), 1–11.
Gelfand, A., H. J. Kim, C. Sirmans, and S. Banerjee. (2003). “Spatial Modeling With Spatially Varying Coefficient Processes.” Journal of the American Statistical Association 98(462), 387–96.
Scott, D.W. Multivariate Density Estimation: Theory, Practice, and Visualization; John Wiley & Sons: Hoboken, NJ, USA, 2015
Silverman, B.W. Density Estimation for Statistics and Data Analysis; CRC Press: Boca Raton, FL, USA, 1986; Volume 26.
Naoufal Rouky, Abdellah Bousouf, Othmane Benmoussa, MouhseneFri,” A spatio temporal analysis of traffic congestion patterns using clustering algorithms: A case study of Casablanca” Decision Analytics Journal24 January 2024
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.