Design of An Incremental Q Learning Model for Improving Efficiency of Rule-Mining Based Automatic Clustering Architectures

Authors

  • Jayshri Harde Asst.Prof. GHRU Saikheda
  • Swapnili Karmore Associate Professor, GHRIET, Nagpur

Keywords:

Incremental Q Learning Model, Genetic Algorithm, Rule-mining, Automatic Clustering, Efficiency Enhancements

Abstract

Increasing the efficacy of rule-mining-based automatic clustering architectures is a prerequisite for many data-driven applications. In this paper, we present a novel method, the Incremental Q Learning Model (IQLM), which combines Q learning and the Genetic Algorithm for iterative optimizations. Our model addresses the need to improve clustering effectiveness by maximizing inter-class variance and minimizing intra-class variance levels. Existing methods frequently struggle to achieve optimal feature selection and parameter tuning, which can have a substantial impact on clustering performance under real-time scenarios. Our IQLM incorporates a Genetic Algorithm Model that performs feature selection, retaining high variance features from input datasets and samples, in order to circumvent these limitations. This method assists in identifying key discriminative characteristics, thereby enhancing the accuracy of clustering process. In our method, the Q Learning Model computes an Iterative Q Value that quantifies the ratio of Inter Cluster Centroid Variance to Intraclass Sample Variance levels. This Q Value is a dynamic measure of clustering effectiveness. The model optimizes the clustering parameters by iteratively adjusting the values of Minimum Support (for FPGrowth, Apriori, and FPMax), Epsilon, and Min Samples for DBSCAN. As a result, the Q Values are recalculated, and a reward function based on the estimated improvement is derived for different datasets & samples. The efficacy of our proposed model is demonstrated by empirical evaluations conducted on diverse data sets. In comparison to existing methods, the IQLM is 45% efficient in terms of precision, accuracy & recall levels. Our proposed IQLM's characteristics make it suitable for real-time scenarios. Its ability to dynamically modify clustering parameters and feature selection ensures adaptability in fluctuating data environments. The achieved efficiency enhancement improves the scalability and usability of the automatic clustering architecture, making it suitable for a wide range of data-driven applications in real-world environments.

Downloads

Download data is not yet available.

References

T. Wang et al., "A Deep Clustering via Automatic Feature Embedded Learning for Human Activity Recognition," in IEEE Transactions on Circuits and Systems for Video Technology, vol. 32, no. 1, pp. 210-223, Jan. 2022, doi: 10.1109/TCSVT.2021.3057469.

S. Bobek, M. Kuk, M. Szelążek and G. J. Nalepa, "Enhancing Cluster Analysis With Explainable AI and Multidimensional Cluster Prototypes," in IEEE Access, vol. 10, pp. 101556-101574, 2022, doi: 10.1109/ACCESS.2022.3208957.

X. Shi, Y. D. Wong, C. Chai, M. Z. -F. Li, T. Chen and Z. Zeng, "Automatic Clustering for Unsupervised Risk Diagnosis of Vehicle Driving for Smart Road," in IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 10, pp. 17451-17465, Oct. 2022, doi: 10.1109/TITS.2022.3166838.

A. S. Shirkhorshidi, T. Y. Wah, S. M. R. Shirkhorshidi and S. Aghabozorgi, "Evolving Fuzzy Clustering Approach: An Epoch Clustering That Enables Heuristic Postpruning," in IEEE Transactions on Fuzzy Systems, vol. 29, no. 3, pp. 560-568, March 2021, doi: 10.1109/TFUZZ.2019.2956900.

J. Guan, S. Li, X. He, J. Zhu, J. Chen and P. Si, "SMMP: A Stable-Membership-Based Auto-Tuning Multi-Peak Clustering Algorithm," in IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 5, pp. 6307-6319, 1 May 2023, doi: 10.1109/TPAMI.2022.3213574.

T. Zhang, M. Zhou, X. Guo, L. Qi and A. Abusorrah, "A Density-Center-Based Automatic Clustering Algorithm for IoT Data Analysis," in IEEE Internet of Things Journal, vol. 9, no. 24, pp. 24682-24694, 15 Dec.15, 2022, doi: 10.1109/JIOT.2022.3194886.

Y. A. Badr, K. T. Wassif and M. Othman, "Automatic Clustering of DNA Sequences With Intelligent Techniques," in IEEE Access, vol. 9, pp. 140686-140699, 2021, doi: 10.1109/ACCESS.2021.3119560.

J. -X. Chen, Y. -J. Gong, W. -N. Chen, M. Li and J. Zhang, "Elastic Differential Evolution for Automatic Data Clustering," in IEEE Transactions on Cybernetics, vol. 51, no. 8, pp. 4134-4147, Aug. 2021, doi: 10.1109/TCYB.2019.2941707.

J. Jia, M. Luo, S. Ma, L. Wang and Y. Liu, "Consensus-Clustering-Based Automatic Distribution Matching for Cross-Domain Image Steganalysis," in IEEE Transactions on Knowledge and Data Engineering, vol. 35, no. 6, pp. 5665-5679, 1 June 2023, doi: 10.1109/TKDE.2022.3155924.

S. B. Son, S. Park and J. Kim, "Entropy-Aware Similarity for Balanced Clustering: A Case Study With Melanoma Detection," in IEEE Access, vol. 11, pp. 46892-46902, 2023, doi: 10.1109/ACCESS.2023.3275561.

P. Singh and S. Ganapathy, "Self-Supervised Representation Learning With Path Integral Clustering for Speaker Diarization," in IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 29, pp. 1639-1649, 2021, doi: 10.1109/TASLP.2021.3075100.

X. Shang, T. Yang, S. Han, M. Song and B. Xue, "Interference-Suppressed and Cluster-Optimized Hyperspectral Target Extraction Based on Density Peak Clustering," in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 14, pp. 4999-5014, 2021, doi: 10.1109/JSTARS.2021.3078452.

A. H. A. El-Naga, S. Sayed, A. Salah and H. Mohsen, "Consensus Nature Inspired Clustering of Single-Cell RNA-Sequencing Data," in IEEE Access, vol. 10, pp. 98079-98094, 2022, doi: 10.1109/ACCESS.2022.3202187.

J. Chen, Y. Zhang, L. Wu, T. You and X. Ning, "An Adaptive Clustering-Based Algorithm for Automatic Path Planning of Heterogeneous UAVs," in IEEE Transactions on Intelligent Transportation Systems, vol. 23, no. 9, pp. 16842-16853, Sept. 2022, doi: 10.1109/TITS.2021.3131473.

W. Xia, T. Wang, Q. Gao, M. Yang and X. Gao, "Graph Embedding Contrastive Multi-Modal Representation Learning for Clustering," in IEEE Transactions on Image Processing, vol. 32, pp. 1170-1183, 2023, doi: 10.1109/TIP.2023.3240863.

Y. Lu, Y. -M. Cheung and Y. Y. Tang, "Self-Adaptive Multiprototype-Based Competitive Learning Approach: A k-Means-Type Algorithm for Imbalanced Data Clustering," in IEEE Transactions on Cybernetics, vol. 51, no. 3, pp. 1598-1612, March 2021, doi: 10.1109/TCYB.2019.2916196.

H. Chen, Y. Zhou, K. Mei, N. Wang and G. Cai, "A New Density Peak Clustering Algorithm With Adaptive Clustering Center Based on Differential Privacy," in IEEE Access, vol. 11, pp. 1418-1431, 2023, doi: 10.1109/ACCESS.2022.3233196.

Y. Peng, B. Cui, H. Yin, Y. Zhang and P. Du, "Automatic SAR Change Detection Based on Visual Saliency and Multi-Hierarchical Fuzzy Clustering," in IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, vol. 15, pp. 7755-7769, 2022, doi: 10.1109/JSTARS.2022.3199017.

F. Du et al., "SVM-Assisted Adaptive Kernel Power Density Clustering Algorithm for Millimeter Wave Channels," in IEEE Transactions on Antennas and Propagation, vol. 70, no. 6, pp. 4014-4026, June 2022, doi: 10.1109/TAP.2022.3140538.

A. A. Qaffas, M. A. B. Hajkacem, C. -E. B. Ncir and O. Nasraoui, "Interpretable Multi-Criteria ABC Analysis Based on Semi-Supervised Clustering and Explainable Artificial Intelligence," in IEEE Access, vol. 11, pp. 43778-43792, 2023, doi: 10.1109/ACCESS.2023.3272403.

E. Battistella et al., "COMBING: Clustering in Oncology for Mathematical and Biological Identification of Novel Gene Signatures," in IEEE/ACM Transactions on Computational Biology and Bioinformatics, vol. 19, no. 6, pp. 3317-3331, 1 Nov.-Dec. 2022, doi: 10.1109/TCBB.2021.3123910.

Y. Li, Q. Xu, W. Li and J. Nie, "Automatic Clustering-Based Two-Branch CNN for Hyperspectral Image Classification," in IEEE Transactions on Geoscience and Remote Sensing, vol. 59, no. 9, pp. 7803-7816, Sept. 2021, doi: 10.1109/TGRS.2020.3038425.

Z. Lan, P. Gao, P. Wang, Y. Wang, J. Liang and G. Hu, "Automatic First Arrival Time Identification Using Fuzzy C-Means and AIC," in IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-13, 2022, Art no. 5907613, doi: 10.1109/TGRS.2021.3121032.

X. Hu, Y. Tang, W. Pedrycz, K. Di, J. Jiang and Y. Jiang, "Fuzzy Clustering With Knowledge Extraction and Granulation," in IEEE Transactions on Fuzzy Systems, vol. 31, no. 4, pp. 1098-1112, April 2023, doi: 10.1109/TFUZZ.2022.3195033.

B. Merikhi and M. R. Soleymani, "Automatic Data Clustering Framework Using Nature-Inspired Binary Optimization Algorithms," in IEEE Access, vol. 9, pp. 93703-93722, 2021, doi: 10.1109/ACCESS.2021.3091397.

Earshia V., D. ., & M., S. . (2023). Interpolation of Low-Resolution Images for Improved Accuracy Using an ANN Quadratic Interpolator . International Journal on Recent and Innovation Trends in Computing and Communication, 11(4s), 135–140. https://doi.org/10.17762/ijritcc.v11i4s.6319

Paul Garcia, Ian Martin, Laura López, Sigurðsson Ólafur, Matti Virtanen. Predictive Analytics in Education: Leveraging Machine Learning for Student Success. Kuwait Journal of Machine Learning, 2(1). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/164

Veeraiah, V., Anand, R., Mishra, K. N., Dhabliya, D., Ajagekar, S. S., & Kanse, R. (2022). Investigating scope of energy efficient routing in adhoc network. Paper presented at the PDGC 2022 - 2022 7th International Conference on Parallel, Distributed and Grid Computing, 681-686. doi:10.1109/PDGC56933.2022.10053344 Retrieved from www.scopus.com

Downloads

Published

03.09.2023

How to Cite

Harde , J. ., & Karmore, S. . (2023). Design of An Incremental Q Learning Model for Improving Efficiency of Rule-Mining Based Automatic Clustering Architectures. International Journal of Intelligent Systems and Applications in Engineering, 12(1s), 591–598. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3494

Issue

Section

Research Article