A Novel Data Stream High Utility Itemset Miner with the Batch Transaction Processing Model

Authors

  • Subba Reddy Meruva, BonduVenkateswarlu

Keywords:

High Utility Itemset, Association Mining, HUI Mining, Data Stream, Batch Model

Abstract

High-utility mining techniques play a significant role in effectively finding the high utility itemsets (HUIs). These techniques aim to find the HUIs based on threshold values of minimum utility. Real-time applications, such as dynamic retail store transactions, continuous web data stream, and item updates in sensor network databases, need dynamic HUI mining techniques. Recently, an incremental mining-based high utility itemset (IM-HUM) was developed to handle the dynamic itemsets in the HUI mining process using incremental schedulers. It was primarily focused on processing HUI based on time frame schedulers rather than considering the amounts or size of items processed during the particular scheduler. It becomes tedious when a reasonable number of items cannot be processed in the prescribed schedule. For this reason, in the proposed work, the reasonable number of items in the data stream is defined by fixing the size of the batch of items instead of considering schedulers. The proposed data stream high utility miner is implemented using a batch model, say  (DS-HUI-BM). It is superior to other state-of-the-art HUI mining techniques for both sparse and dense datasets, and the same is illustrated in the experimental section.

Downloads

Download data is not yet available.

References

Chee, CH., Jaafar, J., Aziz, I.A. et al. Algorithms for frequent itemset mining: a literature review. Artif Intell Rev 52, 2603–2621 (2019). https://doi.org/10.1007/s10462-018-9629-z

Meruva, Subba Reddy and Venkateswarlu Bondu. "Review of Association Mining Methods for the Extraction of Rules Based on the Frequency and Utility Factors." IJITPM vol.12, no.4 2021: pp.1-10. http://doi.org/10.4018/IJITPM.2021100101

D. Chen, S. L. Sain and K. Guo, "Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining", J. Database Marketing Customer Strategy Manage., vol. 19, no. 3, pp. 197-208, Sep. 2012.

J. Han, J. Pei and Y. Yin, "Mining frequent patterns without candidate generation", Proc. ACM SIGMOD Int. Conf. Manage. Data, vol. 29, pp. 1-12, 2000.

Nguyen L.T.T., Mai T., Vo B. (2019) High Utility Association Rule Mining. In: Fournier-Viger P., Lin JW., Nkambou R., Vo B., Tseng V. (eds) High-Utility Pattern Mining. Studies in Big Data, vol 51. Springer, Cham. https://doi.org/10.1007/978-3-030-04921-8_6

Tseng et al., 2013, V.S. Tseng, B.E. Shie, C.W. Wu, P.S. Yu, Efficient algorithms for mining high utility itemsets from transactional databases, IEEE Trans. Knowl. Data Eng., 25 (2013), pp. 1772-1786

Tseng, V.S., Wu, C.-W., Shie, B.-E., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: ACM SIGKDD, pp. 253–262. ACM (2010)

Tseng VS, Shie BE, Wu CW, Philip SY (2012) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25(8):1772–1786

Lin JCW, Gan W, Hong TP (2016) Maintaining the discovered high-utility itemsets with transaction modification. Appl Intell 44(1):166–178

K. Rajendra Prasad (2017), Optimized high-utility itemsets mining for effective association mining paper, International Journal of Electrical and Computer, Vol. 7, Issue. 5, pp: 2911-2918

T.-P. Hong, C.-H. Lee and S.-L. Wang, "Effective utility mining with the measure of average utility", Expert Syst. Appl., vol. 38, no. 7, pp. 8259-8265, Jul. 2011.

B. Vo, L. T. T. Nguyen, N. Bui, T. D. D. Nguyen, V. -N. Huynh and T. -P. Hong, "An Efficient Method for Mining Closed Potential High-Utility Itemsets," in IEEE Access, vol. 8, pp. 31813-31822, 2020, doi: 10.1109/ACCESS.2020.2974104.

R. J. Hilderman, C. L. Carter, H. J. Hamilton and N. Cercone, "Mining market basket data using share measures and characterized itemsets", Proc. PAKDD, pp. 159-173, 1998.

Chan R, Yang Q, and Shen Y-D (2003) Mining high utility itemsets, in IEEE International Conference on Data mining, pp. 19–26

Krishnamoorthy S (2019) A comparative study of top-K high utility itemset mining methods. High-Utility Pattern Mining, pp 47–74

Liu, J., Wang, K., Fung, B.C.: Direct discovery of high utility itemsets without candidate generation. In: Proceedings of the 12th IEEE International Conference on Data Mining, pp. 984–989. IEEE (2012)

Zida, S., Fournier-Viger, P., Lin, J.CW. et al. EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl Inf Syst 51, 595–625 (2017). https://doi.org/10.1007/s10115-016-0986-0

Using Length Upper-Bound Reduction. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_11

Li, HF., Huang, HY. & Lee, SY. Fast and memory efficient mining of high-utility itemsets from data streams: with and without negative item profits. Knowl Inf Syst 28, 495–522 (2011). https://doi.org/10.1007/s10115-010-0330-z

Fageeri, S.O., Hossain, S.M.E., Arockiasamy, S., Al-Salmi, T.Y. (2022). High-Utility Pattern Mining Using ULB-Miner. In: Aurelia, S., Hiremath, S.S., Subramanian, K., Biswas, S.K. (eds) Sustainable Advanced Computing. Lecture Notes in Electrical Engineering, vol 840. Springer, Singapore. https://doi.org/10.1007/978-981-16-9012-9_17

Liu Y., Liao W., Choudhary A. (2005) A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets. In: Ho T.B., Cheung D., Liu H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2005. Lecture Notes in Computer Science, vol 3518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11430919_79

Subba Reddy Meruva, Dr. Bondu Venkateswarlu, Tree Integrated High Utility Miner for Improving an Efficiency of Association Mining, Vol., Vol. 83, 2020, pp:15938-15946

E. Hikmawati, N. U. Maulidevi and K. Surendro, "Pruning Strategy on Adaptive Rule Model by Sorting Utility Items," in IEEE Access, vol. 10, pp. 91650-91662, 2022,doi: 10.1109/ACCESS.2022.3202307.

S. R. Meruva and B. Venkateswarlu, "A Fast and Effective Tree-based Mining Technique for Extraction of High Utility Itemsets," 2022 6th International Conference on Electronics, Communication and Aerospace Technology, Coimbatore, India, 2022, pp. 1393-1399,doi: 10.1109/ICECA55336.2022.10009213

Downloads

Published

26.03.2024

How to Cite

Subba Reddy Meruva. (2024). A Novel Data Stream High Utility Itemset Miner with the Batch Transaction Processing Model. International Journal of Intelligent Systems and Applications in Engineering, 12(21s), 3858 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6157

Issue

Section

Research Article