A Novel Data Stream High Utility Itemset Miner with the Batch Transaction Processing Model
Keywords:
High Utility Itemset, Association Mining, HUI Mining, Data Stream, Batch ModelAbstract
High-utility mining techniques play a significant role in effectively finding the high utility itemsets (HUIs). These techniques aim to find the HUIs based on threshold values of minimum utility. Real-time applications, such as dynamic retail store transactions, continuous web data stream, and item updates in sensor network databases, need dynamic HUI mining techniques. Recently, an incremental mining-based high utility itemset (IM-HUM) was developed to handle the dynamic itemsets in the HUI mining process using incremental schedulers. It was primarily focused on processing HUI based on time frame schedulers rather than considering the amounts or size of items processed during the particular scheduler. It becomes tedious when a reasonable number of items cannot be processed in the prescribed schedule. For this reason, in the proposed work, the reasonable number of items in the data stream is defined by fixing the size of the batch of items instead of considering schedulers. The proposed data stream high utility miner is implemented using a batch model, say (DS-HUI-BM). It is superior to other state-of-the-art HUI mining techniques for both sparse and dense datasets, and the same is illustrated in the experimental section.
Downloads
References
Chee, CH., Jaafar, J., Aziz, I.A. et al. Algorithms for frequent itemset mining: a literature review. Artif Intell Rev 52, 2603–2621 (2019). https://doi.org/10.1007/s10462-018-9629-z
Meruva, Subba Reddy and Venkateswarlu Bondu. "Review of Association Mining Methods for the Extraction of Rules Based on the Frequency and Utility Factors." IJITPM vol.12, no.4 2021: pp.1-10. http://doi.org/10.4018/IJITPM.2021100101
D. Chen, S. L. Sain and K. Guo, "Data mining for the online retail industry: A case study of RFM model-based customer segmentation using data mining", J. Database Marketing Customer Strategy Manage., vol. 19, no. 3, pp. 197-208, Sep. 2012.
J. Han, J. Pei and Y. Yin, "Mining frequent patterns without candidate generation", Proc. ACM SIGMOD Int. Conf. Manage. Data, vol. 29, pp. 1-12, 2000.
Nguyen L.T.T., Mai T., Vo B. (2019) High Utility Association Rule Mining. In: Fournier-Viger P., Lin JW., Nkambou R., Vo B., Tseng V. (eds) High-Utility Pattern Mining. Studies in Big Data, vol 51. Springer, Cham. https://doi.org/10.1007/978-3-030-04921-8_6
Tseng et al., 2013, V.S. Tseng, B.E. Shie, C.W. Wu, P.S. Yu, Efficient algorithms for mining high utility itemsets from transactional databases, IEEE Trans. Knowl. Data Eng., 25 (2013), pp. 1772-1786
Tseng, V.S., Wu, C.-W., Shie, B.-E., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: ACM SIGKDD, pp. 253–262. ACM (2010)
Tseng VS, Shie BE, Wu CW, Philip SY (2012) Efficient algorithms for mining high utility itemsets from transactional databases. IEEE Trans Knowl Data Eng 25(8):1772–1786
Lin JCW, Gan W, Hong TP (2016) Maintaining the discovered high-utility itemsets with transaction modification. Appl Intell 44(1):166–178
K. Rajendra Prasad (2017), Optimized high-utility itemsets mining for effective association mining paper, International Journal of Electrical and Computer, Vol. 7, Issue. 5, pp: 2911-2918
T.-P. Hong, C.-H. Lee and S.-L. Wang, "Effective utility mining with the measure of average utility", Expert Syst. Appl., vol. 38, no. 7, pp. 8259-8265, Jul. 2011.
B. Vo, L. T. T. Nguyen, N. Bui, T. D. D. Nguyen, V. -N. Huynh and T. -P. Hong, "An Efficient Method for Mining Closed Potential High-Utility Itemsets," in IEEE Access, vol. 8, pp. 31813-31822, 2020, doi: 10.1109/ACCESS.2020.2974104.
R. J. Hilderman, C. L. Carter, H. J. Hamilton and N. Cercone, "Mining market basket data using share measures and characterized itemsets", Proc. PAKDD, pp. 159-173, 1998.
Chan R, Yang Q, and Shen Y-D (2003) Mining high utility itemsets, in IEEE International Conference on Data mining, pp. 19–26
Krishnamoorthy S (2019) A comparative study of top-K high utility itemset mining methods. High-Utility Pattern Mining, pp 47–74
Liu, J., Wang, K., Fung, B.C.: Direct discovery of high utility itemsets without candidate generation. In: Proceedings of the 12th IEEE International Conference on Data Mining, pp. 984–989. IEEE (2012)
Zida, S., Fournier-Viger, P., Lin, J.CW. et al. EFIM: a fast and memory efficient algorithm for high-utility itemset mining. Knowl Inf Syst 51, 595–625 (2017). https://doi.org/10.1007/s10115-016-0986-0
Using Length Upper-Bound Reduction. In: Fujita, H., Ali, M., Selamat, A., Sasaki, J., Kurematsu, M. (eds) Trends in Applied Knowledge-Based Systems and Data Science. IEA/AIE 2016. Lecture Notes in Computer Science(), vol 9799. Springer, Cham. https://doi.org/10.1007/978-3-319-42007-3_11
Li, HF., Huang, HY. & Lee, SY. Fast and memory efficient mining of high-utility itemsets from data streams: with and without negative item profits. Knowl Inf Syst 28, 495–522 (2011). https://doi.org/10.1007/s10115-010-0330-z
Fageeri, S.O., Hossain, S.M.E., Arockiasamy, S., Al-Salmi, T.Y. (2022). High-Utility Pattern Mining Using ULB-Miner. In: Aurelia, S., Hiremath, S.S., Subramanian, K., Biswas, S.K. (eds) Sustainable Advanced Computing. Lecture Notes in Electrical Engineering, vol 840. Springer, Singapore. https://doi.org/10.1007/978-981-16-9012-9_17
Liu Y., Liao W., Choudhary A. (2005) A Two-Phase Algorithm for Fast Discovery of High Utility Itemsets. In: Ho T.B., Cheung D., Liu H. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2005. Lecture Notes in Computer Science, vol 3518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11430919_79
Subba Reddy Meruva, Dr. Bondu Venkateswarlu, Tree Integrated High Utility Miner for Improving an Efficiency of Association Mining, Vol., Vol. 83, 2020, pp:15938-15946
E. Hikmawati, N. U. Maulidevi and K. Surendro, "Pruning Strategy on Adaptive Rule Model by Sorting Utility Items," in IEEE Access, vol. 10, pp. 91650-91662, 2022,doi: 10.1109/ACCESS.2022.3202307.
S. R. Meruva and B. Venkateswarlu, "A Fast and Effective Tree-based Mining Technique for Extraction of High Utility Itemsets," 2022 6th International Conference on Electronics, Communication and Aerospace Technology, Coimbatore, India, 2022, pp. 1393-1399,doi: 10.1109/ICECA55336.2022.10009213
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.