Developing an Efficient FP-Growth Algorithm using Ordered Frequent Itemsets Matrix for Big Data
Keywords:
Big Data, Aprioiri Algorithm, FP-Growth Algorithm, Support CountAbstract
Mining big data is difficult. Problems require an efficient algorithm and software computer for computation in big datasets The FP Growth Algorithm needs a lot of memory and requires a long time for computation and extract result. In this work, we propose modifications to the workings of the FP-Growth algorithm. The suggested algorithm will reduce the time in mining and decrease the number of frequently created items, yielding a significant reduction in decision-making in big datasets through our use of the proposed matrix OFIM instead of the tree used in those algorithms. The matrix OFIM allows for efficient storage and retrieval of frequent itemsets, resulting in faster computation and extraction of results compared to the traditional tree-based approach. Additionally, our algorithm optimizes memory usage by minimizing the number of frequently created items, further enhancing its performance in handling big datasets.
Downloads
References
S. P. Tamba, M. Sitanggang, B. C. Situmorang, G. L. Panjaitan, and M. Nababan, “Application of data mining to determine the level of fish sales in pt. trans retail with fp-growth method,” INFOKUM, 2022, pp. 905–913.
T. Patil, R. Rana, and P. Singh, “Distributed frequent pattern analysis in big data.”International Research Journal of Modernization in Engineering Technology and Science ,2022,pp.1-3.
A. Ayu, A. P. Windarto, and D. Suhendro, “Implementasi data mining dengan metode fp-growth terhadap data penjualan barang sebagai strategi penjualan pada cv. a & a copier,” Resolusi: Rekayasa Teknik Informatika dan Informasi, 2021, pp. 67–75.
A. Siswandi, A. S. Sunge, and R. Y. Wulandari, “Analisa data mining dengan metode klasifikasi untuk produk cacat pada pt. shuangying international indonesia,” Jurnal SIGMA, 2018, pp. 153–156.
B. Anwar, A. Ambiyar, and F. Fadhilah, “Application of the fp-growth method to determine drug sales patterns,” Sinkron: jurnal dan penelitian teknik informatika, 2023, pp. 405–414.
M. M. Hasan and S. Z. Mishu, “An adaptive method for mining frequent itemsets based on apriori and fp growth algorithm,” in 2018 International Conference on Computer, Communication, Chemical, Material and Electronic Engineering (IC4ME2). IEEE, 2018, pp. 1–4.
A. Almira, S. Suendri, and A. Ikhwan, “Implementasi data mining menggunakan algoritma fp-growth pada analisis pola pencurian daya listrik,” Jurnal Informatika Universitas Pamulang,2021, pp. 442–448.
J. Han, J. Pei, and Y. Yin, “Mining frequent patterns without candidate generation,” ACM sigmod record, 2000, no. 2, pp. 1– 12.
F. Wei and L. Xiang, “Improved frequent pattern mining algorithm based on fp-tree,” in Proceedings of The Fourth International Conference on Information Science and Cloud Computing (ISCC2015), 2015, pp. 18–19.
R. Krupali, D. Garg, and K. Kotecha, “An improved approach of fp-growth tree for frequent itemset mining using partition projection and parallel projection techniques,” International Recent and Innovation Trends in Computing and Communication, 2017, pp. 929–934.
M. Shridhar and M. Parmar, “Survey on association rule mining and its approaches,” Int J Comput Sci Eng, 2017, no. 3, pp. 129–135.
R. Agrawal, R. Srikant et al., “Fast algorithms for mining association rules,” in Proc. 20th int. conf. very large data bases, 1994, pp. 487–499.
H. Khanali and B. Vaziri, “A survey on improved algorithms for mining association rules,” Int. J. Comput. Appl, 2017, p. 8887.
J. Lu, W. Xu, K. Zhou, and Z. Guo, “Frequent itemset mining algorithm based on linear table,” Journal of Database Management (JDM), 2023, pp. 1–21.
M. Barkhan, R. Ramazani, and A. Kabani, “An algorithm to create sorted fp-growth tree for extracting association rules,” Research square, 2022, pp. 1–9.
M. K. Sohrabi and M. H. HASANNEJAD, “Association rule mining using new fp-linked list algorithm,” Journal of Advances in Computer Research,2016, pp. 23–34.
K. BHARATHI and D. B. DEVENDER, “Frequent itemset mining from big data using fp-growth algorithm,” Complexity International Journal (CIJ) , 2020, pp. 582–591.
B. Zhang, “Optimization of fp-growth algorithm based on cloud computing and computer big data,” International Journal of System Assurance Engineering and Management, 2021, pp. 853–863.
S. X. Le Zhang, X. Li, X. Wu, and P.-C. Chang, “An improved fp-growth algorithm based on projection database mining in big data,” Journal of Information Hiding and Multimedia Signal Processing, 2019, pp. 81–90.
M. El Hadi Benelhadj, M. M. Deye, and Y. Slimani, “Signaturebased tree for finding frequent itemsets,” Journal of Communications Software and Systems, 2023, pp. 70–80.
S. Bhise and S. Kale, “Effieient algorithms to find frequent itemset using data mining,” Int. Res. J. Eng. Technol., 2017, pp. 2645–2648.
A. S. Alhegami and H. A. Alsaeedi, “A framework for incremental parallel mining of interesting association patterns for big data,” International Journal of Computing, 2020, pp. 106–117.
H. A. Alsaeedi and A. S. Alhegami, “An incremental interesting maximal frequent itemset mining based on fp-growth algorithm,” Complexity, 2022,
C. J. Merz, “Uci repository of machine learning databases,” URL: http://www. ics. uci. edu/˜ mlearn/MLRepository. html, 1998
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.