Leading the Way in Efficient Web Content Mining through Advanced Classification and Clustering Techniques
Keywords:
Data Mining, Heterogeneous networks, Knowledge Discovery, Text mining, Web mining
Abstract
This abstract, for the article "Clustering Techniques in Knowledge Discovery for Web Content Mining", centers on clustering techniques used in web content mining for knowledge discovery. It also mentions association rule mining, sequential pattern discovery, and clustering as data mining techniques for knowledge extraction.
Web mining, the process of obtaining information from web data, is a subset of knowledge discovery from databases (KDD) in which the data comes from the web. A particular kind of web mining, web usage mining (WUM), seeks to identify, assess, and make use of hidden knowledge in online data sources. Web usage mining draws on data from user registration forms, server access logs, user profiles, and transactions.
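As a rough illustration of how server access logs can feed web usage mining, the sketch below groups the pages each visitor requested successfully. It assumes the logs follow the Common Log Format; the sample lines, the `LOG_PATTERN` expression, and the `pages_per_visitor` helper are hypothetical and are not taken from the article.

```python
import re
from collections import defaultdict

# Assumed layout: Common Log Format (host, identity, user, time, request, status, size).
LOG_PATTERN = re.compile(
    r'^(\S+) \S+ \S+ \[([^\]]+)\] "(\S+) (\S+) [^"]*" (\d{3}) \S+'
)

def pages_per_visitor(lines):
    """Map each visitor host to the list of paths it requested successfully."""
    visits = defaultdict(list)
    for line in lines:
        match = LOG_PATTERN.match(line)
        if match is None:
            continue  # skip lines that do not fit the assumed format
        host, _timestamp, _method, path, status = match.groups()
        if status.startswith("2"):  # keep only successful (2xx) requests
            visits[host].append(path)
    return dict(visits)

# Hypothetical sample data, using documentation-only IP addresses.
sample_log = [
    '192.0.2.1 - - [10/Oct/2023:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 2326',
    '192.0.2.1 - - [10/Oct/2023:13:56:01 +0000] "GET /products.html HTTP/1.1" 200 1045',
    '192.0.2.7 - - [10/Oct/2023:13:57:12 +0000] "GET /index.html HTTP/1.1" 200 2326',
    '192.0.2.7 - - [10/Oct/2023:13:57:40 +0000] "GET /missing.html HTTP/1.1" 404 512',
]
paths = pages_per_visitor(sample_log)
print(paths)
```

Per-visitor page lists of this kind are one plausible input for later steps such as sequential pattern discovery or clustering; a real pipeline would also need session splitting and robot filtering, which this sketch omits.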
Clustering algorithms are mentioned as one technique used in web content mining for knowledge discovery. In this context, clustering is the process of grouping similar data points according to their shared traits or patterns. Clustering may be used to find page sets, page sequences, and page graphs.
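The idea of grouping pages by shared traits can be sketched in a few lines. The example below is a minimal illustration, not the article's method: it assumes a greedy single-pass scheme with Jaccard similarity over word sets, and the page names, texts, `threshold` value, and helper functions are all hypothetical.

```python
import re

def tokens(text):
    """Lowercase a page's text and return its set of word tokens."""
    return set(re.findall(r"[a-z]+", text.lower()))

def jaccard(a, b):
    """Share of tokens two pages have in common (0.0 to 1.0)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)

def cluster_pages(pages, threshold=0.3):
    """Greedy single-pass clustering: a page joins the first cluster whose
    seed page is similar enough, otherwise it starts a new cluster."""
    clusters = []  # each item: (seed_tokens, [page names])
    for name, text in pages.items():
        page_tokens = tokens(text)
        for seed_tokens, members in clusters:
            if jaccard(seed_tokens, page_tokens) >= threshold:
                members.append(name)
                break
        else:
            clusters.append((page_tokens, [name]))
    return [members for _, members in clusters]

# Hypothetical page contents.
pages = {
    "a.html": "cheap flights and hotel deals",
    "b.html": "hotel deals and cheap flights today",
    "c.html": "python clustering tutorial",
    "d.html": "clustering tutorial in python code",
}
groups = cluster_pages(pages)
print(groups)
```

The result depends on the order of the pages and on the chosen threshold; algorithms such as k-means or hierarchical clustering avoid some of that sensitivity at the cost of more machinery.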
The abstract also mentions text analysis methods for knowledge discovery from unstructured materials, including feature extraction, theme indexing, clustering, and summarization. These methods make it possible to extract valuable information from documents such as press releases, emails, notes, contracts, government reports, and news feeds.
Overall, the abstract gives an overview of clustering algorithms in knowledge discovery for web content mining. It highlights the clustering approach in web usage mining and the use of text analysis tools to extract knowledge from unstructured documents.
License

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets others remix, transform, and build upon the material, provided they give appropriate credit, provide a link to the license, indicate if changes were made, and distribute their contributions under the same license as the original.