Leading the Way in Efficient Web Content Mining through Advanced Classification and Clustering Techniques


  • Yogesha T., Thimmaraju S. N.


Data Mining, Heterogeneous networks, Knowledge Discovery, Text mining, Web mining


The clustering techniques in online content mining for knowledge discovery is the main topic of the abstract for the article "Clustering Techniques in Knowledge Discovery for Web Content Mining". The application of association rule mining, sequential pattern discovery, and clustering as data mining techniques for knowledge extraction is mentioned.

When the data comes from the online, web mining—the process of obtaining information from web data—is referred to as a subset of knowledge discovery from databases (KDD).  A particular kind of web mining called web use mining (WUM) seeks to identify, assess, and make use of hidden knowledge from online data sources. Data from user registration forms, server access logs, user profiles, and transactions are used in web use mining.

It is mentioned that one technique utilized in online content mining for knowledge discovery is clustering algorithms. In the context of online content mining, clustering is the process of assembling comparable data points into groups according to their shared traits or patterns. Clustering may be used to find page sets, page sequences, and page graphs.

The use of text analysis methods for knowledge discovery from unstructured materials, including feature extraction, theme indexing, clustering, and summarization, is also mentioned in the abstract. Press releases, emails, notes, contracts, government reports, and news feeds are just a few of the documents from which valuable information may be extracted thanks to these strategies.

An overview of the use of clustering algorithms in knowledge discovery for online content mining is given in the abstract overall. It highlights the use of text analysis tools to extract knowledge from unstructured documents and the clustering approach in online use mining.


Download data is not yet available.


Shu, Xiaoling & Ye, Yiwan. (2022). Knowledge Discovery: Methods from data mining and machine learning. Social Science Research. 110. 102817. 10.1016/j.ssresearch.2022.102817

Allahyari, Mehdi & Pouriyeh, Seyedamin & Assefi, Mehdi & Safaei, Saied & Trippe, Elizabeth & Gutiérrez, Juan & Kochut, Krys. (2017). A Brief Survey of Text Mining: Classification, Clustering and Extraction Techniques.

Dash, Yajnaseni. (2013). A Review of Clustering and Classification Techniques in Data Mining.

P. Madhura, M. Padmavathamma, 2015, A Web Mining Process for Knowledge Discovery of Web usage Patterns, INTERNATIONAL JOURNAL OF ENGINEERING RESEARCH & TECHNOLOGY (IJERT) NCACI – 2015

Xiaoling Shu, Yiwan Ye,Knowledge Discovery: Methods from data mining and machine learning, Social Science Research, Volume 110, 2023,102817,ISSN 0049-089X,

Antonia Kyriakopoulou,”Text Classification Aided by Clustering: a Literature Review” in “Tools in Artificial Intelligence” doi: 10.5772/6083

Ngai, Eric & Xiu, Li & Chau, Dorothy. (2009). Application of data mining techniques in customer relationship management: A literature review and classification. Expert Syst. Appl. 36. 2592-2602. 10.1016/j.eswa.2008.02.021.

Lorena Siguenza-Guzman, Victor Saquicela, Elina Avila-Ordóñez, Joos Vandewalle, Dirk Cattrysse, Literature Review of Data Mining Applications in Academic Libraries, The Journal of Academic Librarianship, Volume 41, Issue 4, 2015, Pages 499-510, ISSN 0099-1333,

Shafiq Alam, Gillian Dobbie, Yun Sing Koh, Patricia Riddle, Saeed Ur Rehman, Research on particle swarm optimization based clustering: A systematic review of literature and techniques, Swarm and Evolutionary Computation, Volume 17, 2014, Pages 1-13, ISSN 2210-6502,https://doi.org/10.1016/j.swevo.2014.02.001.

International Journal of Scientific Research in Computer Science, Engineering and Information Technology A Survey on Text Mining - Techniques, Application 2023

K. Mohan, “A survey on web structure mining,” International Journal of Advanced Computer Research, vol. 1, no. 1, pp. 715–720, 2017.

S. Ahmad, A. A. Bakar, and M. R. Yaakub, “Movie revenue prediction based on purchase intention mining using YouTube trailer reviews,” Information Processing & Management, vol. 57, no. 5, Article ID 102278, 2020.

Prem Sagar Sharma, Divakar Yadav, R. N. Thakur, "Web Page Ranking Using Web Mining Techniques: A Comprehensive Survey", Mobile Information Systems, vol. 2022, Article ID 7519573, 19 pages, 2022. https://doi.org/10.1155/2022/7519573

http://paginas.fe.up.pt/~ec/files_0506/slides/06_WebMining.pdf [Accessed on Feb. 6,2013]

Chintandeep Kaur, Rinkle Rani Aggarwal,” Web Mining Tasks & Types: A Survey”, International Journal of Research in IT & Management, Volume 2, Issue 2 (ISSN 2231-4334), February 2012, Pp- 547-559.

K. Wang and H. Liu. Discovering T ypical Structures of Documents: A Road Map Approach. In 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 146–154, 1998.

Bassiou, N. and Kotropoulos, C. 2006. Color Histogram Equalization using Probability Smoothening. Proceedings of XIV European Signal Processing Conference

Shu, Xiaoling & Ye, Yiwan. (2022). Knowledge Discovery: Methods from data mining and machine learning. Social Science Research. 110. 102817. 10.1016/j.ssresearch.2022.102817.




How to Cite

Thimmaraju S. N., Y. T. . (2024). Leading the Way in Efficient Web Content Mining through Advanced Classification and Clustering Techniques. International Journal of Intelligent Systems and Applications in Engineering, 12(21s), 1191–1195. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5571



Research Article