Enhancing Dark Web Classification: A Dynamic Crawler and Robust Classification Framework

Authors

  • Sasirekha Devarajan Department of Computer Science, Anna Adarsh College for Women, Chennai-600040, Tamil Nadu, India.
  • Pakutharivu Panneerselvam Department of Computer Science, Anna Adarsh College for Women, Chennai-600040, Tamil Nadu, India.
  • Aditya Mudigonda JNIAS School of Planning and Architecture, Hyderabad, Telangana 500034, India.
  • Perichetla Kandaswamy Hemalatha Department of Mathematics, Vel Tech Rangarajan Dr Sagunthala R&D Institute of Science and Technology, Avadi.600062, Chennai, Tamil Nadu, India.

Keywords:

Data Mining module, Dark Web Classification, Neural Network, Support Vector Machine, Dark Web Links

Abstract

The dark web presents significant challenges for law enforcement agencies due to its anonymous and constantly evolving nature, making it difficult to trace and monitor illegal activities. To combat this issue, a proposed system collects and cleans dark web pages, focusing on studying the market that specializes in selling illegal and harmful products through three key modules. The Crawler module accesses the market through the Tor Network, gathers data, and extracts crucial information about the products, sellers, and prices. The Pre-Processing module cleans and organizes the extracted data, ensuring its integrity and transforming it into a mineable format. The Data Mining module extracts insights and knowledge from the processed data, using techniques like clustering, classification, and association rule mining to identify patterns and trends. These modules provide valuable insights to law enforcement agencies and security researchers to combat illicit activities on the dark web market. The system's efficiency is evaluated through metrics like throughput and speedup, demonstrating its capability to handle large datasets and improve performance through parallel processing. Additionally, the proposed Support Vector Machine (SVM) with Neural Network (NN) outperforms other methodologies, highlighting its accuracy in predicting dark web links and establishing a robust classification framework. The research contributes to a comprehensive understanding of the dark web landscape and fosters advancements in cybersecurity and law enforcement practices. By integrating these solutions, this study aims to enhance the accuracy, adaptability, and effectiveness of dark web classification. The research contributes to a comprehensive understanding of the dark web landscape and fosters advancements in cybersecurity and law enforcement practices.

Downloads

Download data is not yet available.

References

Lautenschlager, S. (2016). Surface web, deep web, dark web—What’s the difference. Cambia research [Online] Available at: https://www.cambiaresearch.com/articles/85/surface-web-deep-web-dark-web-whats-the-difference.

Dingledine, R., Mathewson, N., & Syverson, P. F. (2004, August). Tor: The second-generation Ahmad, Maynard, A., S., & Gupta, A. (2019). The dark web as a phenomenon: A review and research agenda. In 30th Australasian Conference on Information Systems. Perth, Australia. Onion router USENIX security symposium, 4.

Baney, L. L., & Lewis, N. M. (2018). Internet pharmacies: Trends, opportunities, and risks. Health Law, 31, 1.

Magnúsdóttir, H. (2019). Darknet drug markets in a Swedish context: A descriptive analysis of Wall Street market and flugsvamp 3.0.

Meland, P. H., Bayoumy, Y. F. F., & Sindre, G. (2020). The Ransomware-as-a-Service economy within the darknet. Computers and Security, 92, 101762. https://doi.org/10.1016/j.cose.2020.101762

Bracci, A., Nadini, M., Aliapoulios, M., McCoy, D., Gray, I., Teytelboym, A., Gallo, A., & Baronchelli, A. (2022). Vaccines and more: The response of Dark Web marketplaces to the ongoing COVID-19 pandemic. PLOS ONE, 17(11), e0275288. https://doi.org/10.1371/journal.pone.0275288

Abdel Samad, Y. (2021). Case study: Dark web markets. Dark web investigation (pp. 237–247).

Rajawat, A. S., Bedi, P., Goyal, S. B., Kautish, S., Xihua, Z., Aljuaid, H., & Mohamed, A. W. (2022). Dark web data classification using neural network. Computational Intelligence and Neuroscience, 2022, 8393318. https://doi.org/10.1155/2022/8393318

Ahmad, A., Maynard, S., & Gupta, A. (2019). The dark web as a phenomenon: A review and research agenda. In 30th Australasian Conference on Information Systems. Perth, Australia.

Rhumorbarbe, D., Werner, D., Gilliéron, Q., Staehli, L., Broséus, J., & Rossy, Q. (2018). Characterizing the online weapons trafficking on crypto markets. Forensic Science International, 283, 16–20. https://doi.org/10.1016/j.forsciint.2017.12.008

Liggett, R., Lee, J. R., Roddy, A. L., & Wallin, M. A. (2020). The dark web as a platform for crime: An exploration of illicit drug, firearm, CSAM, and cybercrime markets. The Palgrave handbook of international cybercrime and cyberdeviance (pp. 91–116).

Nazah, S., Huda, S., Abawajy, J., & Hassan, M. M. (2020). Evolution of dark web threat analysis and detection: A systematic approach. IEEE Access, 8, 171796–171819. https://doi.org/10.1109/ACCESS.2020.3024198

Gulati, H., Saxena, A., Pawar, N., Tanwar, P., & Sharma, S. (2022, January). Dark web in modern world theoretical perspective: A survey. In International Conference on Computer Communication and Informatics (ICCCI), 2022 (pp. 1–10). IEEE Publications. https://doi.org/10.1109/ICCCI54379.2022.9740785

Dalvi, A., Paranjpe, S., Amale, R., Kurumkar, S., Kazi, F., & Bhirud, S. G. (2021, May). SpyDark: Surface and dark web crawler. In 2nd International Conference on Secure Cyber Computing and Communications (ICSCCC), 2021 (pp. 45–49). IEEE Publications. https://doi.org/10.1109/ICSCCC51823.2021.9478098

Kadoguchi, M., Hayashi, S., Hashimoto, M., & Otsuka, A. (2019, July). Exploring the dark web for cyber threat intelligence using machine leaning. In IEEE International Conference on Intelligence and Security Informatics (ISI), 2019 (pp. 200–202). IEEE Publications. https://doi.org/10.1109/ISI.2019.8823360

Sasnouskaya, T. (2023). Unveiling the dark web and the impact of REvil's cyberattacks.

Saleem, J., Islam, R., & Kabir, M. A. (2022). The anonymity of the dark web: A survey. IEEE Access, 10, 33628–33660. https://doi.org/10.1109/ACCESS.2022.3161547

Qasem, A. E., & Sajid, M. (2022, October). Exploring the effect of N-grams with BOW and TF-IDF representations on detecting fake news. In International Conference on Data Analytics for Business and Industry (ICDABI), 2022 (pp. 741–746). IEEE Publications. https://doi.org/10.1109/ICDABI56818.2022.10041537

Nezhad, S. Z. (2023). Dark web traffic detection using supervised machine learning.

Basheer, R., & Alkhatib, B. (2021). Threats from the dark: A review over dark web investigation research for cyber threat intelligence. Journal of Computer Networks and Communications, 2021, 1–21. https://doi.org/10.1155/2021/1302999

Prabha, C., & Mittal, A. (2023, February). Dark Web: A review on the deeper side of the Web. In O. International (Ed.) Technology Conference on Emerging Technologies for Sustainable Development (OTCON), 2022 (pp. 1–6). IEEE Publications. https://doi.org/ 10.1109/OTCON56053.2023.10113989

Alaidi, A. H. M. (2022). Dark web illegal activities crawling and classifying using data mining techniques. iJIM. Roa’a, M. ALRikabi, H.T.S., Aljazaery, I. A, and Abbood, S.H., 16(10), 123.

Downloads

Published

30.11.2023

How to Cite

Devarajan, S. ., Panneerselvam, P. ., Mudigonda, A. ., & Hemalatha, P. K. . (2023). Enhancing Dark Web Classification: A Dynamic Crawler and Robust Classification Framework. International Journal of Intelligent Systems and Applications in Engineering, 12(6s), 01–09. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3925

Issue

Section

Research Article