Machine Learning-Based Phishing Detection: Improving Accuracy and Adaptability

Authors

  • Akhil Mittal, Pandi Kirupa Gopalakrishna Pandian

Keywords:

Machine Learning, Neural Network, Decision Tree, Random Forest, Feature Engineering

Abstract

In this modern era, phishing has become a great problem. Because of this, it can be observed that the personal information of people are leaked from emails & websites. Hence, it is needed that these instances of phishing are to be reduced. In doing one of the best tools that can be used is Machine Learning. This is a process of using historical data for making prediction of future scenarios. In this project, the details of the approached that can be used for the detection of phishing are analyzed. Moreover, the algorithms that are used by ML for this purpose are also envisaged here. The description of the process of collection of data is presented here. In addition to this, the results that shows the effectiveness of ML in the detection of phishing is also discussed here.

Downloads

Download data is not yet available.

References

Aljofey, A., Jiang, Q., Qu, Q., Huang, M. and Niyigena, J.P., 2020. An effective phishing detection model based on character level convolutional neural network from URL. Electronics, 9(9), p.1514.

Casimiro, M., Romano, P., Garlan, D., Moreno, G.A., Kang, E. and Klein, M., 2021, September. Self-Adaptation for Machine Learning Based Systems. In ECSA (Companion).

Deval, S.K., Tripathi, M., Bezawada, B. and Ray, I., 2021, October. “X-Phish: Days of Future Past”: Adaptive & Privacy Preserving Phishing Detection. In 2021 IEEE Conference on Communications and Network Security (CNS) (pp. 227-235). IEEE.

Do, N.Q., Selamat, A., Krejcar, O., Yokoi, T. and Fujita, H., 2021. Phishing webpage classification via deep learning-based algorithms: an empirical study. Applied Sciences, 11(19), p.9210.

Gandotra, E. and Gupta, D., 2021. An efficient approach for phishing detection using machine learning. Multimedia security: algorithm development, analysis and applications, pp.239-253.

Gupta, B.B., Yadav, K., Razzak, I., Psannis, K., Castiglione, A. and Chang, X., 2021. A novel approach for phishing URLs detection using lexical based machine learning in a real-time environment. Computer Communications, 175, pp.47-57.

Obaid, A.J., Ibrahim, K.K., Abdulbaqi, A.S. and Nejrs, S.M., 2021. An adaptive approach for internet phishing detection based on log data. Periodicals of Engineering and Natural Sciences, 9(4), pp.622-631.

Odeh, A., Keshta, I. and Abdelfattah, E., 2021. PHIBOOST-a novel phishing detection model using Adaptive boosting approach. Jordanian Journal of Computers and Information Technology (JJCIT), 7(01).

Rashid, J., Mahmood, T., Nisar, M.W. and Nazir, T., 2020, November. Phishing detection using machine learning technique. In 2020 first international conference of smart systems and emerging technologies (SMARTTECH) (pp. 43-46). IEEE.

Salahdine, F., El Mrabet, Z. and Kaabouch, N., 2021, December. Phishing attacks detection a machine learning-based approach. In 2021 IEEE 12th Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON) (pp. 0250-0255). IEEE.

Somesha, M., Pais, A.R., Rao, R.S. and Rathour, V.S., 2020. Efficient deep learning techniques for the detection of phishing websites. Sādhanā, 45, pp.1-18.

Tang, L. and Mahmoud, Q.H., 2021. A survey of machine learning-based solutions for phishing website detection. Machine Learning and Knowledge Extraction, 3(3), pp.672-694.

Kaur, Jagbir. "Streaming Data Analytics: Challenges and Opportunities." International Journal of Applied Engineering & Technology, vol. 5, no. S4, July-August 2023, pp. 10-16.https://romanpub.com/resources/ijaetv5-s4-july-aug-2023-2.pdf

Pandi Kirupa Kumari Gopalakrishna Pandian, Satyanarayan kanungo, J. K. A. C. P. K. C. (2022). Ethical Considerations in Ai and Ml: Bias Detection and Mitigation Strategies. International Journal on Recent and Innovation Trends in Computing and Communication, 10(12), 248–253. Retrieved from https://ijritcc.org/index.php/ijritcc/article/view/10511

Ashok : "Ashok Choppadandi, Jagbir Kaur, Pradeep Kumar Chenchala, Akshay Agarwal, Varun Nakra, Pandi Kirupa Gopalakrishna Pandian, 2021. "Anomaly Detection in Cybersecurity: Leveraging Machine Learning Algorithms" ESP Journal of Engineering & Technology Advancements 1(2): 34-41.")

Kaur, J. (2021). Big Data Visualization Techniques for Decision Support Systems. Jishu/Journal of Propulsion Technology, 42(4). https://propulsiontechjournal.com/index.php/journal/article/view/5701

Ashok : "Choppadandi, A., Kaur, J.,Chenchala, P. K., Nakra, V., & Pandian, P. K. K. G. (2020). Automating ERP Applications for Taxation Compliance using Machine Learning at SAP Labs. International Journal of Computer Science and Mobile Computing, 9(12), 103-112. https://doi.org/10.47760/ijcsmc.2020.v09i12.014

Chenchala, P. K., Choppadandi, A., Kaur, J., Nakra, V., & Pandian, P. K. G. (2020). Predictive Maintenance and Resource Optimization in Inventory Identification Tool Using ML. International Journal of Open Publication and Exploration, 8(2), 43-50. https://ijope.com/index.php/home/article/view/127

Kaur, J., Choppadandi, A., Chenchala, P. K., Nakra, V., & Pandian, P. K. G. (2019). AI Applications in Smart Cities: Experiences from Deploying ML Algorithms for Urban Planning and Resource Optimization. Tuijin Jishu/Journal of Propulsion Technology, 40(4), 50-56.

Case Studies on Improving User Interaction and Satisfaction using AI-Enabled Chatbots for Customer Service . (2019). International Journal of Transcontinental Discoveries, ISSN: 3006-628X, 6(1), 29-34. https://internatioHappy Guru Purnima sir charan sparsh aljournals.org/index.php/ijtd/article/view/98

Kaur, J., Choppadandi, A., Chenchala, P. K., Nakra, V., & Pandian, P. K. G. (2019). Case Studies on Improving User Interaction and Satisfaction using AI-Enabled Chatbots for Customer Service. International Journal

of Transcontinental Discoveries, 6(1), 29-34. https://internationaljournals.org/index.php/ijtd/article/view/98

Choppadandi, A., Kaur, J., Chenchala, P. K., Kanungo, S., & Pandian, P. K. K. G. (2019). AI-Driven Customer Relationship Management in PK Salon Management System. International Journal of Open Publication and Exploration, 7(2), 28-35. https://ijope.com/index.php/home/article/view/128

Ashok Choppadandi, Jagbir Kaur, Pradeep Kumar Chenchala, Akshay Agarwal, Varun Nakra, Pandi Kirupa Gopalakrishna Pandian, 2021. "Anomaly Detection in Cybersecurity: Leveraging Machine Learning Algorithms" ESP Journal of Engineering & Technology Advancements 1(2): 34-41.

Ashok Choppadandi et al, International Journal of Computer Science and Mobile Computing, Vol.9 Issue.12, December- 2020, pg. 103-112. ( Google scholar indexed)

Choppadandi, A., Kaur, J., Chenchala, P. K., Nakra, V., & Pandian, P. K. K. G. (2020). Automating ERP Applications for Taxation Compliance using Machine Learning at SAP Labs. International Journal of Computer Science and Mobile Computing, 9(12), 103-112. https://doi.org/10.47760/ijcsmc.2020.v09i12.014

[Chenchala, P. K., Choppadandi, A., Kaur, J., Nakra, V., & Pandian, P. K. G. (2020). Predictive Maintenance and Resource Optimization in Inventory Identification Tool Using ML. International Journal of Open Publication and Exploration, 8(2), 43-50. https://ijope.com/index.php/home/article/view/127]

AI-Driven Customer Relationship Management in PK Salon Management System. (2019). International Journal of Open Publication and Exploration, ISSN: 3006-2853, 7(2), 28-35. https://ijope.com/index.php/home/article/view/128

Pradeep Kumar Chenchala. (2023). Social Media Sentiment Analysis for Enhancing Demand Forecasting Models Using Machine Learning Models. International Journal on Recent and Innovation Trends in Computing and Communication, 11(6), 595–601. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/10762

Tilala, Mitul, Saigurudatta Pamulaparthyvenkata, Abhip Dilip Chawda, and Abhishek Pandurang Benke. "Explore the Technologies and Architectures Enabling Real-Time Data Processing within Healthcare Data Lakes, and How They Facilitate Immediate Clinical Decision-Making and Patient Care Interventions." European Chemical Bulletin 11, no. 12 (2022): 4537-4542. https://doi.org/10.53555/ecb/2022.11.12.425.

Mitul Tilala, Abhip Dilip Chawda, Abhishek Pandurang Benke, Akshay Agarwal. (2022). Regulatory Intelligence: Leveraging Data Analytics for Regulatory Decision-Making. International Journal of Multidisciplinary Innovation and Research Methodology, ISSN: 2960-2068, 1(1), 78–83. Retrieved from https://ijmirm.com/index.php/ijmirm/article/view/77

Mitul Tilala. (2023). Real-Time Data Processing in Healthcare: Architectures and Applications for Immediate Clinical Insights. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11), 1119–1125. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/10629

Tilala, Mitul, and Abhip Dilip Chawda. "Evaluation of Compliance Requirements for Annual Reports in Pharmaceutical Industries." NeuroQuantology 18, no. 11 (November 2020): 138-145. https://doi.org/10.48047/nq.2020.18.11.NQ20244.

Dodda, Suresh, Navin Kamuni, Venkata Sai Mahesh Vuppalapati, Jyothi Swaroop Arlagadda Narasimharaju, and Preetham Vemasani. "AI-driven Personalized Recommendations: Algorithms and Evaluation." Propulsion Tech Journal 44, no. 6 (December 1, 2023). https://propulsiontechjournal.com/index.php/journal/article/view/5587

Kamuni, Navin, Suresh Dodda, Venkata Sai Mahesh Vuppalapati, Jyothi Swaroop Arlagadda, and Preetham Vemasani. "Advancements in Reinforcement Learning Techniques for Robotics." Journal of Basic Science and Engineering 19, no. 1 (2022): 101-111. ISSN: 1005-0930.

Dodda, Suresh, Navin Kamuni, Jyothi Swaroop Arlagadda, Venkata Sai Mahesh Vuppalapati, and Preetham Vemasani. "A Survey of Deep Learning Approaches for Natural Language Processing Tasks." International Journal on Recent and Innovation Trends in Computing and Communication 9, no. 12 (December 2021): 27-36. ISSN: 2321-8169. http://www.ijritcc.org

Jigar Shah , Joel lopes , Nitin Prasad , Narendra Narukulla , Venudhar Rao Hajari , Lohith Paripati. (2023). Optimizing Resource Allocation And Scalability In Cloud-Based Machine Learning Models. Migration Letters, 20(S12), 1823–1832. Retrieved from https://migrationletters.com/index.php/ml/article/view/10652

Joel lopes, Arth Dave, Hemanth Swamy, Varun Nakra, & Akshay Agarwal. (2023). Machine Learning Techniques And Predictive Modeling For Retail Inventory Management Systems. Educational Administration: Theory and Practice, 29(4), 698–706. https://doi.org/10.53555/kuey.v29i4.5645

Narukulla, Narendra, Joel Lopes, Venudhar Rao Hajari, Nitin Prasad, and Hemanth Swamy. "Real-Time Data Processing and Predictive Analytics Using Cloud-Based Machine Learning." Tuijin Jishu/Journal of Propulsion Technology 42, no. 4 (2021): 91-102.

Nitin Prasad. (2022). Security Challenges and Solutions in Cloud-Based Artificial Intelligence and Machine Learning Systems. International Journal on Recent and Innovation Trends in Computing and Communication, 10(12), 286–292. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/10750

Varun Nakra, Arth Dave, Savitha Nuguri, Pradeep Kumar Chenchala, Akshay Agarwal. (2023). Robo-Advisors in Wealth Management: Exploring the Role of AI and ML in Financial Planning. European Economic Letters (EEL), 13(5), 2028–2039. Retrieved from https://www.eelet.org.uk/index.php/journal/article/view/1514

Varun Nakra. (2023). Enhancing Software Project Management and Task Allocation with AI and Machine Learning. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11), 1171–1178. Retrieved from https://www.ijritcc.org/index.php/ijritcc/article/view/10684

Joel lopes, Arth Dave, Hemanth Swamy, Varun Nakra, & Akshay Agarwal. (2023). Machine Learning Techniques And Predictive Modeling For Retail Inventory Management Systems. Educational Administration: Theory and Practice, 29(4), 698–706. https://doi.org/10.53555/kuey.v29i4.5645

Big Data Analytics using Machine Learning Techniques on Cloud Platforms. (2019). International Journal of Business Management and Visuals, ISSN: 3006-2705, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Shah, J., Prasad, N., Narukulla, N., Hajari, V. R., & Paripati, L. (2019). Big Data Analytics using Machine Learning Techniques on Cloud Platforms. International Journal of Business Management and Visuals, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Cygan, Kamil J., Ehdieh Khaledian, Lili Blumenberg, Robert R. Salzler, Darshit Shah, William Olson, Lynn E. Macdonald, Andrew J. Murphy, and Ankur Dhanik. "Rigorous Estimation of Post-Translational Proteasomal Splicing in the Immunopeptidome." bioRxiv (2021): 1-24. https://doi.org/10.1101/2021.05.26.445792

Shah, Darshit, Ankur Dhanik, Kamil Cygan, Olav Olsen, William Olson, and Robert Salzler. "Proteogenomics and de novo Sequencing Based Approach for Neoantigen Discovery from the Immunopeptidomes of Patient CRC Liver Metastases Using Mass Spectrometry." The Journal of Immunology 204, no. 1_Supplement (2020): 217.16-217.16. American Association of Immunologists.

Mahesula, Swetha, Itay Raphael, Rekha Raghunathan, Karan Kalsaria, Venkat Kotagiri, Anjali B. Purkar, Manjushree Anjanappa, Darshit Shah, Vidya Pericherla, Yeshwant Lal Avinash Jadhav, Jonathan A.L. Gelfond, Thomas G. Forsthuber, and William E. Haskins. "Immunoenrichment Microwave & Magnetic (IM2) Proteomics for Quantifying CD47 in the EAE Model of Multiple Sclerosis." Electrophoresis 33, no. 24 (2012): 3820-3829. https://doi.org/10.1002/elps.201200515.

Big Data Analytics using Machine Learning Techniques on Cloud Platforms. (2019). International Journal of Business Management and Visuals, ISSN: 3006-2705, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Cygan, K. J., Khaledian, E., Blumenberg, L., Salzler, R. R., Shah, D., Olson, W., & ... (2021). Rigorous estimation of post-translational proteasomal splicing in the immunopeptidome. bioRxiv, 2021.05.26.445792.

Mahesula, S., Raphael, I., Raghunathan, R., Kalsaria, K., Kotagiri, V., Purkar, A. B., & ... (2012). Immunoenrichment microwave and magnetic proteomics for quantifying CD 47 in the experimental autoimmune encephalomyelitis model of multiple sclerosis. Electrophoresis, 33(24), 3820-3829.

Mahesula, S., Raphael, I., Raghunathan, R., Kalsaria, K., Kotagiri, V., Purkar, A. B., & ... (2012). Immunoenrichment Microwave & Magnetic (IM2) Proteomics for Quantifying CD47 in the EAE Model of Multiple Sclerosis. Electrophoresis, 33(24), 3820.

Raphael, I., Mahesula, S., Kalsaria, K., Kotagiri, V., Purkar, A. B., Anjanappa, M., & ... (2012). Microwave and magnetic (M2) proteomics of the experimental autoimmune encephalomyelitis animal model of multiple sclerosis. Electrophoresis, 33(24), 3810-3819.

Salzler, R. R., Shah, D., Doré, A., Bauerlein, R., Miloscio, L., Latres, E., & ... (2016). Myostatin deficiency but not anti‐myostatin blockade induces marked proteomic changes in mouse skeletal muscle. Proteomics, 16(14), 2019-2027.

Shah, D., Anjanappa, M., Kumara, B. S., & Indiresh, K. M. (2012). Effect of post-harvest treatments and packaging on shelf life of cherry tomato cv. Marilee Cherry Red. Mysore Journal of Agricultural Sciences.

Shah, D., Dhanik, A., Cygan, K., Olsen, O., Olson, W., & Salzler, R. (2020). Proteogenomics and de novo sequencing based approach for neoantigen discovery from the immunopeptidomes of patient CRC liver metastases using Mass Spectrometry. The Journal of Immunology, 204(1_Supplement), 217.16-217.16.

Shah, D., Salzler, R., Chen, L., Olsen, O., & Olson, W. (2019). High-Throughput Discovery of Tumor-Specific HLA-Presented Peptides with Post-Translational Modifications. MSACL 2019 US.

Srivastava, M., Copin, R., Choy, A., Zhou, A., Olsen, O., Wolf, S., Shah, D., & ... (2022). Proteogenomic identification of Hepatitis B virus (HBV) genotype-specific HLA-I restricted peptides from HBV-positive patient liver tissues. Frontiers in Immunology, 13, 1032716.

Big Data Analytics using Machine Learning Techniques on Cloud Platforms. (2019). International Journal of Business Management and Visuals, ISSN: 3006-2705, 2(2), 54-58. https://ijbmv.com/index.php/home/article/view/76

Pavan Ogeti, Narendra Sharad Fadnavis, Gireesh Bhaulal Patil, Uday Krishna Padyana, Hitesh Premshankar Rai. (2022). Blockchain Technology for Secure and Transparent Financial Transactions. European Economic Letters (EEL), 12(2), 180–188. Retrieved from https://www.eelet.org.uk/index.php/journal/article/view/1283

Challa, S. S. S., Chawda, A. D., Benke, A. P., & Tilala, M. (2023). Regulatory intelligence: Leveraging data analytics for regulatory decision-making. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11), 1426-1434. Retrieved from http://www.ijritcc.org

Fadnavis, N. S., Patil, G. B., Padyana, U. K., Rai, H. P., & Ogeti, P. (2021). Optimizing scalability and performance in cloud services: Strategies and solutions. International Journal on Recent and Innovation Trends in Computing and Communication, 9(2), 14-23. Retrieved from http://www.ijritcc.org

Challa, S. S. S., Tilala, M., Chawda, A. D., & Benke, A. P. (2021). Navigating regulatory requirements for complex dosage forms: Insights from topical, parenteral, and ophthalmic products. NeuroQuantology, 19(12), 971-994. https://doi.org/10.48047/nq.2021.19.12.NQ21307

Fadnavis, N. S., Patil, G. B., Padyana, U. K., Rai, H. P., & Ogeti, P. (2020). Machine learning applications in climate modeling and weather forecasting. NeuroQuantology, 18(6), 135-145. https://doi.org/10.48047/nq.2020.18.6.NQ20194

Downloads

Published

09.07.2024

How to Cite

Akhil Mittal. (2024). Machine Learning-Based Phishing Detection: Improving Accuracy and Adaptability. International Journal of Intelligent Systems and Applications in Engineering, 12(22s), 587–595. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6524

Issue

Section

Research Article