Optimization of Naïve Bayes Classifier for Spam E-Mail Detection

Authors

  • Sai Charan Lanka, Kodali Pujita, Kommana Akhila, Shayan Mondal, P. Vidya Sagar, Suneetha Bulla

Keywords:

information, facilitates, Spam Mails, Machine Learning, optimize

Abstract

: E-Mail, a popular and official communication platform, widely used method of communication that facilitates exchange of information between individuals or organizations in a convenient and efficient way to send and receive any kind of information instantly from any corner of the world.  But, due to this drastic growth of the usage of E-Mail, spammers are using this platform to perform frauds through mails that mails are known as Spam Mails. Spam Mails can be detected and identified using various approaches. Among those approaches Machine Learning is widely used. In Machine Learning, Naïve Bayes Classifier stands out with the highest accuracy this is due to “Low False Positive Error Rate”. Although Naïve Bayes Classifier gives us the best accuracy among all other Machine Learning models, we can still optimize it to give a better accuracy

Downloads

Download data is not yet available.

References

Lanka, S. C., Akhila, K., Pujita, K., Sagar, P. V., Mondal, S., & Bulla, S. (2023, March). Spam based Email Identification and Detection using Machine Learning Techniques. In 2023 International Conference on Sustainable Computing and Data Communication Systems (ICSCDS) (pp. 69-74). IEEE.

Renuka, D. K., Visalakshi, P., & Sankar, T. J. I. J. C. A. (2015). Improving E-mail spam classification using ant colony optimization algorithm. Int. J. Comput. Appl, 22, 22-26.

Hossain, F., Uddin, M. N., & Halder, R. K. (2021, April). Analysis of optimized machine learning and deep learning techniques for spam detection. In 2021 IEEE International IOT, Electronics and Mechatronics Conference (IEMTRONICS) (pp. 1-7). IEEE.

Agarwal, K., & Kumar, T. (2018, June). Email spam detection using integrated approach of Naïve Bayes and particle swarm optimization. In 2018 Second International Conference on Intelligent Computing and Control Systems (ICICCS) (pp. 685-690). IEEE.

Sharma, P., & Bhardwaj, U. (2018). Machine Learning based Spam E-Mail Detection. International Journal of Intelligent Engineering & Systems, 11(3).

Faris, H., Aljarah, I., & Al-Shboul, B. (2016). A hybrid approach based on particle swarm optimization and random forests for e-mail spam filtering. In Computational Collective Intelligence: 8th International Conference, ICCCI 2016, Halkidiki, Greece, September 28-30, 2016. Proceedings, Part I 8 (pp. 498-508). Springer International Publishing.

Hassani, Z., Hajihashemi, V., Borna, K., & Sahraei Dehmajnoonie, I. (2020). A classification method for E-mail spam using a hybrid approach for feature selection optimization. Journal of Sciences, Islamic Republic of Iran, 31(2), 165-173.

Awad, W. A., & ELseuofi, S. M. (2011). Machine learning methods for spam e-mail classification. International Journal of Computer Science & Information Technology (IJCSIT), 3(1), 173-184.

Bhuiyan, H., Ashiquzzaman, A., Juthi, T. I., Biswas, S., & Ara, J. (2018). A survey of existing e-mail spam filtering methods considering machine learning techniques. Global Journal of Computer Science and Technology, 18(2), 20-29.

Peng, W., Huang, L., Jia, J., & Ingram, E. (2018, August). Enhancing the naive bayes spam filter through intelligent text modification detection. In 2018 17th IEEE international conference on trust, security and privacy in computing and communications/12th IEEE international conference on big data science and engineering (TrustCom/BigDataSE) (pp. 849-854). IEEE.

Al-Rawashdeh, G., Mamat, R., & Abd Rahim, N. H. B. (2019). Hybrid water cycle optimization algorithm with simulated annealing for spam e-mail detection. IEEE Access, 7, 143721-143734.

C. M. Shaik, N. M. Penumaka, S. K. Abbireddy, V. Kumar and S. S. Aravinth, "Bi-LSTM and Conventional Classifiers for Email Spam Filtering," 2023 Third International Conference on Artificial Intelligence and Smart Energy (ICAIS), Coimbatore, India, 2023, pp. 1350-1355, doi: 10.1109/ICAIS56108.2023.10073776.

S. Gibson, B. Issac, L. Zhang and S. M. Jacob, "Detecting Spam Email With Machine Learning Optimized With Bio-Inspired Metaheuristic Algorithms," in IEEE Access, vol. 8, pp. 187914-187932, 2020, doi: 10.1109/ACCESS.2020.3030751.

Hosseinalipour, A., Ghanbarzadeh, R. A novel approach for spam detection using horse herd optimization algorithm. Neural Comput & Applic 34, 13091–13105 (2022). https://doi.org/10.1007/s00521-022-07148-x

Ashraf S. Mashaleh, Noor Farizah Binti Ibrahim, Mohammed Azmi Al-Betar, Hossam M.J. Mustafa, Qussai M. Yaseen,Detecting Spam Email with Machine Learning Optimized with Harris Hawks optimizer (HHO) Algorithm,Procedia Computer Science,Volume 201,2022,Pages 659-664,ISSN 1877-0509,https://doi.org/10.1016/j.procs.2022.03.087.

Downloads

Published

27.03.2024

How to Cite

Shayan Mondal, P. Vidya Sagar, Suneetha Bulla, S. C. L. K. P. K. A. (2024). Optimization of Naïve Bayes Classifier for Spam E-Mail Detection. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 1455–1460. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5538

Issue

Section

Research Article