Credit Card Fraud Detection Using Machine Learning Algorithms: A Comparative Study of Six Models

Authors

  • Joseph J. Assabil, Ibidun Christiana Obagbuwa

Keywords:

AUPRC-score, Adaboost, Combined Resampling Technique, K- fold Cross Validation, K-NN algorithm, SMOTE, Xgboost.

Abstract

Credit Card Fraud (CCF) is a significant financial threat where individuals impersonate others to conduct unauthorized financial activities. This study implements a Combined Resampling Technique (CRT) as a specialized new approach for addressing class imbalances and enhancing model performance in credit card fraud (CCF) detection. Additionally, it evaluates the effectiveness of various machine learning models in accurately identifying fraudulent transactions. Performance metrics included cross-validation K-fold, AUPRC score, precision, recall, averages, and F1 scores. The models assessed encompass traditional algorithms like Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN), along with ensemble methods such as Random Forest, Adaboost, and Xgboost. The dataset utilised was a simulated data set containing credit card transactions spanning January 2019 to December 2020, involving 1000 customers and 800 merchants. The study addresses data imbalance using techniques like SMOTE and employs feature engineering for improved results. Notably, the K-NN algorithm demonstrated superior performance in detecting fraudulent transactions, making it a valuable tool in combating the CCF crisis.

Downloads

Download data is not yet available.

References

Dornadula, V.N. and Geetha, S., 2019. Credit card fraud detection using machine learning algorithms. Procedia computer science, 165, pp.631-641.

Taghiyev, K.R., Rustamov, T.H. and Hasanzade, A.A., 2021. Analysis Of Payment Cards Fraud Transactions And Measures To Prevent Them. Economic innovations, 23(2 (79)), pp.172-184.

Maniraj, S.P., Saini, A., Ahmed, S. and Sarkar, S., 2019. Credit card fraud detection using machine learning and data science. International Journal of Engineering Research, 8(9), pp.110-115.

Shirgave, S., Awati, C., More, R. and Patil, S., 2019. A review on credit card fraud detection using machine learning. International Journal of Scientific & technology research, 8(10), pp.1217-1220.

Mbakwe, A.B. and Adewale, S.A., Machine Learning and Applications: An International Journal (MLAIJ) Vol.9, No.4, December 2022

Azhan, M. and Meraj, S., 2020, December. Credit card fraud detection using machine learning and deep learning techniques. In 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS) (pp. 514-518). IEEE.

Saheed, Y.K., Baba, U.A. and Raji, M.A., 2022. Big data analytics for credit card fraud detection using supervised machine learning models. In Big data analytics in the insurance market (pp. 31-56). Emerald Publishing Limited.

Bhavsar, A. and Patil, T., “Exploration of Various Machine Learning Algorithms”. In: Pratibha: International Journal of Science, Spirituality, Business And Technology (Ijssbt) (2021).

Chen, J.I.Z. and Lai, K.L., 2021. Deep convolution neural network model for credit-card fraud detection and alert. Journal of Artificial Intelligence, 3(02), pp.101-112.

Alfaiz, N.S. and Fati, S.M., 2022. Enhanced credit card fraud detection model using machine learning. Electronics, 11(4), p.662.

Trivedi, N.K., Simaiya, S., Lilhore, U.K. and Sharma, S.K., 2020. An efficient credit card fraud detection model based on machine learning methods. International Journal of Advanced Science and Technology, 29(5), pp.3414-3424.

Pudjihartono, N., Fadason, T., Kempa-Liehr, A.W. and O'Sullivan, J.M., 2022. A review of feature selection methods for machine learning-based disease risk prediction. Frontiers in Bioinformatics, 2, p.927312.

Chandrashekar, G. and Sahin, F., 2014. A survey on feature selection methods. Computers & electrical engineering, 40(1), pp.16-28.

He, S., Guo, F. and Zou, Q., 2020. MRMD2. 0: a python tool for machine learning with feature ranking and reduction. Current Bioinformatics, 15(10), pp.1213-1221.

Zhao, Z., Anand, R. and Wang, M., 2019, October. Maximum relevance and minimum redundancy feature selection methods for a marketing machine learning platform. In 2019 IEEE international conference on data science and advanced analytics (DSAA) (pp. 442-452). IEEE.

Tamir, A., Watson, E., Willett, B., Hasan, Q. and Yuan, J.S., 2021. Crime prediction and forecasting using machine learning algorithms. International Journal of Computer Science and Information Technologies, 12(2), pp.26-33.

Singh, A., Ranjan, R.K. and Tiwari, A., 2022. Credit card fraud detection under extreme imbalanced data: a comparative study of data-level algorithms. Journal of Experimental & Theoretical Artificial Intelligence, 34(4), pp.571-598.

Jain, Y., Tiwari, N., Dubey, S. and Jain, S., 2019. A comparative analysis of various credit card fraud detection techniques. International Journal of Recent Technology and Engineering, 7(5), pp.402-407.

Wong, T.T. and Yeh, P.Y., 2019. Reliable accuracy estimates from k-fold cross validation. IEEE Transactions on Knowledge and Data Engineering, 32(8), pp.1586-1594.

Kovács, G., 2019. An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets. Applied Soft Computing, 83, p.105662.

Bagga, S., Goyal, A., Gupta, N. and Goyal, A., 2020. Credit card fraud detection using pipeling and ensemble learning. Procedia Computer Science, 173, pp.104-112.

Wang, P., Fan, E. and Wang, P., 2021. Comparative analysis of image classification algorithms based on traditional machine learning and deep learning. Pattern recognition letters, 141, pp.61-67.

Tsangaratos, P. and Ilia, I., 2016. Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena, 145, pp.164-179.

Pranckevičius, T. and Marcinkevičius, V., 2017. Comparison of naive bayes, random forest, decision tree, support vector machines, and logistic regression classifiers for text reviews classification. Baltic Journal of Modern Computing, 5(2), p.221.

Khatri, S., Arora, A. and Agrawal, A.P., 2020, January. Supervised machine learning algorithms for credit card fraud detection: a comparison. In 2020 10th international conference on cloud computing, data science & engineering (confluence) (pp. 680-683). IEEE.

Ileberi, E., Sun, Y. and Wang, Z., 2021. Performance evaluation of machine learning methods for credit card fraud detection using SMOTE and AdaBoost. IEEE Access, 9, pp.165286-165294.

Dighe, D., Patil, S. and Kokate, S., 2018, August. Detection of credit card fraud transactions using machine learning algorithms and neural networks: A comparative study. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) (pp. 1-6). IEEE.

García, V., Mollineda, R.A. and Sánchez, J.S., 2008. On the k-NN performance in a challenging scenario of imbalance and overlapping. Pattern Analysis and Applications, 11, pp.269-280.

Downloads

Published

06.08.2024

How to Cite

Joseph J. Assabil. (2024). Credit Card Fraud Detection Using Machine Learning Algorithms: A Comparative Study of Six Models . International Journal of Intelligent Systems and Applications in Engineering, 12(23s), 862 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/7040

Issue

Section

Research Article