Credit Card Fraud Detection Using Machine Learning Algorithms: A Comparative Study of Six Models
Keywords:
AUPRC-score, Adaboost, Combined Resampling Technique, K- fold Cross Validation, K-NN algorithm, SMOTE, Xgboost.Abstract
Credit Card Fraud (CCF) is a significant financial threat where individuals impersonate others to conduct unauthorized financial activities. This study implements a Combined Resampling Technique (CRT) as a specialized new approach for addressing class imbalances and enhancing model performance in credit card fraud (CCF) detection. Additionally, it evaluates the effectiveness of various machine learning models in accurately identifying fraudulent transactions. Performance metrics included cross-validation K-fold, AUPRC score, precision, recall, averages, and F1 scores. The models assessed encompass traditional algorithms like Logistic Regression, Decision Trees, and K-Nearest Neighbors (KNN), along with ensemble methods such as Random Forest, Adaboost, and Xgboost. The dataset utilised was a simulated data set containing credit card transactions spanning January 2019 to December 2020, involving 1000 customers and 800 merchants. The study addresses data imbalance using techniques like SMOTE and employs feature engineering for improved results. Notably, the K-NN algorithm demonstrated superior performance in detecting fraudulent transactions, making it a valuable tool in combating the CCF crisis.
Downloads
References
Dornadula, V.N. and Geetha, S., 2019. Credit card fraud detection using machine learning algorithms. Procedia computer science, 165, pp.631-641.
Taghiyev, K.R., Rustamov, T.H. and Hasanzade, A.A., 2021. Analysis Of Payment Cards Fraud Transactions And Measures To Prevent Them. Economic innovations, 23(2 (79)), pp.172-184.
Maniraj, S.P., Saini, A., Ahmed, S. and Sarkar, S., 2019. Credit card fraud detection using machine learning and data science. International Journal of Engineering Research, 8(9), pp.110-115.
Shirgave, S., Awati, C., More, R. and Patil, S., 2019. A review on credit card fraud detection using machine learning. International Journal of Scientific & technology research, 8(10), pp.1217-1220.
Mbakwe, A.B. and Adewale, S.A., Machine Learning and Applications: An International Journal (MLAIJ) Vol.9, No.4, December 2022
Azhan, M. and Meraj, S., 2020, December. Credit card fraud detection using machine learning and deep learning techniques. In 2020 3rd International Conference on Intelligent Sustainable Systems (ICISS) (pp. 514-518). IEEE.
Saheed, Y.K., Baba, U.A. and Raji, M.A., 2022. Big data analytics for credit card fraud detection using supervised machine learning models. In Big data analytics in the insurance market (pp. 31-56). Emerald Publishing Limited.
Bhavsar, A. and Patil, T., “Exploration of Various Machine Learning Algorithms”. In: Pratibha: International Journal of Science, Spirituality, Business And Technology (Ijssbt) (2021).
Chen, J.I.Z. and Lai, K.L., 2021. Deep convolution neural network model for credit-card fraud detection and alert. Journal of Artificial Intelligence, 3(02), pp.101-112.
Alfaiz, N.S. and Fati, S.M., 2022. Enhanced credit card fraud detection model using machine learning. Electronics, 11(4), p.662.
Trivedi, N.K., Simaiya, S., Lilhore, U.K. and Sharma, S.K., 2020. An efficient credit card fraud detection model based on machine learning methods. International Journal of Advanced Science and Technology, 29(5), pp.3414-3424.
Pudjihartono, N., Fadason, T., Kempa-Liehr, A.W. and O'Sullivan, J.M., 2022. A review of feature selection methods for machine learning-based disease risk prediction. Frontiers in Bioinformatics, 2, p.927312.
Chandrashekar, G. and Sahin, F., 2014. A survey on feature selection methods. Computers & electrical engineering, 40(1), pp.16-28.
He, S., Guo, F. and Zou, Q., 2020. MRMD2. 0: a python tool for machine learning with feature ranking and reduction. Current Bioinformatics, 15(10), pp.1213-1221.
Zhao, Z., Anand, R. and Wang, M., 2019, October. Maximum relevance and minimum redundancy feature selection methods for a marketing machine learning platform. In 2019 IEEE international conference on data science and advanced analytics (DSAA) (pp. 442-452). IEEE.
Tamir, A., Watson, E., Willett, B., Hasan, Q. and Yuan, J.S., 2021. Crime prediction and forecasting using machine learning algorithms. International Journal of Computer Science and Information Technologies, 12(2), pp.26-33.
Singh, A., Ranjan, R.K. and Tiwari, A., 2022. Credit card fraud detection under extreme imbalanced data: a comparative study of data-level algorithms. Journal of Experimental & Theoretical Artificial Intelligence, 34(4), pp.571-598.
Jain, Y., Tiwari, N., Dubey, S. and Jain, S., 2019. A comparative analysis of various credit card fraud detection techniques. International Journal of Recent Technology and Engineering, 7(5), pp.402-407.
Wong, T.T. and Yeh, P.Y., 2019. Reliable accuracy estimates from k-fold cross validation. IEEE Transactions on Knowledge and Data Engineering, 32(8), pp.1586-1594.
Kovács, G., 2019. An empirical comparison and evaluation of minority oversampling techniques on a large number of imbalanced datasets. Applied Soft Computing, 83, p.105662.
Bagga, S., Goyal, A., Gupta, N. and Goyal, A., 2020. Credit card fraud detection using pipeling and ensemble learning. Procedia Computer Science, 173, pp.104-112.
Wang, P., Fan, E. and Wang, P., 2021. Comparative analysis of image classification algorithms based on traditional machine learning and deep learning. Pattern recognition letters, 141, pp.61-67.
Tsangaratos, P. and Ilia, I., 2016. Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena, 145, pp.164-179.
Pranckevičius, T. and Marcinkevičius, V., 2017. Comparison of naive bayes, random forest, decision tree, support vector machines, and logistic regression classifiers for text reviews classification. Baltic Journal of Modern Computing, 5(2), p.221.
Khatri, S., Arora, A. and Agrawal, A.P., 2020, January. Supervised machine learning algorithms for credit card fraud detection: a comparison. In 2020 10th international conference on cloud computing, data science & engineering (confluence) (pp. 680-683). IEEE.
Ileberi, E., Sun, Y. and Wang, Z., 2021. Performance evaluation of machine learning methods for credit card fraud detection using SMOTE and AdaBoost. IEEE Access, 9, pp.165286-165294.
Dighe, D., Patil, S. and Kokate, S., 2018, August. Detection of credit card fraud transactions using machine learning algorithms and neural networks: A comparative study. In 2018 Fourth International Conference on Computing Communication Control and Automation (ICCUBEA) (pp. 1-6). IEEE.
García, V., Mollineda, R.A. and Sánchez, J.S., 2008. On the k-NN performance in a challenging scenario of imbalance and overlapping. Pattern Analysis and Applications, 11, pp.269-280.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.