Harmonizing Algorithms: An Approach to Enhancing Audio Deepfake Detection

Authors

  • Shwetambari Borade, Nilakshi Jain, Bhavesh Patel, Vineet Kumar, Yash Nagare, Shubham Kolaskar, Jayan Shah, Pratham Shah, Mustansir Godhrawala

Keywords:

Audio Deepfake Detection, Comparative Model Verification, Ethical Audio Forensics, Real-time Speech Authenticity, SVM-Neuron Network Fusion

Abstract

This research aims to enhance the detection of audio deepfakes by developing a real-time, highly accurate methodology that addresses existing technological and ethical gaps in the field. Employing advanced algorithms for feature extraction, the study innovatively utilizes a multifaceted approach by integrating an MFCC-based SVM classifier, which achieved a remarkable 97.28% accuracy, and a Neural Network with attention mechanisms, with a 91.04% accuracy rate. A novel aspect of our methodology is the use of multiple models in tandem to verify the authenticity of input audio, significantly boosting the reliability of detection. Leveraging the 'For-Original' dataset for exhaustive training and validation, our methods have shown exceptional effectiveness in distinguishing genuine audio from synthetic counterparts. These findings not only demonstrate significant improvements in existing deepfake detection techniques but also introduce a novel approach to comparative model analysis. This contribution is pivotal in advancing the field of digital media integrity, offering new avenues for ensuring the authenticity of audio content in the era of sophisticated digital forgeries.

Downloads

Download data is not yet available.

References

M. A. Khder, S. Shorman, D. T. Aldoseri and M. M. Saeed, "Artificial Intelligence into Multimedia Deepfakes Creation and Detection," 2023 International Conference on IT Innovation and Knowledge Discovery (ITIKD), Manama, Bahrain, 2023, pp. 1-5, doi: 10.1109/ITIKD56332.2023.10099744.

O. A. Shaaban, R. Yildirim and A. A. Alguttar, "Audio Deepfake Approaches," in IEEE Access, vol. 11, pp. 132652-132682, 2023, doi: 10.1109/ACCESS.2023.3333866.

JH. H. Kilinc and F. Kaledibi, "Audio Deepfake Detection by using Machine and Deep Learning," 2023 10th International Conference on Wireless Networks and Mobile Communications (WINCOM), Istanbul, Turkiye, 2023, pp. 1-5, doi: 10.1109/WINCOM59760.2023.10323004.

W. Yang et al., "AVoiD-DF: Audio-Visual Joint Learning for Detecting Deepfake," in IEEE Transactions on Information Forensics and Security, vol. 18, pp. 2015-2029, 2023, doi: 10.1109/TIFS.2023.3262148.

T. -P. Doan, L. Nguyen-Vu, S. Jung and K. Hong, "BTS-E: Audio Deepfake Detection Using Breathing-Talking-Silence Encoder," ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5, doi: 10.1109/ICASSP49357.2023.10095927.

R. L. M. A. P. C. Wijethunga, D. M. K. Matheesha, A. A. Noman, K. H. V. T. A. De Silva, M. Tissera and L. Rupasinghe, "Deepfake Audio Detection: A Deep Learning Based Solution for Group Conversations," 2020 2nd International Conference on Advancements in Computing (ICAC), Malabe, Sri Lanka, 2020, pp. 192-197, doi: 10.1109/ICAC51239.2020.9357161.

M. Li, Y. Ahmadiadli and X. -P. Zhang, "Robust Deepfake Audio Detection via Bi-Level Optimization," 2023 IEEE 25th International Workshop on Multimedia Signal Processing (MMSP), Poitiers, France, 2023, pp. 1-6, doi: 10.1109/MMSP59012.2023.10337724.

L. Wang, B. Yeoh and J. W. Ng, "Synthetic Voice Detection and Audio Splicing Detection using SE-Res2Net-Conformer Architecture," 2022 13th International Symposium on Chinese Spoken Language Processing (ISCSLP), Singapore, Singapore, 2022, pp. 115-119, doi: 10.1109/ISCSLP57327.2022.10037999.

A. Khovrat and V. Kobziev, "Using Recurrent and Convulation Neural Networks to Indentify the Fake Audio Messages," 2023 IEEE 7th International Conference on Methods and Systems of Navigation and Motion Control (MSNMC), Kyiv, Ukraine, 2023, pp. 174-177, doi: 10.1109/MSNMC61017.2023.10329236.

Yi, Jiangyan & Wang, Chenglong & Tao, Jianhua & Zhang, Xiaohui & Zhang, Chu & Zhao, Yan. (2023). Audio Deepfake Detection: A Survey.

Downloads

Published

24.03.2024

How to Cite

Jayan Shah, Pratham Shah, Mustansir Godhrawala, S. B. N. J. B. P. V. K. Y. N. S. K. . (2024). Harmonizing Algorithms: An Approach to Enhancing Audio Deepfake Detection. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 1297–1304. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5520

Issue

Section

Research Article

Similar Articles

You may also start an advanced similarity search for this article.