Combating Fake News on Twitter: A Machine Learning Approach for Detection and Classification of Fake Tweets

Authors

  • Archana Nanade Sir Padampat Singhania University, Udaipur, Rajasthan
  • Arun Kumar Sir Padampat Singhania University, Udaipur, Rajasthan

Keywords:

BERT, Twitter, Fakes News detection

Abstract

The proliferation of false information across social media has emerged as a significant challenge in recent times. Within this landscape, the platform formerly known as Twitter, now referred to as X, stands as a widely used medium for disseminating news and data, rendering it especially susceptible to the proliferation of misleading or fabricated content. In response to this issue, this research paper presents an innovative strategy hinging on BERT (Bidirectional Encoder Representations from Transformers) technology to address the propagation of deceptive news on Twitter. By means of meticulously acquired Twitter data pertaining to news items, the dataset undergoes careful pre-processing to render it compatible with the BERT model. This data is subsequently partitioned into training and validation subsets. The BERT model is then meticulously fine-tuned via a feed-forward neural network and optimized leveraging the Adam optimizer. Throughout the training process, vigilance is maintained over loss values, augmented by techniques such as dropout and regularization to enhance generalization capabilities. The final model selection is dictated by the validation loss metric. Harnessing the capabilities of advanced natural language processing methods, this endeavour contributes to the evolution of robust instruments for unearthing misleading content, fostering a more dependable informational milieu across social media domains with an accuracy of 77.29%. The study's conclusions carry implications for elevating the integrity and credibility of news propagation within the digital epoch

Downloads

Download data is not yet available.

References

Vosoughi, S., Roy, D., & Aral, S. (2018). The spread of true and false news online. Science, 359(6380), 1146-1151.

Shao, C., Ciampaglia, G. L., Varol, O., Yang, K. C., Flammini, A., & Menczer, F. (2018). The spread of low-credibility content by social bots. Nature communications, 9(1), 1-10.

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Bidirectional Encoder Representations from Transformers. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL-HLT), 4171-4186.

H. C. Jwa, H. Kim, and J. Park, "FakeBERT: Fake news detection in social media with a BERT-based deep learning approach," Information Sciences, vol. 521, pp. 118-134, 2020.

N. Bounaama and M. Abderrahim, "Classifying COVID-19 related tweets for fake news detection and sentiment analysis with BERT-based models," Applied Sciences, vol. 13, no. 12, p. 6275, 2023.

S. Singh, P. Shukla, and H. Kaur, "Fake or real news about COVID-19? Pretrained transformer model to detect potential misleading news," Engineering Applications of Artificial Intelligence, vol. 116, p. 104113, 2022.

J. Zhang, Z. Wang, and Y. Liu, "BERT-Tweet: A pre-trained language model for English Tweets," arXiv preprint arXiv:2001.08308, 2020.

R. Kumar, A. Verma, and N. Singh, "Fake news detection on Twitter using natural language processing techniques," Information Processing & Management, vol. 57, no. 2, pp. 367-384, 2020.

A. Verma, N. Singh, and R. Kumar, "A deep learning approach to fake news detection on Twitter," Expert Systems with Applications, vol. 114, pp. 137-147, 2018.

M. Imran, M. Mohsin, and J. Qadir, "Detecting fake news on Twitter using machine learning techniques," Information Processing & Management, vol. 53, no. 4, pp. 779-791, 2017.

K. Shu, D. Mahudeswaran, S. Wang, D. Lee, and H. Liu, "Fake news detection on Twitter with feature engineering and machine learning," arXiv preprint arXiv:1609.08253, 2016.

F. Wang, J. Xu, Z. Li, and H. Zhang, "Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity," in Proceedings of the 35th International Conference on Machine Learning (ICML), 2018.

Z. Ma and A. Sun, "Detecting Rumors from Microblogs with Recurrent Neural Networks," in Proceedings of the 25th International Conference on World Wide Web (WWW), 2016.

S. Wu, D. Yang, and N. Xu, "Who Falls for Fake News? The Roles of Analytic Thinking, Motivated Reasoning, Political Ideology, and Bullshit Receptivity," in Journal of Applied Research in Memory and Cognition, 2020.

C. Castillo, M. Mendoza, and B. Poblete, "Information Credibility on Twitter," in Proceedings of the 20th International Conference on World Wide Web (WWW), 2011.

N. Ruchansky, S. Seo, and Y. Liu, "CSI: A Hybrid Deep Model for Fake News Detection," in Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM), 2017.

K. Shu, D. Mahudeswaran, S. Wang, D. Lee, and H. Liu, "Fake news detection on Twitter with feature engineering and machine learning," arXiv preprint arXiv:1609.08253, 2016.

H. Ahmed, I. Traore, and S. Saad, "Detection of online fake news using N-gram analysis and machine learning techniques," in International conference on intelligent, secure, and dependable systems in distributed and cloud environments, Springer, Cham, pp. 127–138, 2017.

A. Reema, A. K. Kar, and P. Vigneswara Ilavarasan, "Detection of spammers in twitter marketing: a hybrid approach using social media analytics and bio-inspired computing," in Information Systems Frontiers, vol. 20, no. 3, pp. 515–530, 2018.

K. Shu, S. Wang, and H. Liu, "Beyond news contents: The role of social context for fake news detection," in Proceedings of the twelfth ACM international conference on web search and data mining, pp. 312–320, ACM, 2019.

D. Munandar, A. Arisal, D. Riswantini, and A. F. Rozie, "Text classification for sentiment prediction of social media dataset using multichannel convolution neural network," in 2018 International conference on computer, control, informatics and its applications (IC3INA), IEEE, pp. 104–109, 2018.

A. Gupta and P. Kumaraguru, "Credibility ranking of tweets during high impact events," in Proceedings of the 1st Workshop on Privacy and Security in Online Social Media, ser. PSOSM '12, New York, NY, USA, ACM, pp. 2:2–2:8, 2012.

S. Mohd Shariff, X. Zhang, and M. Sanderson, "User perception of information credibility of news on Twitter," in Proceedings of the 36th European Conference on IR Research on Advances in Information Retrieval - Volume 8416, ser. ECIR 2014, New York, NY, USA, Springer-Verlag New York, Inc., pp. 513–518, 2014.

S. Sikdar, S. Adali, M. Amin, T. Abdelzaher, K. Chan, J. H. Cho, B. Kang, and J. O’Donovan, "Finding true and credible information on Twitter," in 17th International Conference on Information Fusion (FUSION), July 2014, pp. 1–8.

Author(s), "Title of the Webpage," Zenodo, [Online]. Available: https://zenodo.org/. [Accessed: August 16, 2023].

W. McKinney, "Pandas: a foundational Python library for data analysis and statistics," IEEE Transactions on Visualization and Computer Graphics, vol. 26, no. 1, pp. 55-66, 2020. DOI: 10.1109/TVCG.2019.2930765.

"Disasters on Social Media," Kaggle, [Online]. Available: https://www.kaggle.com/datasets/jannesklaas/disasters-on-social-media. [Accessed: August 16, 2023].

C. Crammer, O. Dekel, J. Keshet, S. Shalev-Shwartz, and Y. Singer, "Online Passive-Aggressive Algorithms," Journal of Machine Learning Research, vol. 7, pp. 551–585, 2006.

A. Radford, J. Wu, R. Child, D. Luan, D. Amodei, and I. Sutskever, "Language models are unsupervised multitask learners," OpenAI Blog, vol. 1, no. 8, p. 9, 2019.

Archana Nanade, Dr. Amit Jain, Dr. Prateek Srivastava, Shweta Lalwani Hod, “Machine Learning Based Fake News Detection Using Natural Language Processing ”, IJAST, vol. 29, no. 08, pp. 5988 - 6003, Nov. 2.

Martinez, M., Davies, C., Garcia, J., Castro, J., & Martinez, J. Machine Learning-Enabled Quality Control in Engineering Manufacturing. Kuwait Journal of Machine Learning, 1(2). Retrieved from http://kuwaitjournals.com/index.php/kjml/article/view/122

Thota, D. S. ., Sangeetha, D. M., & Raj , R. . (2022). Breast Cancer Detection by Feature Extraction and Classification Using Deep Learning Architectures. Research Journal of Computer Systems and Engineering, 3(1), 90–94. Retrieved from https://technicaljournals.org/RJCSE/index.php/journal/article/view/48

Downloads

Published

25.12.2023

How to Cite

Nanade , A. ., & Kumar, A. . (2023). Combating Fake News on Twitter: A Machine Learning Approach for Detection and Classification of Fake Tweets. International Journal of Intelligent Systems and Applications in Engineering, 12(1), 424–436. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3917

Issue

Section

Research Article

Most read articles by the same author(s)