Designing A Model for Fake News Detection in Social Media Using Machine Learning Techniques

A Kumari  Shalini; Sameer  Saxena; B Suresh  Kumar

Authors

A Kumari Shalini Research Scholar, Amity University, Jaipur
Sameer Saxena Associate Professor, Amity University, Jaipur
B Suresh Kumar Associate Professor, Sanjay Ghodawat University, Kolhapur

Keywords:

Social media, fake news detection, machine learning, recurrent neural network, feature extraction and selection, classification

Abstract

Fake identity is a critical problem nowadays of social media. Fake news is rapidly spread by fake identities and bots that generate the trustworthiness issue on social media. Identifying the profiles and accounts using soft computing algorithms is necessary to improve the trustworthiness of social media. In this work, we proposed Recurrent Neural Network to identify fake identities on social media. Initially, we extract data from social media such as Twitter.com using Twitter API. Hybrid feature extraction has been done based on the characteristics of data. It generates training rules which are associated with a fake and legitimate profiles generated by a human. In pre-processing and filtration process, all bot entries are eliminated using a policy-based approach. To generate strict rules that improve the classification accuracy, the training of dataset primarily focuses on attributes such as friends count, the total number of followers, tweet counts, re-tweets count, etc., The Recurrent Neural Network (RNN) categorizes each profile based on training and testing modules. This work focuses on classifying bots or human entries according to their extracted features using machine learning. Once the training phase is completed, features are extracted from the dataset based on the term frequency on which the classification technique is applied. The proposed work is very effective in detecting malicious accounts from an imbalanced dataset in social media. The system provides maximum accuracy for the classification of fake and real identities on the social media dataset. It achieves good accuracy with Recurrent Neural Network (RNN) using the different activation functions. The system improves the classification accuracy with the increase in the number of folds in cross-validation. In experiment analysis, we have done testing on synthetic and real-time social media datasets; We achieve around 96% accuracy on the real-time Twitter dataset while 98% accuracy on synthetic social media datasets.

Downloads

Download data is not yet available.

References

Van Der Walt, Estée, and Jan Eloff. "Using machine learning to detect fake identities: bots vs humans." IEEE Access 6 (2018): 6540-6549.

Shu, Kai, et al. "The role of user profiles for fake news detection." Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining. 2019.

Islam, Mohammad Raihanul, Sathappan Muthiah, and Naren Ramakrishnan. "RumorSleuth: joint detection of rumor veracity and user stance." 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM). IEEE, 2019.

Al Asad, Nafiz, et al. "Depression Detection by Analyzing Social Media Posts of User." 2019 IEEE International Conference on Signal Processing, Information, Communication & Systems (SPICSCON). IEEE, 2019.

Li, Neng, et al. "Fake reviews tell no tales? dissecting click farming in content-generated social networks." China Communications 15.4 (2018): 98-109.

Meshram, Shweta, Rajesh Babu, and Jayanth Adhikari. "Detecting Psychological Stress using Machine Learning over Social Media Interaction." 2020 5th International Conference on Communication and Electronics Systems (ICCES). IEEE, 2020.

Zarei, Koosha, Reza Farahbakhsh, and Noël Crespi. "Typification of impersonated accounts on instagram." 2019 IEEE 38th International Performance Computing and Communications Conference (IPCCC). IEEE, 2019.

Elghanuni, Ramzi H., Musab AM Ali, and Marwa B. Swidan. "An Overview of Anomaly Detection for Online Social Network." 2019 IEEE 10th Control and System Graduate Research Colloquium (ICSGRC). IEEE, 2019.

Gheewala, Shivangi, and Rakesh Patel. "Machine learning based Twitter Spam account detection: a review." 2018 Second International Conference on Computing Methodologies and Communication (ICCMC). IEEE, 2018.

Daha, Fatima Zohra, and Saniika Hewavitharana. "Deep neural architecture with character embedding for semantic frame detection." 2019 IEEE 13th International Conference on Semantic Computing (ICSC). IEEE, 2019.

Fazil, Mohd, and Muhammad Abulaish. "A hybrid approach for detecting automated spammers in twitter." IEEE Transactions on Information Forensics and Security 13.11 (2018): 2707-2719.

Aydin, Ilhan, S. E. V. İ. Mehmet, and Mehmet Umut Salur. "Detection of Fake Twitter Accounts with Machine Learning Algorithms." 2018 International Conference on Artificial Intelligence and Data Processing (IDAP). IEEE, 2018.

Sowmya, P., and Madhumita Chatterjee. "Detection of Fake and Clone accounts in Twitter using Classification and Distance Measure Algorithms." 2020 International Conference on Communication and Signal Processing (ICCSP). IEEE, 2020.

Pakaya, Farhan Nurdiatama, Muhammad Okky Ibrohim, and Indra Budi. "Malicious Account Detection on Twitter Based on Tweet Account Features using Machine Learning." 2019 Fourth International Conference on Informatics and Computing (ICIC). IEEE, 2019.

Akyon, Fatih Cagatay, and M. Esat Kalfaoglu. "Instagram Fake and Automated Account Detection." 2019 Innovations in Intelligent Systems and Applications Conference (ASYU). IEEE, 2019.

Designing A Model for Fake News Detection in Social Media Using Machine Learning Techniques

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

Most read articles by the same author(s)

Announcements

Information for Authors

ijisae

Information

trindex