An Approach towards Fake News Detection using Machine Learning Techniques

Authors

  • Vyankatesh Rampurkar, Thirupurasundari D.R.

Keywords:

Confusion Matrix, ISOT Dataset, Machine Learning, Navie Bayes, Logistic Regression

Abstract

In the digital age, the spread of false information has become a widespread and difficult problem. The Naive Bayes & logistic regression algorithms are used in this paper to provide a novel methodology for the detection of bogus news stories. The aim of this study is to improve the efficacy of the identification of fake news in digital material, consequently fostering information credibility and integrity within the digital ecosystem. We start this investigation by gathering a wide dataset of news articles from both reputable and phoney sources. We preprocess the textual input using techniques like tokenization, stop-word removal, and stemming to aid in feature extraction. During the feature selection phase, the term frequency-inverse document frequency (TF-IDF) is used to estimate the word importance of each article. Next, the Naive Bayes algorithm is used to divide news stories into two groups: phoney and real. In order to determine the probability that an article will fall into a particular category, Naive Bayes uses a probabilistic technique under the assumption that the characteristics (words) are conditionally independent. Logistic Regression models the probability of a news article being fake or genuine based on a set of relevant textual features. The primary goal of logistic regression is to achieve high accuracy in classifying news articles as fake or genuine, with an emphasis on feature engineering and model evaluation. The efficacy of the corresponding methods is determined by utilizing the confusion matrix to evaluate the correctness of the model. The findings suggest that Logistic Regression is effective in detecting fake news and contributes to the trustworthiness of information sources in the digital age.

Downloads

Download data is not yet available.

References

G. Aceto, D. Ciuonzo, A. Montieri, and A. Pescapè, ‘‘MIMETIC: Mobile encrypted traffic classification using multimodal deep learning,’’ Comput. Netw., vol. 165, Dec. 2019, Art. no. 106944.

G. E. R. Agudelo, O. J. S. Parra, and J. B. Velandia, ‘‘raising a model for fake news detection using machine learning in Python,’’ in Proc. Conf. e- Bus., e-Services e-Soc. Cham, Switzerland: Springer, 2018, pp. 596–604..

H. Ahmed, I. Traore, and S. Saad, ‘‘Detecting opinion spams and fake news using text classification, ’’ Secur. Privacy, vol. 1, no. 1, p. e9, Jan. 2018.

O. Ajao, D. Bhowmik, and S. Zargari, ‘‘Fake news identification on Twitter with hybrid CNN and RNN models, ’’ in Proc. 9th Int. Conf. Social Media Soc., Jul. 2018, pp. 226–230.

M. Amjad, G. Sidorov, A. Zhila, H. Gómez-Adorno, I. Voronkov, and A. Gelbukh, ‘‘‘Bend the truth’: Benchmark dataset for fake news detection in urdu language and its evaluation, ’’ J. Intell. Fuzzy Syst., vol. 39, no. 2, pp. 2457–2469, 2020.

S. D. Bhattacharjee, A. Talukder, and B. V. Balantrapu, ‘‘Active learning based news veracity detection with feature weighting and deep-shallow fusion, ’’ in Proc. IEEE Int. Conf. Big Data (Big Data), Dec. 2017, pp. 556–565.

C.-C. Chang and C.-J. Lin, ‘‘LIBSVM: A library for support vector machines, ’’ ACM Trans. Intell. Syst. Technol., vol. 2, no. 3, p. 27, 2011.

H.-L. Chen, B. Yang, J. Liu, and D.-Y. Liu, ‘‘A support vector machine classifier with rough set-based feature selection for breast cancer diagno- sis, ’’ Expert Syst. Appl., vol. 38, no. 7, pp. 9014–9022, Jul. 2011.

Kakarwal, Sangeeta, and Pradip Paithane. "Automatic pancreas segmentation using ResNet-18 deep learning approach." System research and information technologies 2 (2022): 104-116.

N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernelbased Learning Methods. Cambridge, U.K.: Cambridge Univ. Press, 2000.

M. L. Della Vedova, E. Tacchini, S. Moret, G. Ballarin, M. DiPierro, and L. de Alfaro, ‘‘Automatic online fake news detection combining content and social signals, ’’ in Proc. 22nd Conf. Open Innov. Assoc. (FRUCT), May 2018, pp. 272–279.

A. L. Edwards, ‘‘Note on the ‘correction for continuity’ in testing the significance of the difference between correlated proportions, ’’ Psychome- trika, vol. 13, no. 3, pp. 185–187, 1948.

P. H. A. Faustini and T. F. Covões, ‘‘Fake news detection in multi- ple platforms and languages, ’’ Expert Syst. Appl., vol. 158, Nov. 2020, Art. no. 113503.

S. Gilda, ‘‘Evaluating machine learning algorithms for fake news detection, ’’ in Proc. IEEE 15th Student Conf. Res. Develop. (SCOReD), Dec. 2017, pp. 110–115.

M. H. Goldani, S. Momtazi, and R. Safabakhsh, ‘‘Detecting fake news with capsule neural networks,’’ 2020, arXiv:2002.01030.

Paithane, Pradip Mukundrao. "Yoga Posture Detection Using Machine Learning." Artificial Intelligence in Information and Communication Technologies, Healthcare and Education: A Roadmap Ahead 27 (2022).

A. U. Haq, J. Li, M. H. Memon, J. Khan, S. U. Din, I. Ahad, R. Sun, and Z. Lai, ‘‘Comparative analysis of the classification performance of machine learning classifiers and deep neural network classifier for predic- tion of Parkinson disease,’’ in Proc. 15th Int. Comput. Conf. Wavelet Act. Media Technol. Inf. Process. (ICCWAMTIP), Dec. 2018, pp. 101–106

Paithane, Pradip, Sarita Jibhau Wagh, and Sangeeta Kakarwal. "Optimization of route distance using k-NN algorithm for on-demand food delivery." System research and information technologies 1 (2023): 85-101.

A. Jain, A. Shakya, H. Khatter, and A. K. Gupta, ‘‘A smart system for fake news detection using machine learning,’’ in Proc. Int. Conf. Issues Challenges Intell. Comput. Techn. (ICICT), vol. 1, Sep. 2019, pp. 1–4.

R. K. Kaliyar, A. Goswami, and P. Narang, ‘‘Multiclass fake news detection using ensemble machine learning,’’ in Proc. IEEE 9th Int. Conf. Adv. Comput. (IACC), Dec. 2019, pp. 103–107.

R. K. Kaliyar, A. Goswami, P. Narang, and S. Sinha, ‘‘FNDNet—A deep convolutional neural network for fake news detection, ’’ Cognit. Syst. Res., vol. 61, pp. 32–44, Jun. 2020

S. Kula, M. Choraś, R. Kozik, P. Ksieniewicz, and M. Woźniak, ‘‘Sentiment analysis for fake news detection by means of neural networks, ’’ in Proc. Int. Conf. Comput. Sci. Cham, Switzerland: Springer, 2020, pp. 653–666.

S. Kumar, R. Asthana, S. Upadhyay, N. Upreti, and M. Akbar, ‘‘Fake news detection using deep learning models: A novel approach, ’’ Trans. Emerg. Telecommun. Technol., vol. 31, no. 2, p. e3767, Feb. 2020

Y. Long, ‘‘Fake news detection through multi-perspective speaker profiles, version I17-2043,’’ Assoc. Comput. Linguistics, Strouds- burg, PA, USA, Tech. Rep., Nov. 2017, vol. 2. [Online].

W. Y. Wang, ‘‘‘Liar, liar pants on fire’: A new benchmark dataset for fake news detection,’’ 2017, arXiv: 1705.00648.

Downloads

Published

24.03.2024

How to Cite

Vyankatesh Rampurkar. (2024). An Approach towards Fake News Detection using Machine Learning Techniques. International Journal of Intelligent Systems and Applications in Engineering, 12(3), 2868–2874. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/5797

Issue

Section

Research Article