An Approach towards Fake News Detection using Machine Learning Techniques

Authors

Vyankatesh Rampurkar, Thirupurasundari D.R.

Keywords:

Confusion Matrix, ISOT Dataset, Machine Learning, Naive Bayes, Logistic Regression

Abstract

In the digital age, the spread of false information has become a widespread and difficult problem. This paper presents a methodology for detecting fake news stories using the Naive Bayes and Logistic Regression algorithms. The aim of the study is to improve the identification of fake news in digital material, and thereby to foster the credibility and integrity of information within the digital ecosystem. We begin by gathering a broad dataset of news articles from both reputable and unreliable sources. We preprocess the text using tokenization, stop-word removal, and stemming to aid feature extraction. During the feature selection phase, term frequency-inverse document frequency (TF-IDF) is used to estimate the importance of each word in each article. Next, the Naive Bayes algorithm is used to classify news stories into two groups: fake and real. Naive Bayes is a probabilistic technique: it estimates the probability that an article belongs to each category under the assumption that the features (words) are conditionally independent given the category. Logistic Regression models the probability that a news article is fake or genuine from a set of relevant textual features. For this model the goal is high classification accuracy, with an emphasis on feature engineering and model evaluation. The effectiveness of both methods is assessed by using the confusion matrix to evaluate the correctness of each model. The findings suggest that Logistic Regression is effective in detecting fake news and contributes to the trustworthiness of information sources in the digital age.
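The pipeline described in the abstract (tokenization, stop-word removal, TF-IDF weighting, Naive Bayes classification, confusion-matrix evaluation) can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation. The stop-word list, the six headlines, and all function names are invented for illustration and do not come from the ISOT dataset. Stemming is omitted for brevity. Feeding TF-IDF weights to a multinomial Naive Bayes model as fractional counts is a common practical choice, assumed here rather than taken from the paper. A Logistic Regression classifier could be trained on the same TF-IDF vectors but is not shown.

```python
import math
from collections import Counter

# Toy stop-word list and headlines: invented for illustration only.
STOP_WORDS = {"a", "an", "the", "is", "in", "of", "on", "to", "and", "after"}
TRAIN = [
    ("Shocking miracle cure doctors hate", "fake"),
    ("Aliens secretly control the government, shocking proof", "fake"),
    ("Parliament passes budget after debate", "real"),
    ("Central bank raises interest rates", "real"),
]
TEST = [
    ("Shocking proof of miracle aliens", "fake"),
    ("Bank debate on budget rates", "real"),
]

def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop stop words."""
    words = (w.strip(".,!?;:\"'") for w in text.lower().split())
    return [w for w in words if w and w not in STOP_WORDS]

def fit_idf(docs):
    """Smoothed inverse document frequency for every term seen in training."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {t: math.log((1 + n) / (1 + c)) + 1 for t, c in df.items()}

def tfidf(doc, idf):
    """Map a tokenized document to {term: tf * idf}; unseen terms are ignored."""
    return {t: c * idf[t] for t, c in Counter(doc).items() if t in idf}

def train_nb(vectors, labels, vocab, alpha=1.0):
    """Multinomial Naive Bayes on TF-IDF weights with Laplace smoothing."""
    priors, log_like = {}, {}
    for cls in sorted(set(labels)):
        rows = [v for v, y in zip(vectors, labels) if y == cls]
        priors[cls] = math.log(len(rows) / len(vectors))
        totals = dict.fromkeys(vocab, alpha)  # every term starts at alpha
        for row in rows:
            for term, weight in row.items():
                totals[term] += weight
        denom = sum(totals.values())
        log_like[cls] = {t: math.log(v / denom) for t, v in totals.items()}
    return priors, log_like

def predict_nb(vector, priors, log_like):
    """Pick the class with the highest log prior plus weighted log likelihood."""
    scores = {
        cls: priors[cls] + sum(w * log_like[cls][t] for t, w in vector.items())
        for cls in priors
    }
    return max(scores, key=scores.get)

def confusion_matrix(y_true, y_pred, positive="fake"):
    """Count true/false positives and negatives, treating `positive` as the target."""
    cm = {"tp": 0, "fp": 0, "fn": 0, "tn": 0}
    for truth, pred in zip(y_true, y_pred):
        if pred == positive:
            cm["tp" if truth == positive else "fp"] += 1
        else:
            cm["fn" if truth == positive else "tn"] += 1
    return cm

def run_demo():
    """Train on TRAIN, classify TEST, and return the confusion matrix."""
    train_docs = [tokenize(text) for text, _ in TRAIN]
    labels = [label for _, label in TRAIN]
    idf = fit_idf(train_docs)
    vectors = [tfidf(doc, idf) for doc in train_docs]
    priors, log_like = train_nb(vectors, labels, idf.keys())
    preds = [predict_nb(tfidf(tokenize(text), idf), priors, log_like)
             for text, _ in TEST]
    return confusion_matrix([label for _, label in TEST], preds)

if __name__ == "__main__":
    print(run_demo())
```

On this toy data both held-out headlines are classified correctly, so the confusion matrix has one true positive and one true negative. Accuracy follows as (tp + tn) divided by the total count. Such a small example says nothing about performance on real news data.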
