An Approach towards Fake News Detection using Machine Learning Techniques
Keywords:
Confusion Matrix, ISOT Dataset, Machine Learning, Navie Bayes, Logistic RegressionAbstract
In the digital age, the spread of false information has become a widespread and difficult problem. The Naive Bayes & logistic regression algorithms are used in this paper to provide a novel methodology for the detection of bogus news stories. The aim of this study is to improve the efficacy of the identification of fake news in digital material, consequently fostering information credibility and integrity within the digital ecosystem. We start this investigation by gathering a wide dataset of news articles from both reputable and phoney sources. We preprocess the textual input using techniques like tokenization, stop-word removal, and stemming to aid in feature extraction. During the feature selection phase, the term frequency-inverse document frequency (TF-IDF) is used to estimate the word importance of each article. Next, the Naive Bayes algorithm is used to divide news stories into two groups: phoney and real. In order to determine the probability that an article will fall into a particular category, Naive Bayes uses a probabilistic technique under the assumption that the characteristics (words) are conditionally independent. Logistic Regression models the probability of a news article being fake or genuine based on a set of relevant textual features. The primary goal of logistic regression is to achieve high accuracy in classifying news articles as fake or genuine, with an emphasis on feature engineering and model evaluation. The efficacy of the corresponding methods is determined by utilizing the confusion matrix to evaluate the correctness of the model. The findings suggest that Logistic Regression is effective in detecting fake news and contributes to the trustworthiness of information sources in the digital age.
Downloads
References
G. Aceto, D. Ciuonzo, A. Montieri, and A. Pescapè, ‘‘MIMETIC: Mobile encrypted traffic classification using multimodal deep learning,’’ Comput. Netw., vol. 165, Dec. 2019, Art. no. 106944.
G. E. R. Agudelo, O. J. S. Parra, and J. B. Velandia, ‘‘raising a model for fake news detection using machine learning in Python,’’ in Proc. Conf. e- Bus., e-Services e-Soc. Cham, Switzerland: Springer, 2018, pp. 596–604..
H. Ahmed, I. Traore, and S. Saad, ‘‘Detecting opinion spams and fake news using text classification, ’’ Secur. Privacy, vol. 1, no. 1, p. e9, Jan. 2018.
O. Ajao, D. Bhowmik, and S. Zargari, ‘‘Fake news identification on Twitter with hybrid CNN and RNN models, ’’ in Proc. 9th Int. Conf. Social Media Soc., Jul. 2018, pp. 226–230.
M. Amjad, G. Sidorov, A. Zhila, H. Gómez-Adorno, I. Voronkov, and A. Gelbukh, ‘‘‘Bend the truth’: Benchmark dataset for fake news detection in urdu language and its evaluation, ’’ J. Intell. Fuzzy Syst., vol. 39, no. 2, pp. 2457–2469, 2020.
S. D. Bhattacharjee, A. Talukder, and B. V. Balantrapu, ‘‘Active learning based news veracity detection with feature weighting and deep-shallow fusion, ’’ in Proc. IEEE Int. Conf. Big Data (Big Data), Dec. 2017, pp. 556–565.
C.-C. Chang and C.-J. Lin, ‘‘LIBSVM: A library for support vector machines, ’’ ACM Trans. Intell. Syst. Technol., vol. 2, no. 3, p. 27, 2011.
H.-L. Chen, B. Yang, J. Liu, and D.-Y. Liu, ‘‘A support vector machine classifier with rough set-based feature selection for breast cancer diagno- sis, ’’ Expert Syst. Appl., vol. 38, no. 7, pp. 9014–9022, Jul. 2011.
Kakarwal, Sangeeta, and Pradip Paithane. "Automatic pancreas segmentation using ResNet-18 deep learning approach." System research and information technologies 2 (2022): 104-116.
N. Cristianini and J. Shawe-Taylor, An Introduction to Support Vector Machines and Other Kernelbased Learning Methods. Cambridge, U.K.: Cambridge Univ. Press, 2000.
M. L. Della Vedova, E. Tacchini, S. Moret, G. Ballarin, M. DiPierro, and L. de Alfaro, ‘‘Automatic online fake news detection combining content and social signals, ’’ in Proc. 22nd Conf. Open Innov. Assoc. (FRUCT), May 2018, pp. 272–279.
A. L. Edwards, ‘‘Note on the ‘correction for continuity’ in testing the significance of the difference between correlated proportions, ’’ Psychome- trika, vol. 13, no. 3, pp. 185–187, 1948.
P. H. A. Faustini and T. F. Covões, ‘‘Fake news detection in multi- ple platforms and languages, ’’ Expert Syst. Appl., vol. 158, Nov. 2020, Art. no. 113503.
S. Gilda, ‘‘Evaluating machine learning algorithms for fake news detection, ’’ in Proc. IEEE 15th Student Conf. Res. Develop. (SCOReD), Dec. 2017, pp. 110–115.
M. H. Goldani, S. Momtazi, and R. Safabakhsh, ‘‘Detecting fake news with capsule neural networks,’’ 2020, arXiv:2002.01030.
Paithane, Pradip Mukundrao. "Yoga Posture Detection Using Machine Learning." Artificial Intelligence in Information and Communication Technologies, Healthcare and Education: A Roadmap Ahead 27 (2022).
A. U. Haq, J. Li, M. H. Memon, J. Khan, S. U. Din, I. Ahad, R. Sun, and Z. Lai, ‘‘Comparative analysis of the classification performance of machine learning classifiers and deep neural network classifier for predic- tion of Parkinson disease,’’ in Proc. 15th Int. Comput. Conf. Wavelet Act. Media Technol. Inf. Process. (ICCWAMTIP), Dec. 2018, pp. 101–106
Paithane, Pradip, Sarita Jibhau Wagh, and Sangeeta Kakarwal. "Optimization of route distance using k-NN algorithm for on-demand food delivery." System research and information technologies 1 (2023): 85-101.
A. Jain, A. Shakya, H. Khatter, and A. K. Gupta, ‘‘A smart system for fake news detection using machine learning,’’ in Proc. Int. Conf. Issues Challenges Intell. Comput. Techn. (ICICT), vol. 1, Sep. 2019, pp. 1–4.
R. K. Kaliyar, A. Goswami, and P. Narang, ‘‘Multiclass fake news detection using ensemble machine learning,’’ in Proc. IEEE 9th Int. Conf. Adv. Comput. (IACC), Dec. 2019, pp. 103–107.
R. K. Kaliyar, A. Goswami, P. Narang, and S. Sinha, ‘‘FNDNet—A deep convolutional neural network for fake news detection, ’’ Cognit. Syst. Res., vol. 61, pp. 32–44, Jun. 2020
S. Kula, M. Choraś, R. Kozik, P. Ksieniewicz, and M. Woźniak, ‘‘Sentiment analysis for fake news detection by means of neural networks, ’’ in Proc. Int. Conf. Comput. Sci. Cham, Switzerland: Springer, 2020, pp. 653–666.
S. Kumar, R. Asthana, S. Upadhyay, N. Upreti, and M. Akbar, ‘‘Fake news detection using deep learning models: A novel approach, ’’ Trans. Emerg. Telecommun. Technol., vol. 31, no. 2, p. e3767, Feb. 2020
Y. Long, ‘‘Fake news detection through multi-perspective speaker profiles, version I17-2043,’’ Assoc. Comput. Linguistics, Strouds- burg, PA, USA, Tech. Rep., Nov. 2017, vol. 2. [Online].
W. Y. Wang, ‘‘‘Liar, liar pants on fire’: A new benchmark dataset for fake news detection,’’ 2017, arXiv: 1705.00648.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.