Social Media Sentiment Analysis with Multi-Token Concatenated Embedding and Semantixpert Probabilistic Classifier for Fake News Detection

Authors

  • B. Hemalatha, M. Soranamageswari

Keywords:

Byte Pair Encoding, Sentiment analysis, polarity prediction, fake news detection, social media.

Abstract

In the field of false news identification, sentiment analysis is an effective strategy for enhancing detection accuracy. Sentiment analysis is critical for understanding user emotions and intents, but traditional methods frequently neglect the complex interplay between text and emojis, particularly in recognizing sarcasm. To address this limitation, a novel approach, the "Multi-Token Concatenated Embedding and SemantiXpert Probabilistic Classifier" is proposed which is aimed at improving sarcasm detection, polarity prediction accuracy, and overall sentiment prediction accuracy. The existing sentiment analysis approaches fail to capture the indirect interaction between text and emojis, resulting in the loss of crucial sentiment cues and misunderstanding of sarcasm in news content. Hence, Multi-Token Concatenated Embedding approach is introduced, which includes Multi-Run Byte Pair Encoding (MR-BPE) to capture sub-word patterns and the Concat-ViLT model to encapsulate the textual representation of emojis, thus enhancing the model's ability to understand the multimodal context of text-emoji pairs. Moreover, existing semantic classification methods are based on lexical patterns and mean-zero feature assumptions, which leads to misclassifications and lower accuracy in recognizing the genuine emotional tone of mixed sentiment comments. For semantic classification, a SemantiXpert Probabilistic Classifier is proposed which includes a Contrastive Semantic Clause Filter Network to understand user intentions and domain context and a Polarized Probabilistic Classifier is utilized to enhance the polarity prediction thereby reducing misclassifications and improving accuracy in sentiment prediction based fake news detection. As a result, the proposed model outperforms existing methods, achieving higher accuracy, sensitivity, and AUC values.

Downloads

Download data is not yet available.

References

M. Swartz and A. Crooks, “Comparison of emoji use in names, profiles, and tweets,” In 2020 IEEE 14th International Conference on Semantic Computing (ICSC), pp. 375-380, 2020, February. IEEE.

Alrumaih, A. Al-Sabbagh, R. Alsabah, H. Kharrufa and J. Baldwin, “Sentiment analysis of comments in social media,” International Journal of Electrical & Computer Engineering, vol. 10, no. 6, pp. 2088-8708, 2020.

Debnath, N. Pinnaparaju, M. Shrivastava, V. Varma and I. Augenstein, “Semantic textual similarity of sentences with emojis,” In Companion Proceedings of the Web Conference 2020, pp. 426-430, 2020, April.

Surikov and E. Egorova, “Alternative method sentiment analysis using emojis and emoticons,” Procedia Computer Science, vol. 178, pp. 182-193, 2020.

S. Gupta, O. Garg, R. Mehrotra and A. Singh, “Social media anatomy of text and emoji in expressions,” In Smart Computing Techniques and Applications: Proceedings of the Fourth International Conference on Smart Computing and Informatics, vol. 2, pp. 41-49, 2021. Springer Singapore.

M. Krommyda, A. Rigos, K. Bouklas and A. Amditis, “An experimental analysis of data annotation methodologies for emotion detection in short text posted on social media,” In Informatics, vol. 8, no. 1, pp. 19, 2021, March. MDPI.

F. Benrouba and R. Boudour, “Emotional sentiment analysis of social media content for mental health safety,” Social Network Analysis and Mining, vol. 13, no. 1, pp. 17, 2023.

S. Al-Azani and E.S.M. El-Alfy, “Early and late fusion of emojis and text to enhance opinion mining,” IEEE Access, vol. 9, pp. 121031-121045, 2021.

N.V. Babu and E.G.M. Kanaga, “Sentiment analysis in social media data for depression detection using artificial intelligence: a review,” SN Computer Science, vol. 3, pp. 1-20, 2022.

T.P. Kumar and B.V. Vardhan, “A Pragmatic Approach to Emoji based Multimodal Sentiment Analysis using Deep Neural Networks,” Journal of Algebraic Statistics, vol. 13, no. 1, pp. 473-482, 2022.

Z. Ahanin and M.A. Ismail, “Feature extraction based on fuzzy clustering and emoji embeddings for emotion classification,” International Journal of Technology Management and Information System, vol. 2, no. 1, pp. 102-112, 2020.

A. Yoo and J.T. Rayz, “Understanding emojis for sentiment analysis,” In The International FLAIRS Conference Proceedings, vol. 34, 2021, April.

M.A. Ullah, S.M. Marium, S.A. Begum and N.S. Dipa, “An algorithm and method for sentiment analysis using the text and emoticon,” ICT Express, vol. 6, no. 4, pp. 357-360, 2020.

L. Holthoff, “The Emoji Sentiment Lexicon: Analysing Consumer Emotions in Social Media Communication,” In Proceedings of the 49th European Marketing Academy (EMAC) Annual Conference, 2020, May.

S. Gupta, R. Singh and V. Singla, “Emoticon and text sarcasm detection in sentiment analysis,” In First International Conference on Sustainable Technologies for Computational Intelligence: Proceedings of ICTSCI 2019, pp. 1-10, 2020. Springer Singapore.

A.S. Talaat, “Sentiment analysis classification system using hybrid BERT models,” Journal of Big Data, vol. 10, no. 1, pp. 1-18, 2023.

L. Bryan-Smith, J. Godsall, F. George, K. Egode, N. Dethlefs and D. Parsons, “Real-time social media sentiment analysis for rapid impact assessment of floods,” Computers & Geosciences, vol. 178, pp. 105405, 2023.

Liu, F. Fang, X. Lin, T. Cai, X. Tan, J. Liu and X. Lu, “Improving sentiment analysis accuracy with emoji embedding,” Journal of Safety Science and Resilience, vol. 2, no. 4, pp. 246-252, 2021.

M. Shahzad, C. Freeman, M. Rahimi and H. Alhoori, “Predicting Facebook sentiments towards research,” Natural Language Processing Journal, vol. 3, pp. 100010, 2023.

S. Gupta, A. Singh and V. Kumar, “Emoji, Text, and Sentiment Polarity Detection Using Natural Language Processing,” Information, vol. 14, no. 4, pp. 222, 2023.

Alsayat, “Improving sentiment analysis for social media applications using an ensemble deep learning language model,” Arabian Journal for Science and Engineering, vol. 47, no. 2, pp. 2499-2511, 2022.

A.R. Pathak, M. Pandey and S. Rautaray, “Topic-level sentiment analysis of social media data using deep learning,” Applied Soft Computing, vol. 108, pp. 107440, 2021.

B.S.W. Bagus Satria Wiguna, C.V.H. Cinthia Vairra Hudiyanti, A.A.R. Alqis Alqis Rausanfita and A.Z.A. Agus Zainal Arifin, “Sarcasm detection engine for twitter sentiment analysis using textual and emoji feature,” Jurnal Ilmu Komputer dan Informasi, vol. 14, no. 1, pp. 1-8, 2021.

Q.A. Xu, C. Jayne and V. Chang, “An emoji feature-incorporated multi-view deep learning for explainable sentiment classification of social media reviews,” Technological Forecasting and Social Change, vol. 202, pp. 123326, 2024.

K.E. Naresh Kumar and V. Uma, “Intelligent sentinet-based lexicon for context-aware sentiment analysis: optimized neural network for sentiment classification on social media,” The Journal of Supercomputing, vol. 77, no. 11, pp. 12801-12825, 2021.

N. Shelke, S. Chaudhury, S. Chakrabarti, S.L. Bangare, G. Yogapriya and P. Pandey, “An efficient way of text-based emotion analysis from social media using LRA-DNN,” Neuroscience Informatics, vol. 2, no. 3, pp. 100048, 2022.

R.A. Potamias, G. Siolas and A.G. Stafylopatis, “A transformer-based approach to irony and sarcasm detection,” Neural Computing and Applications, vol. 32, no. 23, pp. 17309-17320, 2020.

M. Asif, A. Ishtiaq, H. Ahmad, H. Aljuaid and J. Shah, “Sentiment analysis of extremism in social media from textual information,” Telematics and Informatics, vol. 48, pp. 101345, 2020.

Downloads

Published

12.06.2024

How to Cite

B. Hemalatha. (2024). Social Media Sentiment Analysis with Multi-Token Concatenated Embedding and Semantixpert Probabilistic Classifier for Fake News Detection. International Journal of Intelligent Systems and Applications in Engineering, 12(4), 3639 –. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/6908

Issue

Section

Research Article