Sentiment Analysis on Omicron Tweets Using Hybrid Classifiers with Multiple Feature Extraction Techniques and Transformer Based Models

Authors

  • Rakesh Kumar Godi Department of Computer Science and Engineering, Manipal Institute of Technology, Manipal Academy of Higher Education (MAHE), Bengaluru, India
  • Mule Shrishail Basvant Associate Professor, Department of Electronics & Telecommunication Engineering, Sinhgad College of Engineering, Pune-41
  • A. Deepak Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, Tamilnadu
  • Arun Pratap Srivastava Lloyd Institute of Engineering & Technology, Greater Noida
  • Manoj Kumar T. Associate Professor, St Thomas College of Engineering and Technology Chengannur Kerala, 689521
  • Akhil Sankhyan Lloyd Law College, Greater Noida
  • Anurag Shrivastava Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, Tamilnadu

Keywords:

Sentiment analysis, Omicron, Twitter analysis, NLP, Big Data, TextBlob, Machine learning, Deep learning, hybrid classifiers, TF-IDF, Word2Vec, GloVe, FastText, BERT, RoBERTa

Abstract

Since the beginning of Covid-19, the world has been in a dilemma to cope up with its effects. With time the coronavirus has evolved into variants that caused a lot of destruction to human race. One such variant is “Omicron”. This variant made its presence in many countries throughout the world. The government is left in a straining situation to curb the spread of this variant and to stop the evolution of coronavirus. Though the strict precautions were exercised, the evolution was unstoppable. To understand the thoughts and feelings of the public, twitter  can be considered as one of the best platforms for sentiment analysis. Analyzing the sentiments of people across the continents is horridly difficult but with the way technology has been making advancement in the world, analyzing has become a quiet easy job. In the existing studies on Covid-19, various word embedding techniques with machine learning and deep learning classifiers has been used for the analysis. Language based models have proven to achieve higher accuracy for sentiment analysis. Amidst these hybrid classifiers, have performed tremendously good. In the proposed work, seven Machine Learning hybrid classifiers are compared with four single classifiers using TF-IDF and Word2Vec. A proposed Deep Learning hybrid classifier is compared with two single classifiers using GloVe and FastText. Furthermore, language models like BERT and RoBERTa are employed in an effort to boost validation outcomes upto 93.39% and 93.47%.

Downloads

Download data is not yet available.

References

He, X., Hong, W., Pan, X., Lu, G., & Wei, X. (2021). SARS‐Cov‐2 omicron variant: Characteristics and prevention.MedComm, 2(4), 838-845. https://doi.org/10.1002/mco2.110

Hosgurmath, S., Petli, V., & Jalihal, V. (2022). An omicron variant tweeter sentiment analysis using NLP technique.Global Trasitions Proceedings, 3(1), 215-219. https://doi.org/10.1016/j.gltp.2022.03.025

Ghaderzadeh, M., Eshraghi, M. A., Asadi, F., Hosseini, A., Jafari, R., Bashash, D., & Abolghasemi, H. (2022). Efficient framework for detection of COVID-19 omicron and delta variants based on two intelligent phases of CNN models. Computational and Mathematical Methods in Medicine, 2022, 1- 10. https://doi.org/10.1155/2022/4838009

Thakur, N., & Han, C. Y. (2022). An exploratory study of tweets about the SARS-Cov-2 omicron variant: Insights from sentiment analysis, language interpretation, source tracking, type classification, and embedded URL detection. https://doi.org/10.20944/preprints202205.0238.v1

Alsafi, R. T. (2022). Lessons from SARS-Cov, MERS-Cov, and SARS-Cov-2 infections: What we know so far.Canadian Journal of Infectious Diseases and Medical Microbiology, 2022, 1- 13. https://doi.org/10.1155/2022/1156273

Ayris, D., Imtiaz, M., Horbury, K., Williams, B., Blackney, M., Hui See, C. S., & Shah, S. A. (2022). Novel deep learning approach to model and predict the spread of COVID-19. Intelligent Systems with Applications, 14, 200068. https://doi.org/10.1016/j.iswa.2022.200068

Reveilhac, M. (2022). The deployment of social media by political authorities and health experts to enhance public information during the COVID-19 pandemic. SSM - Population Health, 19, 101165. https://doi.org/10.1016/j.ssmph.2022.101165

Yaqub, U. (2021). Tweeting during the COVID-19 pandemic. Digital Government: Research and Practice,2(1), 1-7. https://doi.org/10.1145/3428090

Singh, H., Dahiya, N., Yadav, M., & Sehrawat, N. (2022). Emergence of SARS-Cov-2 new variants and their clinical significance. Canadian Journal of Infectious Diseases and Medical Microbiology, 2022, 1- 8. https://doi.org/10.1155/2022/7336309

Kaur, H., Ahsaan, S. U., Alankar, B., & Chang, V. (2021). A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Information Systems Frontiers, 23(6), 1417- 1429. https://doi.org/10.1007/s10796-021- 10135-7

Singh, M., Jakhar, A. K., & Pandey, S. (2021). Sentiment analysis on the impact of coronavirus in social life using the BERT model. Social Network Analysis and Mining, 11(1). https://doi.org/10.1007/s13278-021-00737-z

Zhu, Z. (2022). Deep learning for Chinese language sentiment extraction and analysis. Mathematical Problems in Engineering, 2022, 1-12. https://doi.org/10.1155/2022/8145445

Chen, C., Xu, B., Yang, J., & Liu, M. (2022). Sentiment analysis of animated film reviews using intelligent machine learning. Computational Intelligence and Neuroscience, 2022, 1- 8. https://doi.org/10.1155/2022/8517205

León-Sandoval, E., Zareei, M., Barbosa-Santillán, L. I., & Falcón Morales, L. E. (2022). Measuring the impact of language models in sentiment analysis for Mexico’s COVID-19 pandemic. Electronics, 11(16), 2483. https://doi.org/10.3390/electronics11162483

Choi, Y., Lee, J., & Paek, S. Y. (2022). Public awareness and sentiment toward COVID-19 vaccination in South Korea: Findings from big data analytics. International Journal of Environmental Research and Public Health, 19(16), 9914. https://doi.org/10.3390/ijerph19169914

Shahzad, A., Zafar, B., Ali, N., Jamil, U., Alghadhban, A. J., Assam, M., Ghamry, N. A., & Eldin, E. T. (2022). COVID-19 vaccines related user’s response categorization using machine learning techniques. Computation, 10(8), 141. https://doi.org/10.3390/computation10080141

Yao, Z., Yang, J., Liu, J., Keith, M., & Guan, C. (2021). Comparing tweet sentiments in megacities using machine learning techniques: In the midst of COVID-19. Cities, 116, 103273. https://doi.org/10.1016/j.cities.2021.103273

Gaye, B., Zhang, D., & Wulamu, A. (2021). A tweet sentiment classification approach using a hybridstacked ensemble technique. Information, 12(9), 374. https://doi.org/10.3390/info12090374

Naresh, A., & Venkata Krishna, P. (2020). An efficient approach for sentiment analysis using machinelearning algorithm. Evolutionary Intelligence, 14(2), 725-731. https://doi.org/10.1007/s12065-020-00429-1

Jain, P. K., Saravanan, V., & Pamula, R. (2021). A hybrid CNN-LSTM: A deep learning approach forconsumer sentiment analysis using qualitative user-generated contents. ACM Transactions on Asian and Low Resource Language Information Processing, 20(5), 1-15. https://doi.org/10.1145/3457206

Amin, S., Uddin, M. I., AlSaeed, D. H., Khan, A., & Adnan, M. (2021). Early detection of seasonal outbreaks from Twitter data using machine learning approaches. Complexity, 2021, 1-12. https://doi.org/10.1155/2021/5520366

Al-Hashedi, A., Al-Fuhaidi, B., Mohsen, A. M., Ali, Y., Gamal Al-Kaf, H. A., Al-Sorori, W., &Maqtary, N. (2022). Ensemble classifiers for Arabic sentiment analysis of social network (Twitter data) towards COVID-19-Related conspiracy theories. Applied Computational Intelligence and Soft Computing, 2022, 1- 10. https://doi.org/10.1155/2022/6614730

Shahi, T., Sitaula, C., & Paudel, N. (2022). A hybrid feature extraction method for Nepali COVID-19Related tweets classification. Computational Intelligence and Neuroscience, 2022, 1- 11. https://doi.org/10.1155/2022/5681574

Rodrigues, A. P., Fernandes, R., A, A., B, A., Shetty, A., K, A., Lakshmanna, K., & Shafi, R. M. (2022). Real- time Twitter spam detection and sentiment analysis using machine learning and deep learning techniques. Computational Intelligence and Neuroscience, 2022, 1-14. https://doi.org/10.1155/2022/5211949

A. Bandi and A. Fellah, “Socio-analyzer: A sentiment analysis using social media data,” in Proc. 28th Int. Conf. Softw. Eng. Data Eng., in EPiC Series in Computing, vol. 64, F. Harris, S. Dascalu, S. Sharma, and R. Wu, Eds.Amsterdam, The Netherlands: EasyChair, 2019, pp. 61–67

Naseem, U., Razzak, I., Khushi, M., Eklund, P. W., & Kim, J. (2021). COVIDSenti: A large-scale benchmark Twitter data set for COVID-19 sentiment analysis. IEEE Transactions on Computational Social Systems, 8(4), 1003- 1015. https://doi.org/10.1109/tcss.2021.3051189

U. Naseem, I. Razzak, and P. W. Eklund, “A survey of preprocessing techniques to improve short-text quality: A case study on hate speech detection on Twitter,” in Multimedia Tools and Applications. Springer, Nov. 2020, pp. 1–28. [Online]. Available: https://link. springer.com/article/10.1007/s11042-020-10082-6

Liu, N., & Zhao, J. (2022). A BERT-based aspect-level sentiment analysis algorithm for cross-domain text.Computational Intelligence and Neuroscience, 2022, 1-11. https://doi.org/10.1155/2022/8726621

NLTK :: Natural Language Toolkit

Ghosh, M., & Sanyal, G. (2018). Performance assessment of multiple classifiers based on ensemble feature selection scheme for sentiment analysis. Applied Computational Intelligence and Soft Computing, 2018, 1- 12. https://doi.org/10.1155/2018/8909357

Kim, S., & Gil, J. (2019). Research paper classification systems based on TF-IDF and LDA schemes. Human centric Computing and Information Sciences, 9(1). https://doi.org/10.1186/s13673-019-0192-7

Hasan, M. R., Maliha, M., & Arifuzzaman, M. (2019). Sentiment analysis with NLP on Twitter data. 2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2). https://doi.org/10.1109/ic4me247184.2019.9036670

Al-Saqqa, S., & Awajan, A. (2019). The use of Word2vec model in sentiment analysis. Proceedings of the 2019 International Conference on Artificial Intelligence, Robotics and Control. https://doi.org/10.1145/3388218.3388229

Ni, R., & Cao, H. (2020). Sentiment analysis based on glove and LSTM-GRU. 2020 39th Chinese Control Conference (CCC). https://doi.org/10.23919/ccc50068.2020.9188578

Li, D., He, C., & Chen, M. (2021). Text sentiment analysis based on glove model and United network. Journal of Physics: Conference Series, 1748(3), 032046. https://doi.org/10.1088/1742- 6596/1748/3/032046

Shumaly, S., Yazdinejad, M., & Guo, Y. (2021). Persian sentiment analysis of an online store independent of pre- processing using convolutional neural network with fastText embeddings. PeerJ Computer Science, 7, e422. https://doi.org/10.7717/peerj-cs.422

Joulin, A., Grave, E., Bojanowski, P., & Mikolov, T. (2017). Bag of tricks for efficient text classification. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers. https://doi.org/10.18653/v1/e17-2068

Zhou, Z. G. (2022). Research on sentiment analysis model of short text based on deep learning. Scientific Programming, 2022, 1-7. https://doi.org/10.1155/2022/2681533

Zain, Z. M., & Alturki, N. M. (2021). COVID-19 pandemic forecasting using CNN-LSTM: A hybridapproach.

Journal of Control Science and Engineering, 2021, 1-23. https://doi.org/10.1155/2021/8785636

Alouffi, B., Alharbi, A., Sahal, R., & Saleh, H. (2021). An optimized hybrid deep learning model to detect COVID-

misleading information. Computational Intelligence and Neuroscience, 2021, 1-15. https://doi.org/10.1155/2021/9615034

Lin, H. Y., & Moh, T. (2021). Sentiment analysis on COVID tweets using COVID-Twitter-BERT with auxiliary sentence approach. Proceedings of the 2021 ACM Southeast Conference. https://doi.org/10.1145/3409334.3452074

Bai, Y., Zhang, Y., Xiao, K., Lou, Y., & Sun, K. (2021). A BERT-based approach for extracting prerequisiterelations among Wikipedia concepts. Mathematical Problems in Engineering, 2021, 1- 8. https://doi.org/10.1155/2021/3510402

Rahman, M. M., & Islam, M. N. (2021). Exploring the performance of ensemble machine learningclassifiers for sentiment analysis of COVID-19 tweets. Advances in Intelligent Systems and Computing, 383396. https://doi.org/10.1007/978-981-16-5157-1_30

Divyapushpalakshmi, M., & Ramalakshmi, R. (2021). An efficient sentimental analysis using hybrid deep learning and optimization technique for Twitter using parts of speech (POS) tagging. International Journal of Speech Technology, 24(2), 329-339. https://doi.org/10.1007/s10772-021-09801-7

Dang, C. N., Moreno-García, M. N., & De la Prieta, F. (2021). Hybrid deep learning models for sentimentanalysis.

Complexity, 2021, 1-16. https://doi.org/10.1155/2021/9986920

Kaur, H., Ahsaan, S. U., Alankar, B., & Chang, V. (2021). A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Information Systems Frontiers, 23(6), 1417-1429. https://doi.org/10.1007/s10796-021- 10135-7

Qaid, T. S., Mazaar, H., Al-Shamri, M. Y., Alqahtani, M. S., Raweh, A. A., & Alakwaa, W. (2021). Hybrid deep- learning and machine-learning models for predicting COVID-19. Computational Intelligence and Neuroscience, 2021, 1- 11. https://doi.org/10.1155/2021/9996737

Shrivastava, A., Chakkaravarthy, M., Shah, M.A..A Novel Approach Using Learning Algorithm for Parkinson’s Disease Detection with Handwritten Sketches. In Cybernetics and Systems, 2022

Shrivastava, A., Chakkaravarthy, M., Shah, M.A., A new machine learning method for predicting systolic and diastolic blood pressure using clinical characteristics. In Healthcare Analytics, 2023, 4, 100219

Shrivastava, A., Chakkaravarthy, M., Shah, M.A.,Health Monitoring based Cognitive IoT using Fast Machine Learning Technique. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 720–729

Shrivastava, A., Rajput, N., Rajesh, P., Swarnalatha, S.R., IoT-Based Label Distribution Learning Mechanism for Autism Spectrum Disorder for Healthcare Application. In Practical Artificial Intelligence for Internet of Medical Things: Emerging Trends, Issues, and Challenges, 2023, pp. 305–321

Boina, R., Ganage, D., Chincholkar, Y.D., .Chinthamu, N., Shrivastava, A., Enhancing Intelligence Diagnostic Accuracy Based on Machine Learning Disease Classification. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 765–774

Shrivastava, A., Pundir, S., Sharma, A., ...Kumar, R., Khan, A.K. Control of A Virtual System with Hand Gestures. In Proceedings - 2023 3rd International Conference on Pervasive Computing and Social Networking, ICPCSN 2023, 2023, pp. 1716–1721

Qasim, R., Bangyal, W. H., Alqarni, M. A., & Ali Almazroi, A. (2022). A fine-tuned BERT-based transfer learning approach for text classification. Journal of Healthcare Engineering, 2022, 1- 17. https://doi.org/10.1155/2022/3498123

Liao, W., Zeng, B., Yin, X., & Wei, P. (2020). An improved aspect-category sentiment analysis model for text sentiment analysis based on Roberta. Applied Intelligence, 51(6), 3522-3533. https://doi.org/10.1007/s10489- 020-01964-1

Samuel, J., Rahman, M. M., Ali, G., Esawi, E., & Samuel, Y. (2020). COVID-19 public sentiment insightsand machine learning for tweets classification. https://doi.org/10.31234/osf.io/sw2dn

Wang, H., Sun, K., & Wang, Y. (2022). Exploring the Chinese public’s perception of omicron variants on social media: LDA-based topic modeling and sentiment analysis. International Journal of Environmental Research and Public Health, 19(14), 8377. https://doi.org/10.3390/ijerph19148377

Downloads

Published

07.02.2024

How to Cite

Godi, R. K. ., Basvant, M. S. ., Deepak, A. ., Srivastava, A. P. ., Kumar T., M. ., Sankhyan, A. ., & Shrivastava, A. . (2024). Sentiment Analysis on Omicron Tweets Using Hybrid Classifiers with Multiple Feature Extraction Techniques and Transformer Based Models. International Journal of Intelligent Systems and Applications in Engineering, 12(15s), 257–275. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4741

Issue

Section

Research Article

Most read articles by the same author(s)

1 2 3 4 5 > >>