Sentiment Analysis on Omicron Tweets Using Hybrid Classifiers with Multiple Feature Extraction Techniques and Transformer Based Models
Keywords:
Sentiment analysis, Omicron, Twitter analysis, NLP, Big Data, TextBlob, Machine learning, Deep learning, hybrid classifiers, TF-IDF, Word2Vec, GloVe, FastText, BERT, RoBERTaAbstract
Since the beginning of Covid-19, the world has been in a dilemma to cope up with its effects. With time the coronavirus has evolved into variants that caused a lot of destruction to human race. One such variant is “Omicron”. This variant made its presence in many countries throughout the world. The government is left in a straining situation to curb the spread of this variant and to stop the evolution of coronavirus. Though the strict precautions were exercised, the evolution was unstoppable. To understand the thoughts and feelings of the public, twitter can be considered as one of the best platforms for sentiment analysis. Analyzing the sentiments of people across the continents is horridly difficult but with the way technology has been making advancement in the world, analyzing has become a quiet easy job. In the existing studies on Covid-19, various word embedding techniques with machine learning and deep learning classifiers has been used for the analysis. Language based models have proven to achieve higher accuracy for sentiment analysis. Amidst these hybrid classifiers, have performed tremendously good. In the proposed work, seven Machine Learning hybrid classifiers are compared with four single classifiers using TF-IDF and Word2Vec. A proposed Deep Learning hybrid classifier is compared with two single classifiers using GloVe and FastText. Furthermore, language models like BERT and RoBERTa are employed in an effort to boost validation outcomes upto 93.39% and 93.47%.
Downloads
References
He, X., Hong, W., Pan, X., Lu, G., & Wei, X. (2021). SARS‐Cov‐2 omicron variant: Characteristics and prevention.MedComm, 2(4), 838-845. https://doi.org/10.1002/mco2.110
Hosgurmath, S., Petli, V., & Jalihal, V. (2022). An omicron variant tweeter sentiment analysis using NLP technique.Global Trasitions Proceedings, 3(1), 215-219. https://doi.org/10.1016/j.gltp.2022.03.025
Ghaderzadeh, M., Eshraghi, M. A., Asadi, F., Hosseini, A., Jafari, R., Bashash, D., & Abolghasemi, H. (2022). Efficient framework for detection of COVID-19 omicron and delta variants based on two intelligent phases of CNN models. Computational and Mathematical Methods in Medicine, 2022, 1- 10. https://doi.org/10.1155/2022/4838009
Thakur, N., & Han, C. Y. (2022). An exploratory study of tweets about the SARS-Cov-2 omicron variant: Insights from sentiment analysis, language interpretation, source tracking, type classification, and embedded URL detection. https://doi.org/10.20944/preprints202205.0238.v1
Alsafi, R. T. (2022). Lessons from SARS-Cov, MERS-Cov, and SARS-Cov-2 infections: What we know so far.Canadian Journal of Infectious Diseases and Medical Microbiology, 2022, 1- 13. https://doi.org/10.1155/2022/1156273
Ayris, D., Imtiaz, M., Horbury, K., Williams, B., Blackney, M., Hui See, C. S., & Shah, S. A. (2022). Novel deep learning approach to model and predict the spread of COVID-19. Intelligent Systems with Applications, 14, 200068. https://doi.org/10.1016/j.iswa.2022.200068
Reveilhac, M. (2022). The deployment of social media by political authorities and health experts to enhance public information during the COVID-19 pandemic. SSM - Population Health, 19, 101165. https://doi.org/10.1016/j.ssmph.2022.101165
Yaqub, U. (2021). Tweeting during the COVID-19 pandemic. Digital Government: Research and Practice,2(1), 1-7. https://doi.org/10.1145/3428090
Singh, H., Dahiya, N., Yadav, M., & Sehrawat, N. (2022). Emergence of SARS-Cov-2 new variants and their clinical significance. Canadian Journal of Infectious Diseases and Medical Microbiology, 2022, 1- 8. https://doi.org/10.1155/2022/7336309
Kaur, H., Ahsaan, S. U., Alankar, B., & Chang, V. (2021). A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Information Systems Frontiers, 23(6), 1417- 1429. https://doi.org/10.1007/s10796-021- 10135-7
Singh, M., Jakhar, A. K., & Pandey, S. (2021). Sentiment analysis on the impact of coronavirus in social life using the BERT model. Social Network Analysis and Mining, 11(1). https://doi.org/10.1007/s13278-021-00737-z
Zhu, Z. (2022). Deep learning for Chinese language sentiment extraction and analysis. Mathematical Problems in Engineering, 2022, 1-12. https://doi.org/10.1155/2022/8145445
Chen, C., Xu, B., Yang, J., & Liu, M. (2022). Sentiment analysis of animated film reviews using intelligent machine learning. Computational Intelligence and Neuroscience, 2022, 1- 8. https://doi.org/10.1155/2022/8517205
León-Sandoval, E., Zareei, M., Barbosa-Santillán, L. I., & Falcón Morales, L. E. (2022). Measuring the impact of language models in sentiment analysis for Mexico’s COVID-19 pandemic. Electronics, 11(16), 2483. https://doi.org/10.3390/electronics11162483
Choi, Y., Lee, J., & Paek, S. Y. (2022). Public awareness and sentiment toward COVID-19 vaccination in South Korea: Findings from big data analytics. International Journal of Environmental Research and Public Health, 19(16), 9914. https://doi.org/10.3390/ijerph19169914
Shahzad, A., Zafar, B., Ali, N., Jamil, U., Alghadhban, A. J., Assam, M., Ghamry, N. A., & Eldin, E. T. (2022). COVID-19 vaccines related user’s response categorization using machine learning techniques. Computation, 10(8), 141. https://doi.org/10.3390/computation10080141
Yao, Z., Yang, J., Liu, J., Keith, M., & Guan, C. (2021). Comparing tweet sentiments in megacities using machine learning techniques: In the midst of COVID-19. Cities, 116, 103273. https://doi.org/10.1016/j.cities.2021.103273
Gaye, B., Zhang, D., & Wulamu, A. (2021). A tweet sentiment classification approach using a hybridstacked ensemble technique. Information, 12(9), 374. https://doi.org/10.3390/info12090374
Naresh, A., & Venkata Krishna, P. (2020). An efficient approach for sentiment analysis using machinelearning algorithm. Evolutionary Intelligence, 14(2), 725-731. https://doi.org/10.1007/s12065-020-00429-1
Jain, P. K., Saravanan, V., & Pamula, R. (2021). A hybrid CNN-LSTM: A deep learning approach forconsumer sentiment analysis using qualitative user-generated contents. ACM Transactions on Asian and Low Resource Language Information Processing, 20(5), 1-15. https://doi.org/10.1145/3457206
Amin, S., Uddin, M. I., AlSaeed, D. H., Khan, A., & Adnan, M. (2021). Early detection of seasonal outbreaks from Twitter data using machine learning approaches. Complexity, 2021, 1-12. https://doi.org/10.1155/2021/5520366
Al-Hashedi, A., Al-Fuhaidi, B., Mohsen, A. M., Ali, Y., Gamal Al-Kaf, H. A., Al-Sorori, W., &Maqtary, N. (2022). Ensemble classifiers for Arabic sentiment analysis of social network (Twitter data) towards COVID-19-Related conspiracy theories. Applied Computational Intelligence and Soft Computing, 2022, 1- 10. https://doi.org/10.1155/2022/6614730
Shahi, T., Sitaula, C., & Paudel, N. (2022). A hybrid feature extraction method for Nepali COVID-19Related tweets classification. Computational Intelligence and Neuroscience, 2022, 1- 11. https://doi.org/10.1155/2022/5681574
Rodrigues, A. P., Fernandes, R., A, A., B, A., Shetty, A., K, A., Lakshmanna, K., & Shafi, R. M. (2022). Real- time Twitter spam detection and sentiment analysis using machine learning and deep learning techniques. Computational Intelligence and Neuroscience, 2022, 1-14. https://doi.org/10.1155/2022/5211949
A. Bandi and A. Fellah, “Socio-analyzer: A sentiment analysis using social media data,” in Proc. 28th Int. Conf. Softw. Eng. Data Eng., in EPiC Series in Computing, vol. 64, F. Harris, S. Dascalu, S. Sharma, and R. Wu, Eds.Amsterdam, The Netherlands: EasyChair, 2019, pp. 61–67
Naseem, U., Razzak, I., Khushi, M., Eklund, P. W., & Kim, J. (2021). COVIDSenti: A large-scale benchmark Twitter data set for COVID-19 sentiment analysis. IEEE Transactions on Computational Social Systems, 8(4), 1003- 1015. https://doi.org/10.1109/tcss.2021.3051189
U. Naseem, I. Razzak, and P. W. Eklund, “A survey of preprocessing techniques to improve short-text quality: A case study on hate speech detection on Twitter,” in Multimedia Tools and Applications. Springer, Nov. 2020, pp. 1–28. [Online]. Available: https://link. springer.com/article/10.1007/s11042-020-10082-6
Liu, N., & Zhao, J. (2022). A BERT-based aspect-level sentiment analysis algorithm for cross-domain text.Computational Intelligence and Neuroscience, 2022, 1-11. https://doi.org/10.1155/2022/8726621
NLTK :: Natural Language Toolkit
Ghosh, M., & Sanyal, G. (2018). Performance assessment of multiple classifiers based on ensemble feature selection scheme for sentiment analysis. Applied Computational Intelligence and Soft Computing, 2018, 1- 12. https://doi.org/10.1155/2018/8909357
Kim, S., & Gil, J. (2019). Research paper classification systems based on TF-IDF and LDA schemes. Human centric Computing and Information Sciences, 9(1). https://doi.org/10.1186/s13673-019-0192-7
Hasan, M. R., Maliha, M., & Arifuzzaman, M. (2019). Sentiment analysis with NLP on Twitter data. 2019 International Conference on Computer, Communication, Chemical, Materials and Electronic Engineering (IC4ME2). https://doi.org/10.1109/ic4me247184.2019.9036670
Al-Saqqa, S., & Awajan, A. (2019). The use of Word2vec model in sentiment analysis. Proceedings of the 2019 International Conference on Artificial Intelligence, Robotics and Control. https://doi.org/10.1145/3388218.3388229
Ni, R., & Cao, H. (2020). Sentiment analysis based on glove and LSTM-GRU. 2020 39th Chinese Control Conference (CCC). https://doi.org/10.23919/ccc50068.2020.9188578
Li, D., He, C., & Chen, M. (2021). Text sentiment analysis based on glove model and United network. Journal of Physics: Conference Series, 1748(3), 032046. https://doi.org/10.1088/1742- 6596/1748/3/032046
Shumaly, S., Yazdinejad, M., & Guo, Y. (2021). Persian sentiment analysis of an online store independent of pre- processing using convolutional neural network with fastText embeddings. PeerJ Computer Science, 7, e422. https://doi.org/10.7717/peerj-cs.422
Joulin, A., Grave, E., Bojanowski, P., & Mikolov, T. (2017). Bag of tricks for efficient text classification. Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers. https://doi.org/10.18653/v1/e17-2068
Zhou, Z. G. (2022). Research on sentiment analysis model of short text based on deep learning. Scientific Programming, 2022, 1-7. https://doi.org/10.1155/2022/2681533
Zain, Z. M., & Alturki, N. M. (2021). COVID-19 pandemic forecasting using CNN-LSTM: A hybridapproach.
Journal of Control Science and Engineering, 2021, 1-23. https://doi.org/10.1155/2021/8785636
Alouffi, B., Alharbi, A., Sahal, R., & Saleh, H. (2021). An optimized hybrid deep learning model to detect COVID-
misleading information. Computational Intelligence and Neuroscience, 2021, 1-15. https://doi.org/10.1155/2021/9615034
Lin, H. Y., & Moh, T. (2021). Sentiment analysis on COVID tweets using COVID-Twitter-BERT with auxiliary sentence approach. Proceedings of the 2021 ACM Southeast Conference. https://doi.org/10.1145/3409334.3452074
Bai, Y., Zhang, Y., Xiao, K., Lou, Y., & Sun, K. (2021). A BERT-based approach for extracting prerequisiterelations among Wikipedia concepts. Mathematical Problems in Engineering, 2021, 1- 8. https://doi.org/10.1155/2021/3510402
Rahman, M. M., & Islam, M. N. (2021). Exploring the performance of ensemble machine learningclassifiers for sentiment analysis of COVID-19 tweets. Advances in Intelligent Systems and Computing, 383396. https://doi.org/10.1007/978-981-16-5157-1_30
Divyapushpalakshmi, M., & Ramalakshmi, R. (2021). An efficient sentimental analysis using hybrid deep learning and optimization technique for Twitter using parts of speech (POS) tagging. International Journal of Speech Technology, 24(2), 329-339. https://doi.org/10.1007/s10772-021-09801-7
Dang, C. N., Moreno-García, M. N., & De la Prieta, F. (2021). Hybrid deep learning models for sentimentanalysis.
Complexity, 2021, 1-16. https://doi.org/10.1155/2021/9986920
Kaur, H., Ahsaan, S. U., Alankar, B., & Chang, V. (2021). A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Information Systems Frontiers, 23(6), 1417-1429. https://doi.org/10.1007/s10796-021- 10135-7
Qaid, T. S., Mazaar, H., Al-Shamri, M. Y., Alqahtani, M. S., Raweh, A. A., & Alakwaa, W. (2021). Hybrid deep- learning and machine-learning models for predicting COVID-19. Computational Intelligence and Neuroscience, 2021, 1- 11. https://doi.org/10.1155/2021/9996737
Shrivastava, A., Chakkaravarthy, M., Shah, M.A..A Novel Approach Using Learning Algorithm for Parkinson’s Disease Detection with Handwritten Sketches. In Cybernetics and Systems, 2022
Shrivastava, A., Chakkaravarthy, M., Shah, M.A., A new machine learning method for predicting systolic and diastolic blood pressure using clinical characteristics. In Healthcare Analytics, 2023, 4, 100219
Shrivastava, A., Chakkaravarthy, M., Shah, M.A.,Health Monitoring based Cognitive IoT using Fast Machine Learning Technique. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 720–729
Shrivastava, A., Rajput, N., Rajesh, P., Swarnalatha, S.R., IoT-Based Label Distribution Learning Mechanism for Autism Spectrum Disorder for Healthcare Application. In Practical Artificial Intelligence for Internet of Medical Things: Emerging Trends, Issues, and Challenges, 2023, pp. 305–321
Boina, R., Ganage, D., Chincholkar, Y.D., .Chinthamu, N., Shrivastava, A., Enhancing Intelligence Diagnostic Accuracy Based on Machine Learning Disease Classification. In International Journal of Intelligent Systems and Applications in Engineering, 2023, 11(6s), pp. 765–774
Shrivastava, A., Pundir, S., Sharma, A., ...Kumar, R., Khan, A.K. Control of A Virtual System with Hand Gestures. In Proceedings - 2023 3rd International Conference on Pervasive Computing and Social Networking, ICPCSN 2023, 2023, pp. 1716–1721
Qasim, R., Bangyal, W. H., Alqarni, M. A., & Ali Almazroi, A. (2022). A fine-tuned BERT-based transfer learning approach for text classification. Journal of Healthcare Engineering, 2022, 1- 17. https://doi.org/10.1155/2022/3498123
Liao, W., Zeng, B., Yin, X., & Wei, P. (2020). An improved aspect-category sentiment analysis model for text sentiment analysis based on Roberta. Applied Intelligence, 51(6), 3522-3533. https://doi.org/10.1007/s10489- 020-01964-1
Samuel, J., Rahman, M. M., Ali, G., Esawi, E., & Samuel, Y. (2020). COVID-19 public sentiment insightsand machine learning for tweets classification. https://doi.org/10.31234/osf.io/sw2dn
Wang, H., Sun, K., & Wang, Y. (2022). Exploring the Chinese public’s perception of omicron variants on social media: LDA-based topic modeling and sentiment analysis. International Journal of Environmental Research and Public Health, 19(14), 8377. https://doi.org/10.3390/ijerph19148377
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.