Hybrid Feature Extraction and Deep Learning Classifier Based Effective Classification for Twitter Sentiment Analysis

Authors

  • Usha G. R. Department of ISE, Sri Dharmasthala Manjunatheshwara Institute of Technology, Ujire-574240 Affiliated to Visvesvaraya Technological University, Belagavi, Karnataka, INDIA
  • J. V. Gorabal Department of CSE, ATME College of Engineering, Mysuru 570028, INDIA

Keywords:

Accuracy, hybrid feature extraction, long short-term memory, softmax regression model, twitter sentiment analysis

Abstract

Twitter is a widely used social media platform that is regarded as a crucial information source for gathering opinions, attitudes, reactions, and emotions from individuals. Therefore, the Twitter Sentiment Analysis (TSA) is developed for deciding the whether the textual tweets express a positive or negative opinion. The abundance of slang phrases and poor spellings in short sentence formats make it challenging to analyze Twitter data, nevertheless. In this paper, Hybrid Feature Extraction (HFE) is proposed along with the deep learning classifier to improve the classification. The HFE is the combination of Bag of Word (BoW) and FastText Word Embedding (FTWE) techniques that are used to extract the syntactic information and semantic information-related features from the tweets. The deep learning classifier namely Long Short-Term Memory (LSTM) with Softmax Regression Model (SRM) is used to classify the tweets as positive and negative. The datasets used to analyze the proposed HFE-LSTM-SRM method are Twitter and Sentiment140 datasets. The HFE-LSTM-SRM is analyzed by means of accuracy, precision, recall, F1-measure, and average computational time. The HFE-LSTM-SRM is evaluated using current techniques like Robustly Optimized Bidirectional Encoder Representations from Transformers (ROBERT-LSTM) and Spider-Monkey-Optimizer with K-Means Algorithm (SMOK). HFE-LSTM-SRM is more accurate than ROBERT-LSTM for the Sentiment140 dataset at 98.87%.

Downloads

Download data is not yet available.

References

Tam, S., Said, R.B. and Tanriöver, Ö.Ö., 2021. A ConvBiLSTM deep learning model-based approach for Twitter sentiment classification. IEEE Access, 9, pp.41283-41293.

Kayıkçı, Ş., 2022. SenDemonNet: sentiment analysis for demonetization tweets using heuristic deep neural network. Multimedia Tools and Applications, 81(8), pp.11341-11378.

Qiu, Y., Song, Z. and Chen, Z., 2022. Short-term stock trends prediction based on sentiment analysis and machine learning. Soft Computing, 26(5), pp.2209-2224.

Zhang, T., Gong, X. and Chen, C.P., 2021. BMT-Net: Broad multitask transformer network for sentiment analysis. IEEE Transactions on Cybernetics.

Sazzed, S. and Jayarathna, S., 2021. Ssentia: a self-supervised sentiment analyzer for classification from unlabeled data. Machine Learning with Applications, 4, p.100026.

Mendon, S., Dutta, P., Behl, A. and Lessmann, S., 2021. A Hybrid approach of machine learning and lexicons to sentiment analysis: enhanced insights from twitter data of natural disasters. Information Systems Frontiers, 23(5), pp.1145-1168.

Basiri, M.E., Nemati, S., Abdar, M., Cambria, E. and Acharya, U.R., 2021. ABCDM: An attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Generation Computer Systems, 115, pp.279-294.

Divyapushpalakshmi, M. and Ramalakshmi, R., 2021. An efficient sentimental analysis using hybrid deep learning and optimization technique for Twitter using parts of speech (POS) tagging. International Journal of Speech Technology, 24(2), pp.329-339.

Jacob, S.S. and Vijayakumar, R., 2021. Sentimental analysis over twitter data using clustering based machine learning algorithm. Journal of Ambient Intelligence and Humanized Computing, pp.1-12.

Rehioui, H. and Idrissi, A., 2019. New clustering algorithms for twitter sentiment analysis. IEEE Systems Journal, 14(1), pp.530-537.

Alattar, F. and Shaalan, K., 2021. Using artificial intelligence to understand what causes sentiment changes on social media. IEEE Access, 9, pp.61756-61767.

Swathi, T., Kasiviswanath, N. and Rao, A.A., 2022. An optimal deep learning-based LSTM for stock price prediction using twitter sentiment analysis. Applied Intelligence, pp.1-14.

Gupta, I. and Joshi, N., 2021. Feature-Based Twitter Sentiment Analysis With Improved Negation Handling. IEEE Transactions on Computational Social Systems, 8(4), pp.917-927.

Alsayat, A., 2022. Improving Sentiment Analysis for Social Media Applications Using an Ensemble Deep Learning Language Model. Arabian Journal for Science and Engineering, 47(2), pp.2499-2511.

Bibi, M., Aziz, W., Almaraashi, M., Khan, I.H., Nadeem, M.S.A. and Habib, N., 2020. A cooperative binary-clustering framework based on majority voting for Twitter sentiment analysis. IEEE Access, 8, pp.68580-68592.

Naresh, A. and Venkata Krishna, P., 2021. An efficient approach for sentiment analysis using machine learning algorithm. Evolutionary Intelligence, 14(2), pp.725-731.

Bibi, M., Abbasi, W.A., Aziz, W., Khalil, S., Uddin, M., Iwendi, C. and Gadekallu, T.R., 2022. A novel unsupervised ensemble framework using concept-based linguistic methods and machine learning for twitter sentiment analysis. Pattern Recognition Letters, 158, pp.80-86.

Naz, H., Ahuja, S. and Kumar, D., 2021. DT-FNN based effective hybrid classification scheme for twitter sentiment analysis. Multimedia Tools and Applications, 80(8), pp.11443-11458.

Shekhawat, S.S., Shringi, S. and Sharma, H., 2021. Twitter sentiment analysis using hybrid Spider Monkey optimization method. Evolutionary Intelligence, 14(3), pp.1307-1316.

Tan, K.L., Lee, C.P., Anbananthen, K.S.M. and Lim, K.M., 2022. RoBERTa-LSTM: A Hybrid Model for Sentiment Analysis With Transformer and Recurrent Neural Network. IEEE Access, 10, pp.21517-21525.

Twitter dataset, (2014). https://drive.google.com/file/d/0BwPSGZHAP_yoN2pZcVl1Qmp1OEU/view?usp=sharing.

Sentiment140 dataset. https://www.kaggle.com/kazanova/sentiment140

Hakak, S., Alazab, M., Khan, S., Gadekallu, T.R., Maddikunta, P.K.R. and Khan, W.Z., 2021. An ensemble machine learning approach through effective feature extraction to classify fake news. Future Generation Computer Systems, 117, pp.47-58.

Hao Y, Mu T, Hong R, Wang M, Liu X, Goulermas JY. Cross-domain sentiment encoding through stochastic word embedding. IEEE Trans Knowl Data Eng. 2020;32(10):1909–22. https://doi.org/10.1109/tkde.2019.2913379.

Zhu L, Li W, Shi Y, Guo K. SentiVec: learning sentiment-context vector via kernel optimization function for sentiment analysis. IEEE Trans Neural Netw Learn Syst. 2021;32(6):2561–72. https://doi.org/10.1109/tnnls.2020.3006531.

Bie Y, Yang Y. A multitask multiview neural network for end-to-end aspect-based sentiment analysis. Big Data Mining Analytics. 2021;4(3):195–207. https://doi.org/10.26599/bdma.2021.9020003.

Munuswamy S, Saranya MS, Ganapathy S, Muthurajkumar S, Kannan A. Sentiment analysis techniques for social media-based recommendation systems. Nat Acad Sci Lett. 2021;44(3):281–7. https://doi.org/10.1007/s40009-020-01007-w.

Babu, M. C. ., & K, S. . (2023). An Intelligent Optimal Secure Framework for Malicious Events Prevention in IOT Cloud Networks. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 117–127. https://doi.org/10.17762/ijritcc.v11i3.6328

Ali Ahmed, Machine Learning in Healthcare: Applications and Challenges , Machine Learning Applications Conference Proceedings, Vol 1 2021.

Downloads

Published

16.07.2023

How to Cite

G. R., U. ., & Gorabal, J. V. . (2023). Hybrid Feature Extraction and Deep Learning Classifier Based Effective Classification for Twitter Sentiment Analysis. International Journal of Intelligent Systems and Applications in Engineering, 11(3), 142–149. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/3151

Issue

Section

Research Article