Hybrid Feature Extraction and Deep Learning Classifier Based Effective Classification for Twitter Sentiment Analysis
Keywords:
Accuracy, hybrid feature extraction, long short-term memory, softmax regression model, twitter sentiment analysisAbstract
Twitter is a widely used social media platform that is regarded as a crucial information source for gathering opinions, attitudes, reactions, and emotions from individuals. Therefore, the Twitter Sentiment Analysis (TSA) is developed for deciding the whether the textual tweets express a positive or negative opinion. The abundance of slang phrases and poor spellings in short sentence formats make it challenging to analyze Twitter data, nevertheless. In this paper, Hybrid Feature Extraction (HFE) is proposed along with the deep learning classifier to improve the classification. The HFE is the combination of Bag of Word (BoW) and FastText Word Embedding (FTWE) techniques that are used to extract the syntactic information and semantic information-related features from the tweets. The deep learning classifier namely Long Short-Term Memory (LSTM) with Softmax Regression Model (SRM) is used to classify the tweets as positive and negative. The datasets used to analyze the proposed HFE-LSTM-SRM method are Twitter and Sentiment140 datasets. The HFE-LSTM-SRM is analyzed by means of accuracy, precision, recall, F1-measure, and average computational time. The HFE-LSTM-SRM is evaluated using current techniques like Robustly Optimized Bidirectional Encoder Representations from Transformers (ROBERT-LSTM) and Spider-Monkey-Optimizer with K-Means Algorithm (SMOK). HFE-LSTM-SRM is more accurate than ROBERT-LSTM for the Sentiment140 dataset at 98.87%.
Downloads
References
Tam, S., Said, R.B. and Tanriöver, Ö.Ö., 2021. A ConvBiLSTM deep learning model-based approach for Twitter sentiment classification. IEEE Access, 9, pp.41283-41293.
Kayıkçı, Ş., 2022. SenDemonNet: sentiment analysis for demonetization tweets using heuristic deep neural network. Multimedia Tools and Applications, 81(8), pp.11341-11378.
Qiu, Y., Song, Z. and Chen, Z., 2022. Short-term stock trends prediction based on sentiment analysis and machine learning. Soft Computing, 26(5), pp.2209-2224.
Zhang, T., Gong, X. and Chen, C.P., 2021. BMT-Net: Broad multitask transformer network for sentiment analysis. IEEE Transactions on Cybernetics.
Sazzed, S. and Jayarathna, S., 2021. Ssentia: a self-supervised sentiment analyzer for classification from unlabeled data. Machine Learning with Applications, 4, p.100026.
Mendon, S., Dutta, P., Behl, A. and Lessmann, S., 2021. A Hybrid approach of machine learning and lexicons to sentiment analysis: enhanced insights from twitter data of natural disasters. Information Systems Frontiers, 23(5), pp.1145-1168.
Basiri, M.E., Nemati, S., Abdar, M., Cambria, E. and Acharya, U.R., 2021. ABCDM: An attention-based bidirectional CNN-RNN deep model for sentiment analysis. Future Generation Computer Systems, 115, pp.279-294.
Divyapushpalakshmi, M. and Ramalakshmi, R., 2021. An efficient sentimental analysis using hybrid deep learning and optimization technique for Twitter using parts of speech (POS) tagging. International Journal of Speech Technology, 24(2), pp.329-339.
Jacob, S.S. and Vijayakumar, R., 2021. Sentimental analysis over twitter data using clustering based machine learning algorithm. Journal of Ambient Intelligence and Humanized Computing, pp.1-12.
Rehioui, H. and Idrissi, A., 2019. New clustering algorithms for twitter sentiment analysis. IEEE Systems Journal, 14(1), pp.530-537.
Alattar, F. and Shaalan, K., 2021. Using artificial intelligence to understand what causes sentiment changes on social media. IEEE Access, 9, pp.61756-61767.
Swathi, T., Kasiviswanath, N. and Rao, A.A., 2022. An optimal deep learning-based LSTM for stock price prediction using twitter sentiment analysis. Applied Intelligence, pp.1-14.
Gupta, I. and Joshi, N., 2021. Feature-Based Twitter Sentiment Analysis With Improved Negation Handling. IEEE Transactions on Computational Social Systems, 8(4), pp.917-927.
Alsayat, A., 2022. Improving Sentiment Analysis for Social Media Applications Using an Ensemble Deep Learning Language Model. Arabian Journal for Science and Engineering, 47(2), pp.2499-2511.
Bibi, M., Aziz, W., Almaraashi, M., Khan, I.H., Nadeem, M.S.A. and Habib, N., 2020. A cooperative binary-clustering framework based on majority voting for Twitter sentiment analysis. IEEE Access, 8, pp.68580-68592.
Naresh, A. and Venkata Krishna, P., 2021. An efficient approach for sentiment analysis using machine learning algorithm. Evolutionary Intelligence, 14(2), pp.725-731.
Bibi, M., Abbasi, W.A., Aziz, W., Khalil, S., Uddin, M., Iwendi, C. and Gadekallu, T.R., 2022. A novel unsupervised ensemble framework using concept-based linguistic methods and machine learning for twitter sentiment analysis. Pattern Recognition Letters, 158, pp.80-86.
Naz, H., Ahuja, S. and Kumar, D., 2021. DT-FNN based effective hybrid classification scheme for twitter sentiment analysis. Multimedia Tools and Applications, 80(8), pp.11443-11458.
Shekhawat, S.S., Shringi, S. and Sharma, H., 2021. Twitter sentiment analysis using hybrid Spider Monkey optimization method. Evolutionary Intelligence, 14(3), pp.1307-1316.
Tan, K.L., Lee, C.P., Anbananthen, K.S.M. and Lim, K.M., 2022. RoBERTa-LSTM: A Hybrid Model for Sentiment Analysis With Transformer and Recurrent Neural Network. IEEE Access, 10, pp.21517-21525.
Twitter dataset, (2014). https://drive.google.com/file/d/0BwPSGZHAP_yoN2pZcVl1Qmp1OEU/view?usp=sharing.
Sentiment140 dataset. https://www.kaggle.com/kazanova/sentiment140
Hakak, S., Alazab, M., Khan, S., Gadekallu, T.R., Maddikunta, P.K.R. and Khan, W.Z., 2021. An ensemble machine learning approach through effective feature extraction to classify fake news. Future Generation Computer Systems, 117, pp.47-58.
Hao Y, Mu T, Hong R, Wang M, Liu X, Goulermas JY. Cross-domain sentiment encoding through stochastic word embedding. IEEE Trans Knowl Data Eng. 2020;32(10):1909–22. https://doi.org/10.1109/tkde.2019.2913379.
Zhu L, Li W, Shi Y, Guo K. SentiVec: learning sentiment-context vector via kernel optimization function for sentiment analysis. IEEE Trans Neural Netw Learn Syst. 2021;32(6):2561–72. https://doi.org/10.1109/tnnls.2020.3006531.
Bie Y, Yang Y. A multitask multiview neural network for end-to-end aspect-based sentiment analysis. Big Data Mining Analytics. 2021;4(3):195–207. https://doi.org/10.26599/bdma.2021.9020003.
Munuswamy S, Saranya MS, Ganapathy S, Muthurajkumar S, Kannan A. Sentiment analysis techniques for social media-based recommendation systems. Nat Acad Sci Lett. 2021;44(3):281–7. https://doi.org/10.1007/s40009-020-01007-w.
Babu, M. C. ., & K, S. . (2023). An Intelligent Optimal Secure Framework for Malicious Events Prevention in IOT Cloud Networks. International Journal on Recent and Innovation Trends in Computing and Communication, 11(3), 117–127. https://doi.org/10.17762/ijritcc.v11i3.6328
Ali Ahmed, Machine Learning in Healthcare: Applications and Challenges , Machine Learning Applications Conference Proceedings, Vol 1 2021.
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.