Enhancement of the Lexical Approach by N-Grams Technique via Improving Negation-Based Traditional Sentiment Analysis

Authors

  • Harish Dutt Sharma Department of Computer Application, School of CA&IT, Shri Guru Ram Rai University, Dehradun 248001, Uttarakhand, India
  • Sanjay Sharma Department of Computer Application, School of CA&IT, Shri Guru Ram Rai University, Dehradun 248001, Uttarakhand, India

Keywords:

Negation handling, N-Grams techniques, Sentiment analysis, Pre-processing technique, Machine learning

Abstract

Sentiment analysis, often known as opinion mining, is a significant area in artificial intelligence today. Sentiment analysis was widely observed in this field. Currently, a lot of data is constantly being exchanged as text on social networking and e-commerce platforms like Facebook, Twitter, Amazon, etc. Therefore, sentiment analysis is the best technique for businesses to comprehend what their customers want from them so that they may adapt their plans in response to client feedback and expand their customer base. To extract the exact meaning from the text is a tough task. So here, our effort is to get the positive and negative sentiment of reviews from the dataset and enhance the performance of sentiment through Natural language processing (NLP) over pre-existing pre-processing technique and machine learning algorithms. So for this purpose, we have an Amazon product review dataset. Which is an extract from the Kaggle website. In this study, we aim to remove noise from the dataset and improve the traditional NLP preprocessing technique after that, we will use Term Frequency-Inverse Document Frequency (TF-IDF) method for feature selection and then classify the result through the classification algorithm such as Artificial Neural Network (ANN), Naïve Bayes (NB), and Support Vector Machine (SVM).

Downloads

Download data is not yet available.

References

M. A. Palomino and F. Aider, “Evaluating the Effectiveness of Text Pre-Processing in Sentiment Analysis,” Appl. Sci., vol. 12, no. 17, Sep. 2022, doi: 10.3390/app12178765.

V. Chang, L. Liu, Q. Xu, T. Li, and C. H. Hsu, “An improved model for sentiment analysis on luxury hotel review,” in Expert Systems, John Wiley and Sons Inc, Feb. 2023. doi: 10.1111/exsy.12580.

A. Iqbal, R. Amin, J. Iqbal, R. Alroobaea, A. Binmahfoudh, and M. Hussain, “Sentiment Analysis of Consumer Reviews Using Deep Learning,” Sustain., vol. 14, no. 17, Sep. 2022, doi: 10.3390/su141710844.

K. Gupta, N. Jiwani, and P. Whig, “Effectiveness of Machine Learning in Detecting Early-Stage Leukemia,” 2023, pp. 461–472. doi: 10.1007/978-981-19-2535-1_34.

R. Prabowo and M. Thelwall, “Sentiment analysis: A combined approach,” J. Informetr., vol. 3, no. 2, pp. 143–157, Apr. 2009, doi: 10.1016/j.joi.2009.01.003.

Y. Yu, W. Duan, and Q. Cao, “The impact of social and conventional media on firm equity value: A sentiment analysis approach,” Decis. Support Syst., vol. 55, no. 4, pp. 919–926, Nov. 2013, doi: 10.1016/j.dss.2012.12.028.

R. Kumar Behera, S. Kumar Rath, S. Misra, R. Damaševičius, and R. Maskeliūnas, “Distributed Centrality Analysis of Social Network Data Using MapReduce,” Algorithms, vol. 12, no. 8, p. 161, Aug. 2019, doi: 10.3390/a12080161.

S. Vohra and J. Teraiya, “Applications and Challenges for Sentiment Analysis : A Survey,” Int. J. Eng. Res. Technol., vol. 2, 2013.

A. Alsayat, “Improving Sentiment Analysis for Social Media Applications Using an Ensemble Deep Learning Language Model,” Arab. J. Sci. Eng., vol. 47, no. 2, pp. 2499–2511, Feb. 2022, doi: 10.1007/s13369-021-06227-w.

N. P. Cruz, M. Taboada, and R. Mitkov, “A machine-learning approach to negation and speculation detection for sentiment analysis,” J. Assoc. Inf. Sci. Technol., vol. 67, no. 9, pp. 2118–2136, Sep. 2016, doi: 10.1002/asi.23533.

J. Serrano-Guerrero, J. A. Olivas, F. P. Romero, and E. Herrera-Viedma, “Sentiment analysis: A review and comparative analysis of web services,” Inf. Sci. (Ny)., vol. 311, pp. 18–38, Aug. 2015, doi: 10.1016/j.ins.2015.03.040.

B. Pang and L. Lee, “Opinion Mining and Sentiment Analysis,” Found. Trends® Inf. Retr., vol. 2, no. 1–2, pp. 1–135, 2008, doi: 10.1561/1500000011.

I. Councill, R. Mcdonald, and L. Velikovich, “What’s great and what’s not: Learning to classify the scope of negation for improved sentiment analysis,” pp. 51–59, Jun. 2010.

Aashutosh Bhatt, Ankit Patel, Harsh Chheda, and Kiran Gawande, “Amazon Review Classification and Sentiment Analysis ,” Int. J. Comput. Sci. Inf. Technol., vol. 6 (6), pp. 5107–5110, 2015.

E. Haddi, X. Liu, and Y. Shi, “The Role of Text Pre-processing in Sentiment Analysis,” Procedia Comput. Sci., vol. 17, pp. 26–32, 2013, doi: 10.1016/j.procs.2013.05.005.

K. S. Srujan, S. S. Nikhil, H. Raghav Rao, K. Karthik, B. S. Harish, and H. M. Keerthi Kumar, “Classification of Amazon Book Reviews Based on Sentiment Analysis,” 2018, pp. 401–411. doi: 10.1007/978-981-10-7512-4_40.

N. Prollochs, S. Feuerriegel, and D. Neumann, “Enhancing Sentiment Analysis of Financial News by Detecting Negation Scopes,” in 2015 48th Hawaii International Conference on System Sciences, IEEE, Jan. 2015, pp. 959–968. doi: 10.1109/HICSS.2015.119.

N. Prollochs, S. Feuerriegel, and D. Neumann, “Detecting Negation Scopes for Financial News Sentiment Using Reinforcement Learning,” in 2016 49th Hawaii International Conference on System Sciences (HICSS), IEEE, Jan. 2016, pp. 1164–1173. doi: 10.1109/HICSS.2016.147.

L. Rokach, R. Romano, and O. Maimon, “Negation recognition in medical narrative reports,” Inf. Retr. Boston., vol. 11, no. 6, pp. 499–538, Dec. 2008, doi: 10.1007/s10791-008-9061-0.

B. Trstenjak, S. Mikac, and D. Donko, “KNN with TF-IDF based Framework for Text Categorization,” Procedia Eng., vol. 69, pp. 1356–1364, 2014, doi: 10.1016/j.proeng.2014.03.129.

T. U. Haque, N. N. Saber, and F. M. Shah, “Sentiment analysis on large scale Amazon product reviews,” in 2018 IEEE International Conference on Innovative Research and Development (ICIRD), IEEE, May 2018, pp. 1–6. doi: 10.1109/ICIRD.2018.8376299.

B. S. Rintyarna, R. Sarno, and C. Fatichah, “Enhancing the performance of sentiment analysis task on product reviews by handling both local and global context,” Int. J. Inf. Decis. Sci., vol. 12, no. 1, p. 75, 2020, doi: 10.1504/IJIDS.2020.104992.

A. Ritter, Mausam, O. Etzioni, and S. Clark, “Open domain event extraction from twitter,” in Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining, New York, NY, USA: ACM, Aug. 2012, pp. 1104–1112. doi: 10.1145/2339530.2339704.

A. R. Razon, J. A. Barnden, and J. A. Barnden@cs, “A New Approach to Automated Text Readability Classification based on Concept Indexing with Integrated Part-of-Speech n-gram Features.”

M. Wankhade, A. C. S. Rao, and C. Kulkarni, “A survey on sentiment analysis methods, applications, and challenges,” Artif. Intell. Rev., vol. 55, no. 7, pp. 5731–5780, Oct. 2022, doi: 10.1007/s10462-022-10144-1.

A. P. Rodrigues et al., “Real-Time Twitter Spam Detection and Sentiment Analysis using Machine Learning and Deep Learning Techniques,” Comput. Intell. Neurosci., vol. 2022, 2022, doi: 10.1155/2022/5211949.

Downloads

Published

07.02.2024

How to Cite

Sharma, H. D., & Sharma, S. . (2024). Enhancement of the Lexical Approach by N-Grams Technique via Improving Negation-Based Traditional Sentiment Analysis. International Journal of Intelligent Systems and Applications in Engineering, 12(15s), 63–69. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4715

Issue

Section

Research Article