Improving Opinion Mining Accuracy with Dragonfly Algorithm-Based Hybrid Classification
Keywords:
Machine Learning, Dragonfly, Hybrid, TF-IDF, Datasets.Abstract
The paper proposes the Dragonfly + Hybrid Classifier, a novel approach designed to enhance opinion mining across diverse datasets. Leveraging the Dragonfly algorithm for feature set selection and combining it with a hybrid classification method, this innovative approach offers the potential for more accurate and reliable predictions. On the Twitter Sentiment dataset, notorious for its dynamic and noisy nature, the Dragonfly + Hybrid Classifier excels with an average precision of approximately 0.93498, recall of approximately 0.92965, and an F-measure of approximately 0.93208, alongside an average accuracy of around 96.134%. Within the Movie Review dataset, where opinions are nuanced and context-dependent, the Dragonfly + Hybrid Classifier secures an impressive average precision of approximately 0.91348, coupled with an average recall of approximately 0.9189, achieving an F-measure of approximately 0.91582 and maintaining an average accuracy of around 94.98%. In the context of the Depression dataset, where sensitivity and accuracy are paramount, the Dragonfly + Hybrid Classifier excels with an average precision of approximately 0.9627, an average recall of approximately 0.966, an F-measure of approximately 0.9643, and an average accuracy of around 94.62%. These findings collectively affirm the Dragonfly + Hybrid Classifier as a potent tool for opinion analysis across diverse domains, positioning it as a valuable asset in field of opinion mining and analysis applications, particularly in domains where opinion understanding is paramount.
Downloads
References
S. L. Marie-Sainte and N. Alalyani, “Firefly algorithm based feature selection for Arabic text classification,” Journal of King Saud University-Computer and Information Sciences, vol. 32, no. 3, pp. 320–328, 2020.
A. Ahad, B. Madhuravani, S. Kosuru, S. M. F. U. Haque, and M. A. Hussain, “An Intelligent Data Mining System for Tweet Opinion Analysis using Combined Cluster based Classification Approach,” International Journal of Intelligent Systems and Applications in Engineering, vol. 10, no. 4, pp. 604–609, Dec. 2022.
A. H. Abdulhafiz, “Novel Opinion mining System for Movie Reviews,” International Journal of Intelligent Systems and Applications in Engineering, vol. 8, no. 2, pp. 94–101, Jun. 2020, doi: 10.18201/ijisae.2020261590.
N. K. Laskari and S. K. Sanampudi, “Suggestion Key Phrase Extraction: A Fine-Grained Suggestion Mining from Opinion Reviews,” International Journal of Intelligent Systems and Applications in Engineering, vol. 11, no. 4, pp. 164–171, Sep. 2023.
P. Durga and D. Godavarthi, “A Review on Evaluation of Different Models for Classifying Sentiments from Twitter: Challenges and Applications,” International Journal of Intelligent Systems and Applications in Engineering, vol. 12, no. 1, pp. 235–266.
D. Elangovan and V. Subedha, “An Effective Feature Selection Based Classification model using Firefly with Levy and Multilayer Perceptron based Sentiment Analysis,” in 2020 International Conference on Inventive Computation Technologies (ICICT), 2020, pp. 376–380.
R. K. Patra, B. Patil, T. S. Kumar, G. Shivakanth, and B. M. Manjula, “Machine Learning Based Sentiment Analysis and Swarm Intelligence,” in 2023 IEEE International Conference on Integrated Circuits and Communication Systems (ICICACS), 2023, pp. 1–8.
S. S. Alotaibi, E. Alabdulkreem, S. Althahabi, M. A. Hamza, M. Rizwanullah, A. S. Zamani, A. Motwakel, and R. Marzouk, “Artificial fish swarm optimization with deep learning enabled opinion mining approach,” Computer Systems Science and Engineering, vol. 45, no. 1, pp. 737–751, 2023.
H. Chantar, M. Mafarja, H. Alsawalqah, A. A. Heidari, I. Aljarah, and H. Faris, “Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification,” Neural Computing and Applications, vol. 32, pp. 12201–12220, 2020.
D. Deepika and N. Balaji, “Effective heart disease prediction with Grey-wolf with Firefly algorithm-differential evolution (GF-DE) for feature selection and weighted ANN classification,” Computer Methods in Biomechanics and Biomedical Engineering, vol. 25, no. 12, pp. 1409–1427, 2022.
X. Yong and Y. L. Gao, “Improved firefly algorithm for feature selection with the ReliefF-based initialization and the weighted voting mechanism,” Neural Computing and Applications, vol. 35, no. 1, pp. 275–301, 2023.
T. Bezdan, M. Zivkovic, N. Bacanin, A. Chhabra, and M. Suresh, “Feature selection by hybrid brain storm optimization algorithm for covid-19 classification,” Journal of Computational Biology, vol. 29, no. 6, pp. 515–529, 2022.
P. Mudgil, P. Gupta, I. Mathur, and N. Joshi, “An improved context-aware analysis for sentimental Grass Hopper Optimization algorithm and its post affects on Twitter,” Journal of Intelligent & Fuzzy Systems, 2023, doi: 10.3233/JIFS-219430.
R. Asgarnezhad, S. A. Monadjemi, and M. Soltanaghaei, “An application of MOGW optimization for feature selection in text classification,” The Journal of Supercomputing, vol. 77, pp. 5806–5839, 2021.
A. Alarifi, A. Tolba, Z. Al-Makhadmeh, and W. Said, “A big data approach to sentiment analysis using greedy feature selection with cat swarm optimization-based long short-term memory neural networks,” The Journal of Supercomputing, vol. 76, pp. 4414–4429, 2020.
M. Tubishat, M. A. Abushariah, N. Idris, and I. Aljarah, “Improved whale optimization algorithm for feature selection in Arabic sentiment analysis,” Applied Intelligence, vol. 49, pp. 1688–1707, 2019.
“Sentiment 140 dataset.” available online at https://www.kaggle.com/datasets/kazanova/sentiment140 (accessed jan, 2024).
“NLTK Movie Reviews Dataset.” available onlne at https://www.kaggle.com/datasets/nltkdata/movie-review (accessed jan, 2024).
“Depression Sentimental Analysis for Tweets Dataset.” available at https://www.kaggle.com/datasets/gargmanas/sentimental-analysis-for-tweets (accessed jan, 2024).
A. Madasu and S. Elango, “Efficient feature selection techniques for sentiment analysis,” Multimedia Tools and Applications 2019 79:9, vol. 79, no. 9, pp. 6313–6335, Dec. 2019, doi: 10.1007/S11042-019-08409-Z.
M. K. Vijaymeena and K. Kavitha, “A survey on similarity measures in text mining,” Machine Learning and Applications: An International Journal (MLAIJ), vol. 3, no. 1, 2016, doi: 10.5121/mlaij.2016.3103.
A. Gupta, P. Tyagi, T. Choudhury, and M. Shamoon, “Sentiment Analysis Using Support Vector Machine,” Proceedings of the 4th International Conference on Contemporary Computing and Informatics, IC3I 2019, pp. 49–53, Dec. 2019, doi: 10.1109/IC3I46837.2019.9055645.
Downloads
Published
How to Cite
Issue
Section
License
![Creative Commons License](http://i.creativecommons.org/l/by-sa/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
All papers should be submitted electronically. All submitted manuscripts must be original work that is not under submission at another journal or under consideration for publication in another form, such as a monograph or chapter of a book. Authors of submitted papers are obligated not to submit their paper for publication elsewhere until an editorial decision is rendered on their submission. Further, authors of accepted papers are prohibited from publishing the results in other publications that appear before the paper is published in the Journal unless they receive approval for doing so from the Editor-In-Chief.
IJISAE open access articles are licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. This license lets the audience to give appropriate credit, provide a link to the license, and indicate if changes were made and if they remix, transform, or build upon the material, they must distribute contributions under the same license as the original.