FRARBiLSTM-A Novel Fake Review Authentication Model Using Afinn and Roberta

Authors

  • Vikas Attri Lovely Professional University, Faculty of Computer Applications, Department of Computer Applications, Phagwara, Punjab 144001, India
  • Isha Batra Lovely Professional University, Faculty of Computer Applications, Department of Computer Applications, Phagwara, Punjab 144001, India
  • Arun Malik Lovely Professional University, Faculty of Computer Applications, Department of Computer Applications, Phagwara, Punjab 144001, India
  • Vipin Kumar Lovely Professional University, Faculty of Computer Applications, Department of Computer Applications, Phagwara, Punjab 144001, India

Keywords:

Online Social Networks (OSN), E-commerce, Machine Learning, Word Embeddings, Ensemble Learning, Afinn, RoBERTa

Abstract

Due to the rise in online transactions, fake customer review identification is attracting attention. Fake customer reviews are identified using features such as reviewer identification, product information, and review text. Recent research suggests that review semantics may be particularly pertinent for text classification. The reviewers' veiled feelings could also point to misleading information. Our neural network model combines word context, customer emotions, and the traditional bag-of-words to improve fake review detection. The algorithms use N-grams, dynamic word embeddings, and emotion indicators based on lexicons to learn document-level representation. We contrast the classification performance of the detection systems with several cutting-edge methods for fake review detection to demonstrate the value of the systems. No matter the sentiment polarity or product category, the suggested approaches on the present datasets outperform Afinn, RoBERTa, Ensemble, and hybrid models. The paper offers Hybrid/Ensemble-based strategies under the proposed model named FRARBiLSTM(Fake Reviews-AFINN RoBERTa using Bidirectional LSTM). This model performs better than previous classifiers in detecting false reviews with an accuracy of 97.31. When used with Ensemble and hybrid Learning, this model can exceed and attain superior performance compared to the most modern word embedding algorithms, particularly RoBERTa and AFINN.

Downloads

Download data is not yet available.

References

Rout, Jitendra Kumar, Amiya Kumar Dash, and Niranjan Kumar Ray. "A framework for fake review detection: issues and challenges." 2018 international conference on information technology (ICIT). IEEE, 2018.

Floyd, Kristopher, et al. "How online product reviews affect retail sales: A meta-analysis." Journal of retailing 90.2 (2014): 217-232.

BrightLocal (2018) Local consumer review survey 2018. https:// www.brightlocal.com/research/local-consumer-review- survey/. Accessed 8 Nov 2019

The Times (2018) ‘A third of TripAdvisor reviews are fake’ as cheats buy five stars. The Times September 22, 2018. https:// www.thetimes.co.uk/article/hotel-and-caf-cheats-are- caught-try ing-to-buy-tripadvisor-stars-027fbcwc8. Accessed 22 Jan 2019

Ott M, Cardie C, Hancock JT (2013) Negative deceptive opinion spam. In: 2013 conference of the North American chapter of the association for computational linguistics: human language technologies, ACL, pp 497–501

Harris C (2012) Detecting deceptive opinion spam using human computation. In: Workshops at AAAI on artificial intelligence, AAAI, pp 87–93

Atefeh, Heydari, et al. "Detection of review spam: A survey." Expert Systems with Applications 42.7 (2015): 3634-3642.

Hussain N, Turab Mirza H, Rasool G, Hussain I, Kaleem M (2019) Spam review detection techniques: a systematic literature review. Appl Sci 9(5):987. https://doi.org/10.3390/ app9050987

Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Al Najada H (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):1–23. https://doi.org/10. 1186/s40537-015-0029-9

Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224. https:// doi. org/10.1016/j.ins.2017.01.015

Le Q, Mikolov T (2014) Distributed representations of sentences and documents. In: International conference on machine learning, JMLR, vol 32, pp 1188–1196

Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J (2013) Distributed representations of words and phrases and their compositionality. In: Advances in neural information processing systems, NIPS, vol 26, pp 3111–3119

Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41. https://doi.org/10.1016/ j.neucom.2016. 10.080

Peng Q, Zhong M (2014) Detecting spam review through senti- ment analysis. J Softw 9(8):2065–2072. https://doi.org/ 10.4304/ jsw.9.8.2065-2072

Barbado R, Araque O, Iglesias CA (2019) A framework for fake review detection in online consumer electronics retailers. Inf Process Manag 56(4):1234–1244. https://doi.org/10.1016/j. indmarman.2019.08.003

Kennedy S, Walsh N, Sloka K, McCarren A, Foster J (2019) Fact or factitious? Contextualized opinion spam detection. In: Proceedings of the 57th annual meeting of the association for computational linguistics: student research workshop, ACL, pp 344–350. https://doi.org/10.18653/v1/p19-2048

Liu Y, Pang B, Wang X (2019) Opinion spam detection by incorporating multimodal embedded representation into a probabilistic review graph. Neurocomputing 366:276–283. https://doi. org/10.1016/j.neucom.2019.08.013

Felbermayr A, Nanopoulos A (2016) The role of emotions for the perceived usefulness in online customer reviews. J Interact Mark 36:60–76. https://doi.org/10.1016/j.intmar.2016.05.004

Malik, M. S. I., and Ayyaz Hussain. "Helpfulness of product reviews as a function of discrete positive and negative emotions." Computers in Human Behavior 73 (2017): 290-302.

Barushka A, Hajek P (2016) Spam filtering using regularized neural networks with rectified linear units. In: Adorni G, Cagnoni S, Gori M, Maratea M (eds) Conference of the Italian association for artificial intelligence, vol 10037. Lecture notes in computer science. Springer, Cham, pp 65–75. https:// doi.org/10.1007/978- 3-319-49130-1_6

Barushka A, Hajek P (2018) Spam filtering using integrated distribution-based balancing approach and regularized deep neural networks. Appl Intell 48(10):3538–3556. https:// doi.org/ 10.1007/s10489-018-1161-y

Barushka A, Hajek P (2018) Spam filtering in social networks using regularized deep neural networks with ensemble learning. In: Iliadis L, Maglogiannis I, Plagianakos V (eds) Artificial intelligence applications and innovations. AIAI 2018, vol 519. IFIP advances in information and communication technology. Springer, Cham, pp 38–49. https://doi.org/ 10.1007/978-3-319- 92007-8_4

Jain G, Sharma M, Agarwal B (2018) Spam detection on social media using semantic convolutional neural network. Int J Knowl Discov Bioinform (IJKDB) 8(1):12–26. https://doi.org/ 10.4018/ IJKDB.2018010102

Jain G, Sharma M, Agarwal B (2019) Spam detection in social media using convolutional and long short term memory neural network. Ann Math Artif Intell 85(1):21–44. https:// doi.org/10. 1007/s10472-018-9612-z

Madisetty S, Desarkar MS (2018) A neural network-based ensemble approach for spam detection in Twitter. IEEE Trans Comput Soc Syst 5(4):973–984. https://doi.org/ 10.1109/TCSS. 2018.2878852

Jindal N, Liu B (2007) Analyzing and detecting review spam. In: 7th IEEE international conference on data mining, ICDM 2007, IEEE, pp 547–552. https://doi.org/10.1109/icdm.2007.68

Wang G, Li C, Wang W, Zhang Y, Shen D, Zhang X, Henao R, Carin L (2018) Joint embedding of words and labels for text classification. In: Proceedings of the 56th annual meeting of the association for computational linguistics, ACL, pp 2321– 2331.https://doi.org/10.18653/v1/p18-1216

Liu Y, Pang B, Wang X (2019) Opinion spam detection by incorporating multimodal embedded representation into a prob- abilistic review graph. Neurocomputing 366:276–283. https://doi. org/10.1016/j.neucom.2019.08.013

Ghai R, Kumar S, Pandey AC (2019) Spam detection using rating and review processing method. In: Panigrahi B, Trivedi M, Mishra K, Tiwari S, Singh P (eds) Smart innovations in com- munication and computational sciences. Springer, Singapore, pp 189–198. https://doi.org/ 10.1007/978-981-10-8971-8_18

Xue H, Wang Q, Luo B, Seo H, Li F (2019) Content-aware trust propagation toward online review spam detection. J Data Inf Qual (JDIQ) 11(3):11. https://doi.org/10.1145/3305258

Xue H, Wang Q, Luo B, Seo H, Li F (2019) Content-aware trust propagation toward online review spam detection. J Data Inf Qual (JDIQ) 11(3):11. https://doi.org/10.1145/3305258

Li F, Huang M, Yang Y, Zhu X (2011) Learning to identify review spam. In: International joint conference on artificial intelligence (IJCAI 2011), pp 2488–2493

Li H, Chen Z, Mukherjee A, Liu B, Shao J (2015) Analyzing and detecting opinion spam on a large-scale dataset via temporal and spatial patterns. In: 9th international AAAI conference on web and social media (ICWSM 2015), AAAI, pp 634–637

Mukherjee A, Venkataraman V, Liu B, Glance N (2013) What yelp fake review filter might be doing?. In: 7th international AAAI conference on weblogs and social media, AAAI, pp 409–418

Pandey AC, Rajpoot DS (2019) Spam review detection using spiral cuckoo search clustering method. Evol Intell 12(2):147–164. https://doi.org/10.1007/s12065-019-00204-x

Barbado R, Araque O, Iglesias CA (2019) A framework for fake review detection in online consumer electronics retailers. Inf Process Manag 56(4):1234–1244. https://doi.org/10.1016/j. indmarman.2019.08.003

Rout JK, Dalmia A, Choo KKR, Bakshi S, Jena SK (2017) Revisiting semi-supervised learning for online deceptive review detection. IEEE Access 5:1319–1327. https://doi.org/ 10.1109/ ACCESS.2017.2655032

Crawford M, Khoshgoftaar TM, Prusa JD, Richter AN, Al Najada H (2015) Survey of review spam detection using machine learning techniques. J Big Data 2(1):1–23. https://doi.org/10. 1186/s40537-015-0029-9

Patel NA, Patel R (2018) A survey on fake review detection using machine learning techniques. In: 2018 4th international confer- ence on computing communication and automation (ICCCA), IEEE, pp 1–6. https://doi.org/10.1109/ ccaa.2018.8777594

Vidanagama DU, Silva TP, Karunananda AS (2019) Deceptive consumer review detection: a survey. Artif Intell Rev. https:// doi. org/10.1007/s10462-019-09697-5

Barushka, Aliaksandr, and Petr Hajek. "Review spam detection using word embeddings and deep neural networks." Artificial Intelligence Applications and Innovations: 15th IFIP WG 12.5 International Conference, AIAI 2019, Hersonissos, Crete, Greece, May 24–26, 2019, Proceedings 15. Springer International Publishing, 2019.

Li L, Qin B, Ren W, Liu T (2017) Document representation and feature combination for deceptive spam review detection. Neurocomputing 254:33–41. https://doi.org/10.1016/j.neucom.2016. 10.080

Ren Y, Ji D (2017) Neural networks for deceptive opinion spam detection: an empirical study. Inf Sci 385:213–224. https:// doi. org/10.1016/j.ins.2017.01.015

Zeng ZY, Lin JJ, Chen MS, Chen MH, Lan YQ, Liu JL (2019) A review structure based ensemble model for deceptive review spam. Information 10(7):243. https://doi.org/10.3390/ info10070243

Barbado R, Araque O, Iglesias CA (2019) A framework for fake review detection in online consumer electronics retailers. Inf Process Manag 56(4):1234–1244. https://doi.org/10.1016/j. indmarman.2019.08.003

Kennedy, Stefan, et al. "Fact or factitious? Contextualized opinion spam detection." arXiv preprint arXiv:2010.15296 (2020).

Barushka, Aliaksandr, and Petr Hajek. "Review spam detection using word embeddings and deep neural networks." Artificial Intelligence Applications and Innovations: 15th IFIP WG 12.5 International Conference, AIAI 2019, Hersonissos, Crete, Greece, May 24–26, 2019, Proceedings 15. Springer International Publishing, 2019.

48.L.Li,W.Ren,B.Qin,andT.Liu,’‘Learning document representation for deceptive opinion spam detection,’’ in Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. Nanjing, China: Springer, 2015, pp. 393–404.

49.X. Wang, K. Liu, and J. Zhao, ‘‘Handling cold-start problem in review spam detection by jointly embedding texts and behaviors,’’ in Proc. 55th Annu. Meeting Assoc. Comput. Linguistics, vol. 1, 2017, pp. 366–376.

50.W. Zhang, Y. Du, T. Yoshida, and Q. Wang, ‘‘DRI-RCNN: An approach to deceptive review identification using recurrent convolutional neural network,’’ Inf. Process. Manage., vol. 54, no. 4, pp. 576–592, 2018.

Chang, Wei-Cheng, et al. "Taming pretrained transformers for extreme multi-label text classification." Proceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining. 2020.

Jwa, Heejung, et al. "exbake: Automatic fake news detection model based on bidirectional encoder representations from transformers (bert)." Applied Sciences 9.19 (2019): 4062.

Büyüköz, Berfu, Ali Hürriyetoğlu, and Arzucan Özgür. "Analyzing ELMo and DistilBERT on socio-political news classification." Proceedings of the Workshop on Automated Extraction of Socio-political Events from News 2020. 2020.

Wang, Congcong, Paul Nulty, and David Lillis. "A comparative study on word embeddings in deep learning for text classification." Proceedings of the 4th International Conference on Natural Language Processing and Information Retrieval. 2020.

Salminen, J., Kandpal, C., Kamel, A. M., Jung, S., & Jansen, B. J. (2022). Creating and detecting fake reviews of online products. Journal of Retailing and Consumer Services, 64, 102771.

Qadir Mir, Abrar, Furqan Yaqub Khan, and Mohammad Ahsan Chishti. "Online Fake Review Detection Using Supervised Machine Learning And BERT Model." arXiv e-prints (2023): arXiv-2301.

Deshai, N., and B. Bhaskara Rao. "Deep Learning Hybrid Approaches to Detect Fake Reviews and Ratings." Journal of Scientific & Industrial Research 82.1 (2022): 120-127.

Downloads

Published

07.01.2024

How to Cite

Attri, V. ., Batra, I. ., Malik, A. ., & Kumar, V. . (2024). FRARBiLSTM-A Novel Fake Review Authentication Model Using Afinn and Roberta. International Journal of Intelligent Systems and Applications in Engineering, 12(10s), 135–144. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/4355

Issue

Section

Research Article