Sentiment Analysis towards Cryptocurrency and NFT in Bahasa Indonesia for Twitter Large Amount Data Using BERT

Authors

  • Mochammad Haldi Widianto Computer Science Department, School of Computer Science, Bina Nusantara University, Bandung Campus, Jakarta, Indonesia, 11480 https://orcid.org/0000-0001-8722-9868
  • Yhudi Cornelius Computer Science Department, School of Computer Science, Bina Nusantara University, Bandung Campus, Jakarta, Indonesia, 11480

Keywords:

Bidirectional Encoder Representation from Transformers (BERT), Cryptocurrency, Non-Fungible Token (NFT), Sentiment Analysis

Abstract

Sentiment analysis is one part of Natural Language Processing (NLP), where the system can understand the form of text. Sentiment Analysis itself requires an Artificial Intelligence (AI) algorithm. One of them is the Bidirectional Encoder Representation from Transformers (BERT), which can assess positive or negative sentiment, especially when discussing "Cryptocurrency" and "Non-Fungible Token (NFT)" because in recent years. The discussion on these two topics has been widely discussed. On social media such as Twitter. In this study, a BERT model was created to assess sentiment analysis on "Cryptocurrency" and "NFT", by utilizing data crawling and pre-processing using Rapidminer (student version) with 86% accuracy and 87% precision. The results were obtained by selecting suitable hyperparameters such as learning rate 1e-7, epoch 5, and batch size 32. The results also increased from 2% accuracy to 3% precision. Further research needs to be done to improve the BERT model, not just an increase in the pre-processing part. In further research, the authors suggest combining several models or updating the pre-processing.

Downloads

Download data is not yet available.

References

Z. Shahbazi and Y. C. Byun, “Machine Learning-Based Analysis of Cryptocurrency Market Financial Risk Management,” IEEE Access, vol. 10, pp. 37848–37856, 2022, doi: 10.1109/ACCESS.2022.3162858.

Z. Shahbazi and Y. C. Byun, “Improving the cryptocurrency price prediction performance based on reinforcement learning,” IEEE Access, vol. 9, pp. 162651–162659, 2021, doi: 10.1109/ACCESS.2021.3133937.

A. Ghosh, S. Gupta, A. Dua, and N. Kumar, “Security of Cryptocurrencies in blockchain technology: State-of-art, challenges and future prospects,” Journal of Network and Computer Applications, vol. 163, p. 102635, 2020, doi: https://doi.org/10.1016/j.jnca.2020.102635.

A. Narayanan, J. Bonneau, E. Felten, A. Miller, and S. Goldfeder, Bitcoin and cryptocurrency technologies: a comprehensive introduction. Princeton University Press, 2016.

H. R. Hasan et al., “Incorporating Registration, Reputation, and Incentivization Into the NFT Ecosystem,” IEEE Access, vol. 10, pp. 76416–76433, 2022, doi: 10.1109/ACCESS.2022.3192388.

G. Y. Pratomo, “Kripto Lokal Berpotensi Dongrak Ekonomi Digital Indonesia,” Feb. 2022. https://www.liputan6.com/crypto/read/4897884/kripto-lokal-berpotensi-dongrak-ekonomi-digital-indonesia (accessed Jun. 11, 2022).

A. Dwivedi, R. P. Pant, S. Pandey, and K. Kumar, “Internet of Things’ (IoT’s) Impact on Decision Oriented Applications of Big Data Sentiment Analysis,” in 2018 3rd International Conference On Internet of Things: Smart Innovation and Usages (IoT-SIU), 2018, pp. 1–10. doi: 10.1109/IOT-SIU.2018.8519922.

Y. Zhuang and J. Kim, “A bert-based multi-criteria recommender system for hotel promotion management,” Sustainability (Switzerland), vol. 13, no. 14, Jul. 2021, doi: 10.3390/su13148039.

B. Muller, B. Sagot, D. Seddah, and B. Muller BenoˆıtBenoˆıt Sagot Djamé Seddah, “Enhancing BERT for Lexical Normalization,” in The 5th Workshop on Noisy User-generated Text (WNUT), 2019. [Online]. Available: https://gitlab.inria.fr/bemuller/bert

E. Martínez-Cámara, M. T. Martín-Valdivia, L. A. Ureña-López, and A. R. Montejo-Ráez, “Sentiment analysis in Twitter,” Nat Lang Eng, vol. 20, no. 1, pp. 1–28, Jan. 2014, doi: 10.1017/S1351324912000332.

H. M. Zin, N. Mustapha, M. A. A. Murad, and N. M. Sharef, “Term Weighting Scheme Effect in Sentiment Analysis of Online Movie Reviews,” Adv Sci Lett, vol. 24, no. 2, pp. 933–937, Mar. 2018, doi: 10.1166/asl.2018.10661.

M. Bouazizi and T. Otsuki, “A Study on Text Mining on Twitter: Identifying Opinion and Detecting Different Forms of Speech Using Writing Patterns,” Tokyo, 2019.

L. H. Son, A. Kumar, S. R. Sangwan, A. Arora, A. Nayyar, and M. Abdel-Basset, “Sarcasm detection using soft attention-based bidirectional long short-term memory model with convolution network,” IEEE Access, vol. 7, pp. 23319–23328, 2019, doi: 10.1109/ACCESS.2019.2899260.

M. H. Widianto, Y. Heryadi, W. Suparta, and A. Wibowo, “Sentiment Analysis of Cooking Oil using Bidirectional Encoder Representations from Transformers,” in 2022 5th International Conference on Information and Communications Technology (ICOIACT), 2022, pp. 110–115. doi: 10.1109/ICOIACT55506.2022.9971861.

M. v Mäntylä, D. Graziotin, and M. Kuutila, “The evolution of sentiment analysis—A review of research topics, venues, and top cited papers,” Comput Sci Rev, vol. 27, pp. 16–32, 2018, doi: https://doi.org/10.1016/j.cosrev.2017.10.002.

Z. Li, R. Li, and G. Jin, “Sentiment Analysis of Danmaku Videos Based on Naïve Bayes and Sentiment Dictionary,” IEEE Access, vol. 8, pp. 75073–75084, 2020, doi: 10.1109/ACCESS.2020.2986582.

K. Bloom, N. Garg, and S. Argamon, “Extracting Appraisal Expressions,” in HLT/NAACL, Aug. 2007, pp. 308–315.

E. Cambria, “Affective Computing and Sentiment Analysis,” IEEE Intell Syst, vol. 31, no. 2, pp. 102–107, Mar. 2016, doi: 10.1109/MIS.2016.31.

S. M. Liu and J.-H. Chen, “A multi-label classification based approach for sentiment classification,” Expert Syst Appl, vol. 42, no. 3, pp. 1083–1093, 2015, doi: https://doi.org/10.1016/j.eswa.2014.08.036.

S. Sun, C. Luo, and J. Chen, “A review of natural language processing techniques for opinion mining systems,” Information Fusion, vol. 36, pp. 10–25, 2017, doi: https://doi.org/10.1016/j.inffus.2016.10.004.

M. Wankhade, A. C. S. Rao, and C. Kulkarni, “A survey on sentiment analysis methods, applications, and challenges,” Artif Intell Rev, 2022, doi: 10.1007/s10462-022-10144-1.

A. A. Lutfi, A. E. Permanasari, and S. Fauziati, “Sentiment Analysis in the Sales Review of Indonesian Marketplace by Utilizing Support Vector Machine,” Journal of Information Systems Engineering and Business Intelligence, vol. 4, no. 1, pp. 57–64, Apr. 2018, doi: 10.20473/jisebi.4.1.57-64.

A. Kumar and A. Jaiswal, “Systematic literature review of sentiment analysis on Twitter using soft computing techniques,” Concurr Comput, vol. 32, no. 1, p. e5107, 2020, doi: https://doi.org/10.1002/cpe.5107.

E. Cambria, S. Poria, A. Gelbukh, and M. Thelwall, “Sentiment Analysis Is a Big Suitcase,” IEEE Intell Syst, vol. 32, no. 6, pp. 74–80, Nov. 2017, doi: 10.1109/MIS.2017.4531228.

A. Ligthart, C. Catal, and B. Tekinerdogan, “Systematic reviews in sentiment analysis: a tertiary study,” Artif Intell Rev, vol. 54, no. 7, pp. 4997–5053, 2021, doi: 10.1007/s10462-021-09973-3.

J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding,” in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019, pp. 4171–4186. doi: 10.18653/v1/N19-1423.

A. Vaswani et al., “Attention is All you Need,” in Advances in Neural Information Processing Systems, 2017, vol. 30. [Online]. Available: https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf

L. Yang, Y. Li, J. Wang, and R. S. Sherratt, “Sentiment Analysis for E-Commerce Product Reviews in Chinese Based on Sentiment Lexicon and Deep Learning,” IEEE Access, vol. 8, pp. 23522–23530, 2020, doi: 10.1109/ACCESS.2020.2969854.

A. P. Widyassari, A. Affandy, E. Noersasongko, A. Z. Fanani, A. Syukur, and R. S. Basuki, “Literature Review of Automatic Text Summarization: Research Trend, Dataset and Method,” in 2019 International Conference on Information and Communications Technology (ICOIACT), 2019, pp. 491–496. doi: 10.1109/ICOIACT46704.2019.8938454.

A. Vaswani et al., “Attention Is All You Need,” 31st Conference on Neural Information Processing Systems, pp. 1–11, 2017, doi: 10.1109/2943.974352.

B. Siswanto, “Sentiment Analysis in Indonesian on Jakarta Culinary as A Recommender System,” in 2021 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI), Dec. 2021, pp. 46–50. doi: 10.1109/ISRITI54043.2021.9702772.

B. Siswanto, F. L. Gaol, B. Soewito, and H. L. H. S. Warnars, “Sentiment Analysis of Big Cities on The Island of Java in Indonesia from Twitter Data as A Recommender System,” in 2021 International Conference on Informatics, Multimedia, Cyber and Information System (ICIMCIS, Oct. 2021, pp. 124–128. doi: 10.1109/ICIMCIS53775.2021.9699147.

Y. Lu, “Deep neural networks and fraud detection,” 2017.

A. S. A. Sukor, A. Zakaria, N. A. Rahim, L. M. Kamarudin, and H. Nishizaki, “Abnormality detection approach using deep learning models in smart home environments,” in ACM International Conference Proceeding Series, Apr. 2019, pp. 22–27. doi: 10.1145/3330180.3330185.

W. Hastomo, A. S. B. Karno, N. Kalbuana, A. Meiriki, and Sutarno, “Characteristic Parameters of Epoch Deep Learning to Predict Covid-19 Data in Indonesia,” J Phys Conf Ser, vol. 1933, no. 1, p. 12050, Jun. 2021, doi: 10.1088/1742-6596/1933/1/012050.

I. Kandel and M. Castelli, “The effect of batch size on the generalizability of the convolutional neural networks on a histopathology dataset,” ICT Express, vol. 6, no. 4, pp. 312–315, 2020, doi: 10.1016/j.icte.2020.04.010.

K. M. Ting, “Confusion Matrix,” in Encyclopedia of Machine Learning and Data Mining, G. I. Sammut Claude and Webb, Ed. Boston, MA: Springer US, 2017, p. 260. doi: 10.1007/978-1-4899-7687-1_50.

M. Kang, K. H. Lee, and Y. Lee, “Filtered bert: Similarity filter-based augmentation with bidirectional transfer learning for protected health information prediction in clinical documents,” Applied Sciences, vol. 11, no. 8, 2021, doi: 10.3390/app11083668.

Pre-Training and Fine-Tuning BERT model [26].

Downloads

Published

03.02.2023

How to Cite

Haldi Widianto , M. ., & Cornelius , Y. . (2023). Sentiment Analysis towards Cryptocurrency and NFT in Bahasa Indonesia for Twitter Large Amount Data Using BERT. International Journal of Intelligent Systems and Applications in Engineering, 11(1), 303–309. Retrieved from https://ijisae.org/index.php/IJISAE/article/view/2539

Issue

Section

Research Article