Advancement in Bangla Sentiment Analysis: A Comparative Study of Transformer-Based and Transfer Learning Models for E-commerce Sentiment Classification
Downloads
Background: As a direct result of the Internet's expansion, the quantity of information shared by Internet users across its numerous platforms has increased. Sentiment analysis functions at a higher level when there are more available perspectives and opinions. However, the lack of labeled data significantly complicates sentiment analysis utilizing Bangla natural language processing (NLP). In recent years, nevertheless, due to the development of more effective deep learning models, Bangla sentiment analysis has improved significantly.
Objective: This article presents a curated dataset for Bangla e-commerce sentiment analysis obtained solely from the "Daraz" platform. We aim to conduct sentiment analysis in Bangla for binary and understudied multiclass classification tasks.
Methods: Transfer learning (LSTM, GRU) and Transformers (Bangla-BERT) approaches are compared for their effectiveness on our dataset. To enhance the overall performance of the models, we fine-tuned them.
Results: The accuracy of Bangla-BERT was highest for both binary and multiclass sentiment classification tasks, with 94.5% accuracy for binary classification and 88.78% accuracy for multiclass sentiment classification.
Conclusion: Our proposed method performs noticeably better classifying multiclass sentiments in Bangla than previous deep learning techniques.
Keywords: Bangla-BERT, Deep Learning, E-commerce, NLP, Sentiment Analysis
W. Medhat, A. Hassan, and H. Korashy, "Sentiment analysis algorithms and applications: A survey," Ain Shams engineering journal, vol. 5, no. 4, pp. 1093-1113, 2014.
C. O. Alm, D. Roth, and R. Sproat, "Emotions from text: machine learning for text-based emotion prediction," in Proceedings of human language technology conference and conference on empirical methods in natural language processing, pp. 579-586, 2005.
A. Adak, B. Pradhan, and N. Shukla, "Sentiment analysis of customer reviews of food delivery services using deep learning and explainable artificial intelligence: Systematic review," Foods, vol. 11, no. 10, 1500, 2022.
A. Iqbal, R. Amin, J. Iqbal, R. Alroobaea, A. Binmahfoudh, and M. Hussain, "Sentiment Analysis of Consumer Reviews Using Deep Learning," Sustainability, vol. 14, no. 17, 10844, 2022.
S. Zulfiker, A. Chowdhury, D. Roy, S. Datta, and S. Momen, "Bangla E-Commerce Sentiment Analysis Using Machine Learning Approach," in 4th International Conference on Sustainable Technologies for Industry 4.0 (STI), pp. 1-5, 2022.
M.J. Hossain, D.D. Joy, S. Das, and R. Mustafa, "Sentiment Analysis on Reviews of E-commerce Sites Using Machine Learning Algorithms," in International Conference on Innovations in Science, Engineering and Technology (ICISET), pp. 522-527, 2022.
K.A. Hasan, S. Islam, G. M.E. Elahi, and M.N. Izhar, "Sentiment recognition from Bangla text," in Technical Challenges and Design Issues in Bangla Language Processing, pp. 315-327, 2013.
O. Sen et al., "Bangla Natural Language Processing: A comprehensive analysis of classical, machine learning, and deep learning based methods," IEEE Access, vol. 10, pp. 38999-39044, 2022.
N.R. Bhowmik, M. Arifuzzaman, and M.R.H. Mondal, "Sentiment analysis on Bangla text using extended lexicon dictionary and deep learning algorithms," Array, vol. 13, 100123, 2022.
M.R. Khan, S.N. Rahmatullah, M.F. Islam, A.R.M. Kamal, and M.A. Hossain, "Sentiment analysis of COVID-19 vaccination in Bangla language with code-mixed text from social media," in 12th International Conference on Electrical and Computer Engineering (ICECE), pp. 76-79, 2022.
M.H. Alam, M.M. Rahoman, and M.A.K. Azad, "Sentiment analysis for Bangla sentences using convolutional neural network," in 20th International Conference of Computer and Information Technology (ICCIT), pp. 1-6, 2017.
A. Hassan, M.R. Amin, A.K. Al Azad, and N. Mohammed, "Sentiment analysis on bangla and Romanized Bangla text using deep recurrent models," in International Workshop on Computational Intelligence (IWCI), pp. 51-56, 2016.
E. Hossain, O. Sharif, M.M. Hoque, and I.H. Sarker, "Sentilstm: a deep learning approach for sentiment analysis of restaurant reviews," in International Conference on Hybrid Intelligent Systems, pp. 193-203, 2020.
M.I.H. Junaid, F. Hossain, U.S. Upal, A. Tameem, A. Kashim, and A. Fahmin, "Bangla Food Review Sentimental Analysis using Machine Learning," in IEEE 12th Annual Computing and Communication Workshop and Conference (CCWC), pp. 0347-0353, 2022.
A. Ahmed and M.A. Yousuf, "Sentiment analysis on Bangla text using long short-term memory (LSTM) recurrent neural network," in Proceedings of International Conference on Trends in Computational and Cognitive Engineering: Proceedings of TCCE 2020, pp. 181-192, 2020.
M.F. Wahid, M.J. Hasan, and M.S. Alom, "Cricket sentiment analysis from Bangla text using recurrent neural network with long short term memory model," in International Conference on Bangla Speech and Language Processing (ICBSLP), pp. 1-4, 2019.
N.J. Prottasha et al., "Transfer learning for sentiment analysis using BERT based supervised fine-tuning," Sensors, vol. 22, no. 11, 4157, 2022.
M. Kowsher, A.A. Sami, N.J. Prottasha, M.S. Arefin, P.K. Dhar, and T. Koshiba, "Bangla-BERT: transformer-based efficient model for transfer learning and language understanding," IEEE Access, vol. 10, pp. 91855-91870, 2022.
T. Alam, A. Khan, and F. Alam, "Bangla text classification using transformers," arXiv preprint arXiv:.04446, 2020.
K.I. Islam, M.S. Islam, and M.R. Amin, "Sentiment analysis in Bengali via transfer learning using multilingual BERT," in 23rd International Conference on Computer and Information Technology (ICCIT), pp. 1-5, 2020.
T. Mikolov, K. Chen, G. Corrado, and J. Dean, "Efficient estimation of word representations in vector space," arXiv preprint arXiv:. 2013.
M. Al-Amin, M.S. Islam, and S.D. Uzzal, "Sentiment analysis of Bengali comments with Word2Vec and sentiment information of words," in International Conference on Electrical, Computer and Communication Engineering (ECCE), pp. 186-190, 2017.
Y. Santur, "Sentiment analysis based on gated recurrent unit," in International Artificial Intelligence and Data Processing Symposium (IDAP), pp. 1-5, 2019.
K. Cho et al., "Learning phrase representations using RNN encoder-decoder for statistical machine translation," arXiv preprint arXiv:. 2014.
G. Murthy, S. R. Allu, B. Andhavarapu, M. Bagadi, and M. Belusonti, "Text based sentiment analysis using LSTM," Int. J. Eng. Res. Tech. Res, vol. 9, no. 5, pp. 299-303, 2020.
Z. Jin, Y. Yang, and Y. Liu, "Stock closing price prediction based on sentiment analysis and LSTM," Neural Computing and Applications, vol. 32, pp. 9713-9729, 2020.
R. Rahman, S. A. Hasan, and F. A. Rubel, "Identifying Sentiment and Recognizing Emotion from Social Media Data in Bangla Language," in 12th International Conference on Electrical and Computer Engineering (ICECE), pp. 36-39, 2022.
M.M. Abdelgwad, T.H.A. Soliman, A.I. Taloba, and M.F. Farghaly, "Arabic aspect based sentiment analysis using bidirectional GRU based models," Journal of King Saud University-Computer and Information Sciences, vol. 34, no. 9, pp. 6652-6662, 2022.
A.A. Sharfuddin, M. N. Tihami, and M. S. Islam, "A deep recurrent neural network with bilstm model for sentiment classification," in International conference on Bangla speech and language processing (ICBSLP), pp. 1-4, 2018.
A. Bhattacharjee et al., "BanglaBERT: Language model pre-training and benchmarks for low-resource language understanding evaluation in Bangla," arXiv preprint arXiv:.00204, 2021.
J.D. M.W.C. Kenton and L.K. Toutanova, "BERT: Pre-training of deep bidirectional transformers for language understanding," in Proceedings of naacL-HLT, 2019.
A. Zhao and Y. Yu, "Knowledge-enabled BERT for aspect-based sentiment analysis," Knowledge-Based Systems, vol. 227, 107220, 2021.
G.I. Diaz, A. Fokoue-Nkoutche, G. Nannicini, and H. Samulowitz, "An effective algorithm for hyperparameter optimization of neural networks," IBM Journal of Research and Development, vol. 61, no. 4/5, pp. 1-9, 2017.
R. Ahuja, A. Chug, S. Kohli, S. Gupta, and P. Ahuja, "The impact of features extraction on the sentiment analysis," Procedia Computer Science, vol. 152, pp. 341-348, 2019.
E.A.E. Lucky, M.M.H. Sany, M. Keya, S.A. Khushbu, and S.R.H. Noori, "An attention on sentiment analysis of child abusive public comments towards bangla text and ml," in 12th international conference on computing communication and networking technologies (ICCCNT), pp. 1-6, 2021.
M. Rahman, M.R.A. Talukder, L.A. Setu, and A.K. Das, "A dynamic strategy for classifying sentiment from Bengali text by utilizing word2vector model," Journal of Information Technology Research, vol. 15, no. 1, pp. 1-17, 2022.
Copyright (c) 2023 The Authors. Published by Universitas Airlangga.

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
All accepted papers will be published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain copyright and grant the journal right of first publication. CC-BY Licenced means lets others to Share (copy and redistribute the material in any medium or format) and Adapt (remix, transform, and build upon the material for any purpose, even commercially).