Vader Lexicon and Support Vector Machine Algorithm to Detect Customer Sentiment Orientation
Downloads
Background: The concept of customer orientation, which is based on a set of fundamental beliefs that prioritize the interests of the customer, requires companies to detect these interests in order to maintain a high level of quality in their products or services. Furthermore, there are several indicators of customer orientation, and one of them is their opinion or taste, which provides valuable feedback for businesses. With the rapid development of social media, customers can express emotions, thoughts, and opinions about services or products that may not be easily conveyed in the real world.
Objective: The objective of this study is to detect customer orientation towards product or service quality, as expressed in online or social media. Additionally, the study showcases the novelty and superiority of the annotation process used for detecting customer orientation classifications.
Methods: This study employs a method to compare the classification performance of the Vader lexicon annotation process with manual annotation. To accomplish this, a dataset from the Amazon website will be analyzed and classified using the Support Vector Machine algorithm. The objective of this method is to determine the level of customer orientation present within the dataset. To evaluate the effectiveness of the Vader lexicon, the study will compare the results of manual and automatic data annotation.
Results: The results showed that customer orientation towards product or service quality has a predominantly positive value, comprising up to 76% of the total responses analyzed.
Conclusion: The findings demonstrate that using Vader in the annotation process results in superior accuracy values compared to manual annotation. Specifically, the accuracy value increased from 86% to 88.57%, indicating that Vader could be a reliable tool for annotating text. Therefore, future studies should consider using Vader as a classifier or integrating it into the annotation process to further enhance its performance.
Keywords: Classification, Customer, Orientation, Text analysis, Vader lexicon,
H. Park and W.-M. Hur, "Customer Showrooming Behavior, Customer Orientation, and Emotional Labor: Sales Control as a Moderator,” Journal of Retailing and Consumer Services, vol. 72, pp. 1–10, 2023, doi: https://doi.org/10.1016/j.jretconser.2023.103268.
M. R. Jalilvand, "The Effect of Innovativeness and Customer-Oriented Systems on Performance in The Hotel Industry of Iran,” Journal of Science and Technology Policy Management, vol. 8, no. 1, pp. 43–61, 2017, doi: 10.1108/JSTPM-08-2016-0018.
R. T. Frambach, P. C. Fiss, and P. T. M. Ingenbleek, "How Important is Customer Orientation for Firm Performance? A Fuzzy Set Analysis of Orientations, Strategies, and Environments,” J Bus Res, vol. 69, no. 4, pp. 1428–1436, Apr. 2016, doi: 10.1016/j.jbusres.2015.10.120.
M. Nilashi et al., "Big Social Data and Customer Decision Making in Vegetarian Restaurants: A Combined Machine Learning Method,” Journal of Retailing and Consumer Services, vol. 62, Sep. 2021, doi: 10.1016/j.jretconser.2021.102630.
W. Kwon, M. Lee, and K. J. Back, "Exploring the Underlying Factors of Customer Value in Restaurants: A Machine Learning Approach,” Int J Hosp Manag, vol. 91, Oct. 2020, doi: 10.1016/j.ijhm.2020.102643.
Y. Bian, R. Ye, J. Zhang, and X. Yan, "Customer Preference Identification from Hotel Online Reviews: A Neural Network Based Fine-Grained Sentiment Analysis,” Comput Ind Eng, vol. 172, Oct. 2022, doi: 10.1016/j.cie.2022.108648.
J. Zhang, A. Zhang, D. Liu, and Y. Bian, "Customer Preferences Extraction for Air Purifiers based on Fine-Grained Sentiment Analysis of Online Reviews,” Knowl Based Syst, vol. 228, p. 107259, Sep. 2021, doi: 10.1016/j.knosys.2021.107259.
J. Zhang, X. Lu, and D. Liu, "Deriving Customer Preferences for Hotels based on Aspect-Level Sentiment Analysis of Online Reviews,” Electron Commer Res Appl, vol. 49, Sep. 2021, doi: 10.1016/j.elerap.2021.101094.
P. Ray and A. Chakrabarti, "Twitter Sentiment Analysis for Product Review Using Lexicon Method,” in International Conference on Data Management, Analytics and Innovation, 2017, pp. 211–216. doi: 10.1109/ICDMAI.2017.8073512.
J. Mahilraj, G. Tigistu, and S. Tumsa, "Text Preprocessing Method on Twitter Sentiment Analysis using Machine Learning,” International Journal of Innovative Technology and Exploring Engineering, vol. 9, no. 12, pp. 233–240, Sep. 2020, doi: 10.35940/ijitee.K7771.0991120.
D. H. Abd, A. R. Abbas, and A. T. Sadiq, "Analyzing Sentiment System to Specify Polarity by Lexicon-Based,” Bulletin of Electrical Engineering and Informatics, vol. 10, no. 1, pp. 283–289, Feb. 2021, doi: 10.11591/eei.v10i1.2471.
A. Alsayat, "Improving Sentiment Analysis for Social Media Applications Using an Ensemble Deep Learning Language Model,” Arab J Sci Eng, vol. 47, no. 2, pp. 2499–2511, Feb. 2022, doi: 10.1007/s13369-021-06227-w.
M. Işik and H. Dağ, "The Impact of Text Preprocessing on the Prediction of Review Ratings,” Turkish Journal of Electrical Engineering and Computer Sciences, vol. 28, no. 3, pp. 1405–1421, 2020, doi: 10.3906/elk-1907-46.
V. Nurcahyawati and Z. Mustaffa, "Improving Sentiment Reviews Classification Performance using Support Vector Machine-Fuzzy Matching Algorithm,” Bulletin of Electrical Engineering and Informatics, vol. 12, no. 3, pp. 1817–1824, Jun. 2023, doi: 10.11591/eei.v12i3.4830.
R. H. Muhammadi, T. G. Laksana, and A. B. Arifa, "Combination of Support Vector Machine and Lexicon-Based Algorithm in Twitter Sentiment Analysis,” Jurnal Ilmu Komputer dan Informatika, vol. 8, no. 1, pp. 59–71, 2022, doi: https://doi.org/10.23917/khif.v8i1.15213.
F. Firmansyah et al., "Comparing Sentiment Analysis of Indonesian Presidential Election 2019 with Support Vector Machine and K-Nearest Neighbor Algorithm,” in International Conference on Computing, Engineering, and Design, Institute of Electrical and Electronics Engineers Inc., Oct. 2020. doi: 10.1109/ICCED51276.2020.9415767.
S. Shevira, I. M. A. D. Suarjaya, and P. W. Buana, "Lexicon and Naive Bayes Algorithms to Detect Mental Health Situations from Twitter Data,” Journal of Information Systems Engineering and Business Intelligence, vol. 8, no. 2, pp. 142–148, 2022, doi: 10.20473/jisebi.8.2.
N. M. Sham and A. Mohamed, "Climate Change Sentiment Analysis Using Lexicon, Machine Learning and Hybrid Approaches,” Sustainability (Switzerland), vol. 14, no. 8, Apr. 2022, doi: 10.3390/su14084723.
W. Zhao et al., "Weakly-Supervised Deep Embedding for Product Review Sentiment Analysis,” IEEE Trans Knowl Data Eng, vol. 30, no. 1, pp. 185–197, Jan. 2018, doi: 10.1109/TKDE.2017.2756658.
A. Borg and M. Boldt, "Using VADER Sentiment and SVM for Predicting Customer Response Sentiment,” Expert Syst Appl, vol. 162, Dec. 2020, doi: 10.1016/j.eswa.2020.113746.
M. S. Hossain, M. F. Rahman, M. K. Uddin, and M. K. Hossain, "Customer Sentiment Analysis and Prediction of Halal Restaurants Using Machine Learning Approaches,” Journal of Islamic Marketing, 2022, doi: 10.1108/JIMA-04-2021-0125.
V. M. C. Sagarino, J. I. M. Montejo, and A. M. Ceniza-Canillo, "Sentiment Analysis of Product Reviews as Customer Recommendations in Shopee Philippines Using Hybrid Approach,” in International Conference on Information Technology and Digital Applications, Institute of Electrical and Electronics Engineers Inc., 2022. doi: 10.1109/ICITDA55840.2022.9971379.
A. G. Budianto, B. Wirjodirdjo, I. Maflahah, and D. Kurnianingtyas, "Sentiment Analysis Model for KlikIndomaret Android App During Pandemic Using Vader and Transformers NLTK Library,” in IEEE International Conference on Industrial Engineering and Engineering Management, IEEE Computer Society, 2022, pp. 423–427. doi: 10.1109/IEEM55944.2022.9989577.
J. N. Mindoro, M. A. F. Malbog, M. D. S. Nipas, J. A. B. Susa, A. G. Acoba, and J. S. Gulmatico, "Sentiment Analysis in Customer Experience in Philippine Courier Delivery Services using VADER Algorithm Thru Chatbot Interviews,” in International Conference on Power, Control and Computing Technologies, Institute of Electrical and Electronics Engineers Inc., 2022. doi: 10.1109/ICPC2T53885.2022.9777007.
D. Effrosynidis, A. I. Karasakalidis, G. Sylaios, and A. Arampatzis, "The Climate Change Twitter Dataset,” Expert Syst Appl, vol. 204, Oct. 2022, doi: 10.1016/j.eswa.2022.117541.
D. Marutho, Muljono, S. Rustad, and Purwanto, "Sentiment Analysis Optimization Using Vader Lexicon on Machine Learning Approach,” in International Seminar on Intelligent Technology and Its Applications: Advanced Innovations of Electrical Systems for Humanity, Institute of Electrical and Electronics Engineers Inc., 2022, pp. 98–103. doi: 10.1109/ISITIA56226.2022.9855341.
M. K. Bashar, "A Hybrid Approach to Explore Public Sentiments on COVID-19,” SN Comput Sci, vol. 3, no. 3, pp. 1–19, May 2022, doi: 10.1007/s42979-022-01112-1.
M. S. Hossen and N. R. Dev, "An Improved Lexicon Based Model for Efficient Sentiment Analysis on Movie Review Data,” Wirel Pers Commun, vol. 120, no. 1, pp. 535–544, Sep. 2021, doi: 10.1007/s11277-021-08474-4.
M. S. Oliveira, A. Mourthe, and M. C. Duque, "Extracting events from Daily Drilling Reports using Fuzzy String Matching,” The APPEA Journal, vol. 62, no. 2, pp. S158–S161, May 2022, doi: 10.1071/aj21118.
H. Kyung Yu and J. Gon Kim, "Indoor Positioning by Weighted Fuzzy Matching in Lifi Based Hospital Ward Environment,” J Phys Conf Ser, vol. 1487, no. 1, Apr. 2020, doi: 10.1088/1742-6596/1487/1/012010.
S. Abdulmalek, S. AL-Hagree, M. Alsurori, M. Hadwan, A. Aqlan, and F. Alqasemi, "Levenstein's Algorithm On English and Arabic: A Survey,” in International Conference of Technology, Science and Administration (ICTSA), Taiz, Yemen: IEEE, 2021. doi: 10.1109/ICTSA52017.2021.9406547.
D. N. de Oliveira and L. H. de C. Merschmann, "Joint Evaluation of Preprocessing Tasks with Classifiers for Sentiment Analysis in Brazilian Portuguese Language,” Multimed Tools Appl, 2021, doi: 10.1007/s11042-020-10323-8.
E. Araslanov, E. Komotskiy, and E. Agbozo, "Assessing the Impact of Text Preprocessing in Sentiment Analysis of Short Social Network Messages in the Russian Language,” in International Conference on Data Analytics for Business and Industry: Way Towards a Sustainable Economy, ICDABI, Institute of Electrical and Electronics Engineers Inc., Oct. 2020. doi: 10.1109/ICDABI51230.2020.9325654.
M. K. Delimayanti, R. Sari, M. Laya, M. R. Faisal, Pahrul, and R. F. Naryanto, "The Effect of Pre-Processing on the Classification of Twitter's Flood Disaster Messages using Support Vector Machine Algorithm,” in Proceedings of ICAE 2020 - 3rd International Conference on Applied Engineering, Institute of Electrical and Electronics Engineers Inc., Oct. 2020. doi: 10.1109/ICAE50557.2020.9350387.
C. J. Hutto and E. Gilbert, "VADER: A Parsimonious Rule-based Model for Sentiment Analysis of Social Media Text,” in International AAAI Conference on Weblogs and Social Media, 2014, pp. 216–225. doi: https://doi.org/10.1609/icwsm.v8i1.14550.
Y. Asri, W. N. Suliyanti, D. Kuswardani, and M. Fajri, "Pelabelan Otomatis Lexicon Vader dan Klasifikasi Naive Bayes dalam Menganalisis Sentimen Data Ulasan PLN Mobile,” PETIR: Jurnal Pengkajian dan Penerapan Teknik Informatika, vol. 15, no. 2, pp. 264–275, Nov. 2022, doi: 10.33322/petir.v15i2.1733.
Copyright (c) 2023 The Authors. Published by Universitas Airlangga.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
All accepted papers will be published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain copyright and grant the journal right of first publication. CC-BY Licenced means lets others to Share (copy and redistribute the material in any medium or format) and Adapt (remix, transform, and build upon the material for any purpose, even commercially).