Sentiment Analysis Towards Kartu Prakerja Using Text Mining with Support Vector Machine and Radial Basis Function Kernel
Downloads
Background: The introduction of Kartu Prakerja (Pre-employment Card) Programme, henceforth KPP, which was claimed to have launched in order to improve the quality of workforce, spurred controversy among members of the public. The discussion covered the amount of budget, the training materials and the operations brought out various reactions. Opinions could be largely divided into groups: the positive and the negative sentiments.
Objective: This research aims to propose an automated sentiment analysis that focuses on KPP. The findings are expected to be useful in evaluating the services and facilities provided.
Methods: In the sentiment analysis, Support Vector Machine (SVM) in text mining was used with Radial Basis Function (RBF) kernel. The data consisted of 500 tweets from July to October 2020, which were divided into two sets: 80% data for training and 20% data for testing with five-fold cross validation.
Results: The results of descriptive analysis show that from the total 500 tweets, 60% were negative sentiments and 40% were positive sentiments. The classification in the testing data show that the average accuracy, sensitivity, specificity, negative sentiment prediction and positive sentiment prediction values were 85.20%; 91.68%; 75.75%; 85.03%; and 86.04%, respectively.
Conclusion: The classification results show that SVM with RBF kernel performs well in the opinion classification. This method can be used to understand similar sentiment analysis in the future. In KPP case, the findings can inform the stakeholders to improve the programmes in the future.
Keywords: Kartu Prakerja, Sentiment Analysis, Support Vector Machine, Text Mining, Radial Basis Function
M. A. Iswara, "Indonesia Advances Preemployment Card Launch to Friday to Anticipate Virus Impacts”, Retrieved from https://www.thejakartapost.com/news/2020/03/18/indonesia-advances-preemployment-card-launch-to-friday-to-anticipate-virus-impacts.html, 2020.
M. A. Iswara, "1.2 Million Indonesian Workers Furloughed, Laid off as COVID-19 Crushes Economy”, Retrieved from https://www.thejakartapost.com/news/2020/04/09/worker-welfare-at-stake-as-covid-19-wipes-out-incomes.html, 2020.
A. W. Akhlas, "Millions to Lose Jobs, Fall into Poverty as Indonesia Braces for Recession”, Retrieved from https://www.thejakartapost.com/news/2020/04/14/millions-to-lose-jobs-fall-into-poverty-as-indonesia-braces-for-recession.html , 2020.
J. Goldsmith, "Twitter Active Daily Users Surge 34% To Record 186M In Q2, Revenue Dips, CEO Jack Dorsey Apologizes For Breach”, Retrieved from https://deadline.com/2020/07/twitter-active-daily-users-surge-34-to-186m-q2-revenue-dips-19-1202992835/, 2020.
N. Zanini, and V. Dhawan, "Text Mining: An Introduction to Theory and some Applications”, Research Matters: A Cambridge Assessment Publication, Issue 19, Pages 38–44, 2015.
M. Sugiyama, "Introduction to Statistical Machine Learning”,
Elsevier:Waltham, 2015.
R. Darnag, B. Minaoui, and M. Fakir, "QSAR Models for Prediction Study of HIV Protease Inhibitors Using Support Vector Machines, Neural Networks and Multiple Linear Regression”, Arabian Journal of Chemistry, Vol. 10, Supplement 1, Pages S600–S608, 2017.
R. Moraes, J. F. Valiati, and W. P. G. Neto, "Document-level sentiment classification: An empirical comparison between SVM and ANN”, Expert Systems with Applications, Vol. 40, Issue 2, Pages 621–633, 2013.
S. Liu, J. McGree, Z. Ge, and Y. Xie, "Computational and Statistical Methods for Analysing Big Data with Applications”, Academic Press:UK, 2016.
P. H. Prastyo, A. S. Sumi, A. W. Dian, A. E. Permanasari, "Tweets Responding to the Indonesian Government's Handling of COVID-19: Sentiment Analysis Using SVM with Normalized Poly Kernel”, Journal of Information Systems Engineering and Business Intelligence, Vol. 6, No. 2, Pages 112–122, 2020.
P. Dellia and A. Tjahyanto, "Tax Complaints Classification on Twitter Using Text Mining”, IPTEK, Journal of Science, Vol. 2, No. 1, Pages 11–15, 2017.
C. Shofiya, and S. Abidi, "Sentiment Analysis on COVID-19-Related Social Distancing in Canada Using Twitter Data”, International Journal of Environmental Research and Public Health, Vol. 18, Issue 5993, Pages 1–10, 2001.
R. Risnantoyo, A. Nugroho, and K. Mandara, "Sentiment Analysis on Corona Virus Pandemic Using Machine Learning Algorithm”, Journal of Informatics and Telecommunication Engineering, Vol. 4, No. 1, Pages 86–96, 2020.
G. Miner, J. Elder, A. Fast, T. Hill, R. Nisbet, and D. Delen, "Practical Text Mining and Statistical Analysis for Non-structured Text Data Applications”, Academic Press:USA , 2012.
R. Atenstaedt, and C. Singh, "Word Cloud Analysis of BJGP”, British Journal of General Practice, Vol. 62, Issue 596, Pages 148, 2012.
Sahana, "Advances in Computational Intelligence”. Springer:Switzerland, 2020.
R. Chopra, A. M. Godbole, N. Sadvilkar, M. B. Shah, S. Ghosh, and D. Gunning, "The Natural Language Processing Workshop”, Packt: Birmingham, 2020.
A. Tripathy, A. Agrawal, and S. K. Rath, "Classification of Sentimental Reviews Using Machine Laerning Techniques”, Procedia Computer Science, Vol. 57, Pages 821–829, 2015.
M. Awad, and R. Khanna, "Efficient Learning Machines : Theories, Concepts, Applications for Engineers and System Designers”, Apress:New York, 2015.
C. Cortes, and V. Vapnik, "Support-Vector Networks”, Machine Learning, Vol. 20, Pages 273–297, 1995.
A. Kowalczyk, "Support Vector Machine Succinctly”, Syncfusion:USA, 2017.
M. Asrol, P. Papilo, and F. E. Gunawan, "Support Vector Machine with K-fold Validation to Improve the Industry's Sustainability Performance Classification”, Procedia Computer Science, Vol. 179, Pages 854–862, 2021.
A. C. Rencher, "Methods of Multivariate Analysis Second Edition”, John Wiley & Sons:USA, 2003.
H. Hong, B. Pradhan, D. T. Bui, C. Xu, A. M. Youssef, and W. Chen, "Comparison of Four Kernel Functions Used in Support Vector Machines for Landslide Susceptibility Mapping: a Case Study at Suichuan Area (China)”, Geomatics, Natural Hazards snd Risk, Vol. 8, No. 2, Pages 544–569, 2017.
Authors who publish with this journal agree to the following terms:
All accepted papers will be published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain copyright and grant the journal right of first publication. CC-BY Licenced means lets others to Share (copy and redistribute the material in any medium or format) and Adapt (remix, transform, and build upon the material for any purpose, even commercially).