Data Mining Techniques in Handling Personality Analysis for Ideal Customers
Downloads
Background: Personality distinguishes individuals from one another, guides their actions and reactions, and dictates their preferences in many aspects of life, including shopping.
Objective: This study determines the characteristics of an ideal customer based on individual personality.
Methods: Data mining techniques used in this study are K-nearest neighbour (KNN), linear support vector machine (SVM), and random forest. This study also applies the synthetic minority oversampling technique (SMOTE) to overcome the imbalance in the amount of data.
Results: This study shows that the application of the SMOTE and random forest models resulted in 88% accuracy, 79% precision, and 70% recall, which are the highest compared to other models.
Conclusion: SMOTE in this research is unsuitable for use in the KNN and linear SVM classification models. Ensemble-based models such as random forest can produce high accuracy when SMOTE is applied for data pre-processing.
E. Utami, I. Oyong, S. Raharjo, A. Dwi Hartanto, and S. Adi, "Supervised learning and resampling techniques on DISC personality classification using Twitter information in Bahasa Indonesia,” Appl. Comput. Informatics, 2021.
A. Kunte and S. Panicker, "Personality Prediction of Social Network Users Using Ensemble and XGBoost,” Adv. Intell. Syst. Comput., vol. 1119, pp. 133–140, 2020.
T. Krismayer, M. Schedl, P. Knees, and R. Rabiser, "Predicting user demographics from music listening information,” Multimed. Tools Appl., vol. 78, no. 3, pp. 2897–2920, 2019.
G. Mavis, I. H. Toroslu, and P. Karagoz, "Personality Analysis Using Classification on Turkish Tweets,” Int. J. Cogn. Informatics Nat. Intell., vol. 15, no. 4, pp. 1–18, 2021.
G. Farnadi et al., "Computational personality recognition in social media,” User Model. User-adapt. Interact., vol. 26, no. 2–3, pp. 109–142, 2016.
M. M. Tadesse, H. Lin, B. Xu, and L. Yang, "Personality Predictions Based on User Behavior on the Facebook Social Media Platform,” IEEE Access, vol. 6, pp. 61959–61969, 2018.
F. Celli and L. Rossi, "The role of Emotional Stability in Twitter Conversations,” Proc. Work. Semant. Anal. Soc. Media, conjunction with EACL 2012, pp. 10–17, 2012.
A. Roshchina, J. Cardiff, and P. Rosso, "A comparative evaluation of personality estimation algorithms for the twin recommender system,” Int. Conf. Inf. Knowl. Manag. Proc., pp. 11–17, 2011.
F. Enos, S. Benus, R. L. Cautin, M. Graciarena, J. Hirschberg, and E. Shriberg, "Personality factors in human deception detection: Comparing human to machine performance,” Proc. Annu. Conf. Int. Speech Commun. Assoc. INTERSPEECH, vol. 2, pp. 813–816, 2006.
K. Luyckx and W. Daelemans, "Personae: a corpus for author and personality prediction from text,” Proc. Sixth Int. Conf. Lang. Resour. Eval., pp. 2981–2987, 2008.
J. Golbeck and D. L. Hansen, "Computing political preference among Twitter followers,” Conf. Hum. Factors Comput. Syst. - Proc., pp. 1105–1108, 2011.
D. Nie, Z. Guan, B. Hao, S. Bai, and T. Zhu, "Predicting personality on social media with semi-supervised learning,” Proc. - 2014 IEEE/WIC/ACM Int. Jt. Conf. Web Intell. Intell. Agent Technol. - Work. WI-IAT 2014, vol. 2, pp. 158–165, 2014.
M. Shumanov, H. Cooper, and M. Ewing, "Using AI predicted personality to enhance advertising effectiveness,” Eur. J. Mark., 2021.
J. Vargas, N. ALberto, and O. Arevalo, "Algorithms for Decision Making Through Customer Classification,” Int. Conf. Intell. Comput. Inf. Control Syst. Adv. Intell. Syst. Comput., vol. 1272, 2021.
Y. Zhang, J. Wang, and X. Zhang, "Personalized sentiment classification of customer reviews via an interactive attributes attention model,” Knowledge-Based Syst., vol. 226, p. 107135, 2021.
K. Maheswari, P. Packia, and A. Priya, "Predicting Customer Behavior in Online Shopping Using SVM Classifier,” IEEE Int. Conf. Intell. Tech. Control. Optim. Signal Process., 2017.
T. Tandera, Hendro, D. Suhartono, R. Wongso, and Y. L. Prasetio, "Personality Prediction System from Facebook Users,” Procedia Comput. Sci., vol. 116, pp. 604–611, 2017.
N. Majumder, S. Pouria, A. Gelbukh, and E. Cambria, "Deep Learning-Based Document Modeling for Personality Detection from Text,” IEEE Intell. Syst., pp. 74–79, 2017.
M. A. Rahman, A. Al Faisal, T. Khanam, M. Amjad, and M. S. Siddik, "Personality Detection from Text using Convolutional Neural Network,” 1st Int. Conf. Adv. Sci. Eng. Robot. Technol. 2019, ICASERT 2019, vol. 2019, no. Icasert, pp. 1–6, 2019.
B. Y. Pratama and R. Sarno, "Personality classification based on Twitter text using Naive Bayes, KNN and SVM,” Proc. 2015 Int. Conf. Data Softw. Eng. ICODSE 2015, pp. 170–174, 2016.
Y. Mehta, N. Majumder, A. Gelbukh, and E. Cambria, "Recent trends in deep learning based personality detection,” Artif. Intell. Rev., vol. 53, no. 4, pp. 2313–2339, 2020.
S. Elmitwally, Nouh Sabri; Kanwal, Asma; Abbas, Sagheer; Khan, Muhammad A.; Khan, Muhammad Adnan; Ahmad, Munir; Alanazi, "Personality Detection Using Context Based Emotions in Cognitive Agents,” C. Mater. Contin., vol. 70, no. 3, pp. 4947–4964, 2022.
A. Mamta, Bhamare; K, "Prediction of Personality Traits in Facebook Users,” Lect. Notes Data Eng. Commun. Technol., vol. 86, 2022.
M. S. R. B and A. Singh, "Personality Detection by Analysis of Twitter Profiles,” 8th Int. Conf. Soft Comput. Pattern Recognition, SoCPaR 2016, no. 2194–5357, 2018.
Kaggle, "Customer Personality Analysis,” https://www.kaggle.com/imakash3011/customer-personality-analysis, vol. November, 2021.
N. G. Ramadhan and T. I. Ramadhan, "Analysis Sentiment Based on IMDB Aspects from Movie Reviews using SVM,” Sinkron, vol. 7, no. 1, pp. 39–45, 2022.
N. G. Ramadhan, F. D. Adhinata, A. Jala, T. Segara, and D. Putra, "Deteksi Berita Palsu Menggunakan Metode Random Forest dan Logistic Regression,” J. Ris. Komput., vol. 9, no. 2, pp. 251–256, 2022.
N. G. Ramadhan, "Indonesian Online News Topics Classification using Word2Vec and,” J. Resti, no. 158, pp. 7–10, 2021.
Copyright (c) 2022 The Authors. Published by Universitas Airlangga.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
All accepted papers will be published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain copyright and grant the journal right of first publication. CC-BY Licenced means lets others to Share (copy and redistribute the material in any medium or format) and Adapt (remix, transform, and build upon the material for any purpose, even commercially).