The Performance Comparison of DBSCAN and K-Means Clustering for MSMEs Grouping based on Asset Value and Turnover
Downloads
Background: This study focuses on the latest knowledge regarding Micro, Small and Medium Enterprises (MSMEs) as a current central issue. These enterprises have shown their significance in providing employment opportunities and contributing to the country's economy. However, MSMEs face various challenges that must be addressed to optimize their outcomes. Understanding the characteristics of this group was crucial in formulating effective strategies.
Objective: This study proposed to cluster or combine micro, small, and medium enterprises (MSMEs) data in a particular area based on asset value and turnover. As a result, this study aimed to gain insights into the MSME landscape in the area and provided valuable information for decision-makers and stakeholders.
Methods: This study utilized two methods, namely the DBSCAN (Density-Based Spatial Clustering of Applications with Noise) method and the K-Means method. These methods were chosen for their distinct capabilities. DBSCAN was selected for its ability to handle noisy data and identify clusters with diverse forms, while K-Means was chosen for its popularity and ability to group data based on proximity. The study used a dataset containing MSME information, including asset values and turnover, collected from various sources.
Results: The outcomes encompassed identifying clusters of MSMEs based on their closeness in the feature space within a specific region. Optimizing the clustering outcomes involved modifying algorithm parameters like epsilon and minimum points for DBSCAN and the number of clusters for K-Means. Furthermore, this study attained a deeper understanding of the arrangement and characteristics of MSME clusters in the region through a comparative analysis of the two methodologies.
Conclusion: This study offered perspectives on clustering MSMEs based on asset value and turnover in a specific region. Employing DBSCAN and K-Means methodologies allowed researchers to depict the MSME landscape and grasp the business attributes of these enterprises. These results could aid in decision-making and strategic planning concerning the advancement of the MSME sector in the mentioned area. Future study may investigate supplementary factors and variables to deepen comprehension of MSME clusters and promote regional growth and sustainability.
Keywords: Asset Value, Clustering, DBSCAN, K-Means, Turnover
H. Gunawan, B. L. Sinaga, and W. P. Sigit Purnomo, "Assessment of the readiness of micro, small and medium enterprises in using E-money using the unified theory of acceptance and use of technology (UTAUT) method,” in Procedia Computer Science, Elsevier B.V., 2019, pp. 316–323.
Subianto and D. Wake, "Indonesia's Fintech Lending Potential,” Indonesia's Fintech Lending.
Organisation for Economic Cooperation and Development (OECD), "Key facts on SME financing: Indonesia,” OECD iLibrary.
T. Boonchoo, X. Ao, Y. Liu, W. Zhao, F. Zhuang, and Q. He, "Grid-based DBSCAN: Indexing and inference,” Pattern Recognit, vol. 90, pp. 271–284, Jun. 2019.
W. Wahyuri, U. Athiyah, I. Puspitasari, and Y. Nita, "Clustering of Drug Sampling Data to Determine Drug Distribution Patterns with K-Means Method : Study on Central Kalimantan Province, Indonesia,” Journal of Information Systems Engineering and Business Intelligence, vol. 5, no. 2, p. 208, Oct. 2019.
H. Cui, W. Wu, Z. Zhang, F. Han, and Z. Liu, "Clustering and application of grain temperature statistical parameters based on the DBSCAN algorithm,” J Stored Prod Res, vol. 93, Sep. 2021.
G. Armano and M. R. Farmani, "Multiobjective clustering analysis using particle swarm optimization,” Expert Syst Appl, vol. 55, pp. 184–193, Aug. 2016.
S. Monalisa and F. Kurnia, "Analysis of DBSCAN and K-means algorithm for evaluating outlier on RFM model of customer behaviour,” Telkomnika (Telecommunication Computing Electronics and Control), vol. 17, no. 1, pp. 110–117, Feb. 2019.
S. F. Galán, "Comparative evaluation of region query strategies for DBSCAN clustering,” Inf Sci (N Y), vol. 502, pp. 76–90, Oct. 2019.
D. Deng, "DBSCAN Clustering Algorithm Based on Density,” Proceedings - 2020 7th International Forum on Electrical Engineering and Automation, IFEEA 2020, pp. 949–953, Sep. 2020.
R. Dhivya and N. Shanmugapriya, "An Efficient DBSCAN with Enhanced Agglomerative Clustering Algorithm,” 2023 4th International Conference on Electronics and Sustainable Communication Systems, ICESC 2023 - Proceedings, pp. 1322–1327, 2023.
L. Ma, "An improved and heuristic-based iterative DBSCAN clustering algorithm,” IEEE Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), pp. 2709–2714, 2021.
R. E. Caraka et al., "Micro, small, and medium enterprises' business vulnerability cluster in Indonesia: An analysis using optimized fuzzy geodemographic clustering,” Sustainability (Switzerland), vol. 13, no. 14, Jul. 2021.
D. Abdullah, S. Susilo, A. S. Ahmar, R. Rusli, and R. Hidayat, "The application of K-means clustering for province clustering in Indonesia of the risk of the COVID-19 pandemic based on COVID-19 data,” Qual Quant, vol. 56, no. 3, pp. 1283–1291, Jun. 2021.
T. A. Cinderatama et al., "Implementasi Metode K-Means, Dbscan, dan Meanshift Untuk Analisis Jenis Ancaman Jaringan Pada Intrusion Detection System,” Jurnal Inovtek Polbeng - Seri Informatika, vol. 7, no. 1, 2022.
E. Kusmiati, D. Turipanam Alamanda, and F. Fahru Roji, "MSME Clusterization Using K-Means Clustering in Garut Regency, Indonesia,” Review of Integrative Business and Economics Research, vol. 12, p. 199, 2023.
K. Wang, R. Yang, C. Liu, T. Samarasinghalage, and Y. Zang, "Extracting Electricity Patterns from High-dimensional Data: A comparison of K-Means and DBSCAN algorithms,” in IOP Conference Series: Earth and Environmental Science, Institute of Physics.
D. Dwi Aulia and N. Nurahman, "Comparison Performance of K-Medoids and K-Means Algorithms In Clustering Community Education Levels,” Jurnal Nasional Pendidikan Teknik Informatika (JANAPATI), vol. 12, no. 2, pp. 273–282, Jul. 2023.
F. Andriyani and Y. Puspitarani, "Performance Comparison of K-Means and DBScan Algorithms for Text Clustering Product Reviews,” SinkrOn, vol. 7, no. 3, pp. 944–949, Jul. 2022.
IBM, "Data Mining Process,” IBM. Accessed: Oct. 09, 2023.
S. Dang and P. H. Ahmad, "Performance Evaluation of Clustering Algorithm Using Different Datasets Computer Science and Management Studies Performance Evaluation of Clustering Algorithm Using Different Datasets,” 2015.
K. Nurmayanti, W. P. Aini, S. R. Amrullah, and L. M. Sya'roni, "Comparison of Algorithms K-Means and DBSCAN for Clustering Student Cognitive Learning Outcomes in Physics Subject,” Kappa Journal, vol. 7, no. 1, pp. 251–255, 2023.
T. Kansal, S. Bahuguna, V. Singh, and T. Choudhury, "Customer Segmentation using K-means Clustering,” Proceedings of the International Conference on Computational Techniques, Electronics and Mechanical Systems, 2018.
Copyright (c) 2024 The Authors. Published by Universitas Airlangga.
This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
All accepted papers will be published under a Creative Commons Attribution 4.0 International (CC BY 4.0) License. Authors retain copyright and grant the journal right of first publication. CC-BY Licenced means lets others to Share (copy and redistribute the material in any medium or format) and Adapt (remix, transform, and build upon the material for any purpose, even commercially).