Clustering of Provincial Health Vulnerability Levels in Indonesia Using the K-Means Method

Keywords: Health Vulnerability, Clustering, K-Means, Health Indicators, Indonesia

Abstract

This study aims to classify the health vulnerability levels of 38 provinces in Indonesia based on health and socio-economic indicators in 2024, including the number of hospitals, access to adequate sanitation, access to safe drinking water, stunting prevalence, number of health facilities, population size, and the percentage of poor population. The analysis began with data normalization using the z-score method to standardize variable scales and prevent dominance by indicators with larger value ranges. Following normalization, the optimal number of clusters was determined using the Elbow method by examining the decrease in inertia across different k-values. Based on the inertia pattern and cluster stability, the optimal number of clusters was identified as K=4, which adequately represents the variation in health vulnerability. The clustering results were subsequently visualized in a spatial map using Indonesia’s provincial administrative boundaries. The visualization revealed clear geographical variation across regions, with Cluster 1 representing provinces with very good health conditions, Cluster 2 good conditions, Cluster 3 moderate conditions, and Cluster 4 provinces requiring special attention regarding health indicators. These findings provide a comprehensive overview of health vulnerability distribution in Indonesia and are expected to inform policymakers and stakeholders in prioritizing region-based health interventions, strengthening health development strategies, and promoting more equitable national health services.

Downloads

Download data is not yet available.

Author Biographies

Okma Arnilia, Universitas Islam Negeri Siber Syekh Nurjati Cirebon

Program Studi Informatika

Sahrial Ihsani Ishak, Universitas Dian Nusantara

Program Studi Teknik Informatika

Tri Widodo, Universitas Teknokrat Indonesia

Program Studi Teknik Komputer

I Gusti Nyoman Agung Bisma Tatwa, Institut Pertanian Bogor

Program Studi Ilmu Komputer

References

Kementerian Kesehatan Republik Indonesia, Profil Kesehatan Indonesia 2023. Jakarta: Kemenkes RI, 2023

Kementerian Kesehatan Republik Indonesia, e-PPGBM (Elektronik Pencatatan dan Pelaporan Gizi Berbasis Masyarakat), 2023.

Bappenas, Pedoman Penilaian Kerentanan. Jakarta: Kementerian PPN/Bappenas.

World Health Organization, Vulnerability and Risk Assessment Framework. Geneva: WHO.

Frontiers, “Health vulnerability and socio-economic impacts,” Frontiers in Public Health.

Pusat Krisis Kesehatan, Kementerian Kesehatan RI, Kerangka Kerentanan Kesehatan.

A. K. Jain, “Data clustering: 50 years beyond K-means,” Pattern Recognition Letters, vol. 31, no. 8, pp. 651–666, 2010.

J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed. Waltham: Morgan Kaufmann, 2012.

T. R. Kodinariya and P. R. Makwana, “Review on determining number of clusters in K-means clustering,” International Journal, 2013.

E-Jurnal Swadharma, “Pemetaan fasilitas kesehatan Kota Bandung menggunakan K-Means,” E-Jurnal Swadharma.

JournalShub, “Clustering data keluhan kesehatan tingkat provinsi dengan K-Means,” JournalShub.

Universitas Bumigora Journal, “Klasterisasi wilayah berdasarkan data penyakit tidak menular di Banten menggunakan K-Means,” Jurnal Universitas Bumigora.

PubMed, “Comparison of K-Means and Fuzzy C-Means clustering for hospital resource grouping,” Journal of Healthcare Engineering.

E-Jurnal Stimata, “Klasterisasi profil penyakit kabupaten/kota di Jawa Timur menggunakan K-Means,” E-Jurnal Stimata.

G. James, D. Witten, T. Hastie, and R. Tibshirani, An Introduction to Statistical Learning, Springer, 2013.

I. H. Witten, E. Frank, and M. A. Hall, Data Mining: Practical Machine Learning Tools and Techniques, 3rd ed. Morgan Kaufmann, 2011.

T. R. Kodinariya and P. R. Makwana, “Review on determining number of clusters in K-means clustering,” International Journal of Advance Research in Computer Science and Management Studies, vol. 1, no. 6, pp. 90–95, 2013.

A. K. Jain, M. N. Murty, and P. J. Flynn, “Data clustering: A review,” ACM Computing Surveys, vol. 31, no. 3, pp. 264–323, 1999.

S. Theodoridis and K. Koutroumbas, Pattern Recognition, 4th ed. London: Academic Press, 2009.

L. Kaufman and P. J. Rousseeuw, Finding Groups in Data: An Introduction to Cluster Analysis. New York: Wiley, 2005.

M. F. Goodchild, “GIS and spatial analysis,” International Journal of Geographical Information Systems, vol. 6, no. 4, pp. 385–395, 1992.

P. Longley, M. F. Goodchild, D. J. Maguire, and D. W. Rhind, Geographic Information Systems and Science, 3rd ed. Hoboken, NJ: Wiley, 2011.

T. A. Slocum, R. B. McMaster, F. C. Kessler, and H. H. Howard, Thematic Cartography and Geovisualization, 3rd ed. Upper Saddle River, NJ: Prentice Hall, 2009.

M. De Smith, M. Goodchild, and P. Longley, Geospatial Analysis: A Comprehensive Guide to Principles, Techniques and Software Tools, 6th ed. Winchelsea Press, 2018.

N. Andrienko and G. Andrienko, Exploratory Analysis of Spatial and Temporal Data: A Systematic Approach, Berlin: Springer, 2006

Published
2026-03-25
How to Cite
Arnilia, O., Ishak, S. I., Widodo, T., & Nyoman Agung Bisma Tatwa, I. G. (2026). Clustering of Provincial Health Vulnerability Levels in Indonesia Using the K-Means Method. Jurnal Informatika Dan Rekayasa Perangkat Lunak, 7(1), 26-34. https://doi.org/10.33365/jatika.v7i1.1469