Comparison of SMOTE and ADASYN in Optimizing Random Forest Model for Imbalanced Financial Ratio Bankruptcy Prediction

  • Novanda Rizky Ramadhana Universitas Lambung Mangkurat
  • Fuad Muhajirin Farid Universitas Lambung Mangkurat
  • Yeni Rahkmawati Universitas Lambung Mangkurat
Keywords: SMOTE, ADASYN, Random Forest, Company Bankruptcy, Financial Ratios

Abstract

Classification is a data analysis process that can predict classes based on predefined characteristics. In the era of big data, classification can be performed using machine learning. The problem of machine learning in classification analysis is imbalance data which often affect model performance. SMOTE and ADASYN are oversampling techniques to solve this problem. This study aims to evaluate the effectiveness of SMOTE and ADASYN in improving the performance of the Random Forest model on imbalanced data in the case of company bankruptcy using financial ratios. Models were built using training data with various splitting data and oversampling techniques. Then, the resulting models will be tested using testing data. The results show that the best model was achieved with a combination of splitting data 70:30 using SMOTE technique, which produced the highest f1-score of 40.57%, compared to ADASYN technique with 36.11% (a decrease of 4.46%), and without oversampling techniques with 19.51% (a decrease of 21.06%). The findings indicate SMOTE and ADASYN can identify minority values which are the main problem of imbalance data, with SMOTE showing better performance compared to ADASYN.

References

V. E. Syukrina Janrosl, A. Putra Prima, P. Studi Akuntansi, F. Ilmu Sosial dan Humaniora, U. Putera Batam, and S. Galileo, “Potensi Kebangkrutan Menggunakan Model Zavgren Dan Altman Pada Perusahaan Di Indonesia,” Measurement: Jurnal Akuntansi, vol. 16, no. 2, pp. 159–165, 2022.

Reskianty, “Implementasi Metode Support Vector Machine Dan Random Forest Untuk Dataset Tidak Seimbang (Studi Kasus: Klasifikasi Kebangkrutan Perusahaan),” Universitas Hasanuddin, Makassar, 2022.

A. Kurniadi, “Analisis Rasio Keuangan Untuk Memprediksi Financial Distress Perusahaan Manufaktur Di BEI,” Jurnal Ilmiah Manajemen Kesatuan, vol. 9, no. 3, pp. 495–508, Dec. 2021, doi: 10.37641/jimkes.v9i3.511.

B. G. Putri and S. Munfaqiroh, “Analisis Rasio Keuangan Untuk Mengukur Kinerja Keuangan,” INSPIRASI: Jurnal Ilmu-Ilmu Sosial, vol. 17, no. 1, pp. 214–226, 2020.

N. P. Aldy, “Pendekatan Algoritma Cost Sensitive Decision Tree Pada Klasifikasi Film Berdasarkan Perolehan Kompilasi Dari Internet Movie Database (IMDB),” Universitas Lambung Mangkurat, Banjarbaru, 2024.

I. Hayati, “Klasifikasi Mahasiswa Berpotensi Drop Out Menggunakan Algoritma Decision Tree C4.5 Dan Naive Bayes Di Universitas Jambi,” Universitas Jambi, Jambi, 2021.

Ary Prandika Siregar, Dwi Priyadi Purba, Jojor Putri Pasaribu, and Khairul Reza Bakara, “Implementasi Algoritma Random Forest Dalam Klasifikasi Diagnosis Penyakit Stroke,” Jurnal Penelitian Rumpun Ilmu Teknik, vol. 2, no. 4, pp. 155–164, Nov. 2023, doi: 10.55606/juprit.v2i4.3039.

R. Hariyanto and A. A. Widodo, “Klasifikasi Hasil Prediksi Panen Padi Berdasarkan Fisiologis Menggunakan Metode Nãive Bayes Classification” Conference on Innovation and Application of Science and Technology (CIASTECH 2019), pp. 237–244, 2019.

O.- Pahlevi, A.- Amrin, and Y.- Handrianto, “Implementasi Algoritma Klasifikasi Random Forest Untuk Penilaian Kelayakan Kredit,” Jurnal Infortech, vol. 5, no. 1, pp. 71–76, Jun. 2023, doi: 10.31294/infortech.v5i1.15829.

I. Sulistiani, E. Mufida, P. M. Yasser, and L. Alamsyah, “Systematic Literature Review: Bankruptcy Prediction Menggunakan Teknik Machine Learning dan Deep Learning,” INTECH, vol. 2, no. 1, pp. 13–18, Jun. 2021, doi: 10.54895/intech.v2i1.824.

R. G. Wardhana, G. Wang, and F. Sibuea, “Penerapan Machine Learning Dalam Prediksi Tingkat Kasus Penyakit Di Indonesia,” Journal of Information System Management (JOISM), vol. 5, no. 1, pp. 40–45, Jul. 2023, doi: 10.24076/joism.2023v5i1.1136.

H. Marlina, Elmayati, A. Zulius, and H. O. L. Wijaya, “Penerapan Algoritma Random Forest Dalam Klasifikasi Penjurusan di SMA Negeri Tugumulyo,” Brahmana: Jurnal Penerapan Kecerdasan Buatan, vol. 4, no. 2, pp. 138–143, 2023.

H. A. Salman, A. Kalakech, and A. Steiti, “Random Forest Algorithm Overview,” Babylonian Journal of Machine Learning, vol. 2024, pp. 69–79, Jun. 2024, doi: 10.58496/BJML/2024/007.

R. D. Fitriani, H. Yasin, and T. Tarno, “Penanganan Klasifikasi Kelas Data Tidak Seimbang Dengan Random Oversampling Pada Naive Bayes (Studi Kasus: Status Peserta KB IUD di Kabupaten Kendal),” Jurnal Gaussian, vol. 10, no. 1, pp. 11–20, 2021.

R. Siringoringo, “Klasifikasi Data Tidak Seimbang Menggunakan Algoritma Smote Dan k-Nearest Neighbor,” Journal Information System Development (ISD), vol. 3, no. 1, pp. 44–49, 2018.

M. H. A. Hamid, M. Yusoff, and A. Mohamed, “Survey on Highly Imbalanced Multi-class Data,” International Journal of Advanced Computer Science and Applications, vol. 13, no. 6, 2022, doi: 10.14569/IJACSA.2022.0130627.

A. Indrawati, H. Subagyo, A. Sihombing, W. Wagiyah, and S. Afandi, “Analyzing The Impact Of Resampling Method For Imbalanced Data Text In Indonesian Scientific Articles Categorization,” BACA: JURNAL DOKUMENTASI DAN INFORMASI, vol. 41, no. 2, p. 133, Dec. 2020, doi: 10.14203/j.baca.v41i2.702.

C. Agustina and E. Rahmawati, “Optimalisasi Algoritma Random Forest Menggunakan SMOTE untuk Prediksi Pembatalan Tamu Hotel,” EVOLUSI : Jurnal Sains dan Manajemen, vol. 12, no. 2, Sep. 2024, doi: 10.31294/evolusi.v12i2.23149.

M. I. Anugrah, J. Zeniarja, and D. S. Setiawan, “Peningkatan Performa Model Hard Voting Classifier dengan Teknik Oversampling ADASYN pada Penyakit Diabetes,” Edumatic: Jurnal Pendidikan Informatika, vol. 8, no. 1, pp. 290–299, Jun. 2024, doi: 10.29408/edumatic.v8i1.25838.

E. Erlin, Y. Desnelita, N. Nasution, L. Suryati, and F. Zoromi, “Dampak SMOTE terhadap Kinerja Random Forest Classifier berdasarkan Data Tidak seimbang,” MATRIK : Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, vol. 21, no. 3, pp. 677–690, Jul. 2022, doi: 10.30812/matrik.v21i3.1726.

W.-Y. Loh and Y. Shih, “Split Selection Methods for Classification Trees,” Stat Sin, pp. 815–840, 1999.

S. Tangirala, “Evaluating the Impact of GINI Index and Information Gain on Classification using Decision Tree Classifier Algorithm*,” International Journal of Advanced Computer Science and Applications, vol. 11, no. 2, 2020, doi: 10.14569/IJACSA.2020.0110277.

S. Mahmuda, “Implementasi Metode Random Forest pada Kategori Konten Kanal Youtube,” JURNAL JENDELA MATEMATIKA, vol. 2, no. 01, pp. 21–31, Jan. 2024, doi: 10.57008/jjm.v2i01.633.

E. Sutoyo and M. A. Fadlurrahman, “Penerapan SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Television Advertisement Performance Rating Menggunakan Artificial Neural Network,” Jurnal Edukasi dan Penelitian Informatika (JEPIN), vol. 6, no. 3, p. 379, Dec. 2020, doi: 10.26418/jp.v6i3.42896.

A. N. Kasanah, M. Muladi, and U. Pujianto, “Penerapan Teknik SMOTE untuk Mengatasi Imbalance Class dalam Klasifikasi Objektivitas Berita Online Menggunakan Algoritma KNN,” Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi), vol. 3, no. 2, pp. 196–201, Aug. 2019, doi: 10.29207/resti.v3i2.945.

M. S. Pangestu and M. A. Fitriani, “Perbandingan Perhitungan Jarak Euclidean Distance, Manhattan Distance, dan Cosine Similarity dalam Pengelompokan Data Bibit Padi Menggunakan Algoritma K-Means,” Sainteks, vol. 19, no. 2, p. 141, Oct. 2022, doi: 10.30595/sainteks.v19i2.14495.

D. V. Ramadhanti, R. Santoso, and T. Widiharih, “Perbandingan Smote Dan Adasyn Pada Data Imbalance Untuk Klasifikasi Rumah Tangga Miskin Di Kabupaten Temanggung Dengan Algoritma k-Nearest Neighbor,” Jurnal Gaussian, vol. 11, no. 4, pp. 499–505, Feb. 2023, doi: 10.14710/j.gauss.11.4.499-505.

P. Romadloni, B. Adhi Kusuma, and W. Maulana Baihaqi, “Komparasi Metode Pembelajaran Mesin Untuk Implementasi Pengambilan Keputusan Dalam Menentukan Promosi Jabatan Karyawan,” JATI (Jurnal Mahasiswa Teknik Informatika), vol. 6, no. 2, pp. 622–628, Sep. 2022, doi: 10.36040/jati.v6i2.5238.

D. Normawati, “Implementasi Naïve Bayes Classifier Dan Confusion Matrix Pada Analisis Sentimen Berbasis Teks Pada Twitter,” Jurnal Sains Komputer & Informatika (J-SAKTI), pp. 697–711, 2021.

S. Riyanto, I. S. Sitanggang, T. Djatna, and T. D. Atikah, “Comparative Analysis using Various Performance Metrics in Imbalanced Data for Multi-class Text Classification,” International Journal of Advanced Computer Science and Applications, vol. 14, no. 6, 2023, doi: 10.14569/IJACSA.2023.01406116.

Published
2026-01-12
How to Cite
Ramadhana, N. R., Farid, F. M., & Rahkmawati, Y. (2026). Comparison of SMOTE and ADASYN in Optimizing Random Forest Model for Imbalanced Financial Ratio Bankruptcy Prediction. Jurnal Teknoinfo, 20(1), 81-92. https://doi.org/10.33365/teknoinfo.v20i1.1056