IMPLEMENTASI ALGORITMA DECISION TREE C4.5 DENGAN IMPROVISASI MEAN DAN MEDIAN PADA DATASET NUMERIK

  • Neni Febiani Universitas Nahdlatul Ulama Blitar
  • Abd. Charis Fauzan Universitas Nahdlatul Ulama Blitar
  • Muhamat Maariful Huda Universitas Nahdlatul Ulama Blitar

Abstract

The decision tree is a method of classifying data mining. The decision tree has one type of algorithm model, namely the C4.5 algorithm. The C4.5 decision tree algorithm is easy to understand because it has a tree-like structure in general. The C4.5 algorithm in handling quantitative data is often less efficient and effective. Based on these problems, this study improvised the numerical attribute dataset using the mean and median in the preprocessing of the data. The improvisation is used to obtain a threshold value, thereby minimizing information loss and time complexity when implementing the C4.5 decision tree in predicting training data. Evaluation of the system used in this study using a confusion matrix. The confusion matrix is ​​used as a benchmark in testing the classification method using data testing. In this study, the dataset was partitioned into three scenarios. In scenario 1 with 70% training data and 20% test data, the highest accuracy is 75%. The improvisation of the mean and median on the numerical attributes in the C4.5 algorithm can be used in this scenario.

References

[1] I. C. Wibowo, A. C. Fauzan, M. Dwi, P. Yustiana, and F. A. Qhabib, “Komparasi Algoritma Naive Bayes dan Decision Tree Untuk Memprediksi Lama Studi Mahasiswa,” vol. 1, no. 2, pp. 65–74, 2019.

[2] N. Azwanti, E. Elisa, U. P. Batam, and J. R. S. Kuning, “InfoTekJar : Jurnal Nasional Informatika dan Teknologi Jaringan Analisis Pola Penyakit Hipertensi Menggunakan Algoritma C4 . 5,” vol. 2, 2019.

[3] I. Massulloh and Fitriyani, “Implementasi Algoritma C4.5 Untuk Klasifikasi Anak Berkebutuhan Khusus Di Ibnu Sina Stimulasi Center,” eProsiding Sist. Inf., vol. 1, no. 1, pp. 136–144, 2020.

[4] A. S. Budiman and X. A. Parandani, “Uji Akurasi Klasifikasi Dan Validasi Data Pada Penggunaan Metode Membership Function Dan Algoritma C4.5 Dalam Penilaian Penerima Beasiswa,” Simetris J. Tek. Mesin, Elektro dan Ilmu Komput., vol. 9, no. 1, pp. 565–578, 2018, doi: 10.24176/simet.v9i1.2021.

[5] N. Cahyani and M. A. Muslim, “Increasing Accuracy of C4 . 5 Algorithm by Applying Discretization and Correlation-based Feature Selection for Chronic Kidney Disease Diagnosis,” vol. 12, no. 1, pp. 25–32, 2020.

[6] A. Ferchichi, K. Nouira, and A. Cherfi, “MC4.5 decision tree algorithm: an improved use of continuous attributes,” Int. J. Comput. Intell. Stud., vol. 9, no. 1/2, p. 4, 2020, doi: 10.1504/ijcistudies.2020.10028137.

[7] M. A. Muslim, S. H. Rukmana, E. Sugiharti, B. Prasetiyo, and S. Alimah, “Optimization of C4.5 algorithm-based particle swarm optimization for breast cancer diagnosis,” J. Phys. Conf. Ser., vol. 983, no. 1, pp. 0–7, 2018, doi: 10.1088/1742-6596/983/1/012063.

[8] A. Djebbar, H. Djellali, H. F. Marouani, and C. Base, “A new modified C4 . 5 Algorithm for improving case retrieval 4 PROPOSED ARCHITECTURE 2 PROBLEM AND OBJECTIVE 3 CASE BASED REASONING,” no. June, pp. 4–5, 2019.

[9] Elin Nurlia and U. Enri, “Penerapan Fitur Seleksi Forward Selection Untuk Menentukan Kematian Akibat Gagal Jantung Menggunakan Algoritma C4.5,” vol. 6, no. 1, pp. 42–50, 2021.

[10] D. Derisma, “Perbandingan Kinerja Algoritma untuk Prediksi Penyakit Jantung dengan Teknik Data Mining,” J. Appl. Informatics Comput., vol. 4, no. 1, pp. 84–88, 2020, doi: 10.30871/jaic.v4i1.2152.

[11] J. J. Pengaribuan, C. Tedja, and W. Sentosa, “Perbandingan Metode Algoritma C4.5 dan Extreme Learning Machine Untuk Mendiagnosis Penyakit Jantung Koroner,” Informatics Eng. Res. Technol., vol. 1, no. 1, pp. 1–7, 2019.

[12] S. R. J. I. Alham, “Sistem Diagnosis Penyakit Jantung Koroner Dengan Menggunakan Algoritma C4.5 Berbasis Website (Studi Kasus: RSUD Dr. Soedarso Pontianak),” Petir, vol. 14, no. 2, pp. 214–222, 2021, doi: 10.33322/petir.v14i2.1338.

[13] Fedesoriano, “Dataset Prediksi Gagal Jantung,” September 2021. https://www.kaggle.com/fedesoriano/heart-failure-prediction (accessed Dec. 14, 2021).

[14] Suyanto, Data Mining Untuk Klasifikasi dan Klasterisasi Data, Revisi. Bandung: Informatika Bandung, 2019.

[15] I. Yulianti, R. A. Saputra, M. S. Mardiyanto, and A. Rahmawati, “Optimasi Akurasi Algoritma C4.5 Berbasis Particle Swarm Optimization dengan Teknik Bagging pada Prediksi Penyakit Ginjal Kronis,” Techno.Com, vol. 19, no. 4, pp. 411–421, 2020, doi: 10.33633/tc.v19i4.3579.

[16] N. A. Sinaga, A. T. Purba, K. Akuntansi, P. B. Indonesia, T. Komputer, and P. B. Indonesia, “Penerapan algoritma c.45 untuk memprediksi tingkat kepuasan mahasiswa terhadap politeknik bisnis indonesia,” vol. 4, pp. 245–254, 2021.

[17] B. Santosa and A. Umam, Data Mining dan Big Data Analytics, 2nd ed. Yogyakarta: Penebar Media Pustaka, 2018.

[18] H. D. Fahma and A. C. Fauzan, “Prediksi Keberlangsungan Studi Mahasiswa Fakultas Ilmu Pendidikan dan Sosial Universitas Nahdlatul Ulama Blitar,” vol. 1, no. 2, pp. 110–119, 2021.

[19] H. H. Patel and P. Prajapati, “Study and Analysis of Decision Tree Based Classification Algorithms,” Int. J. Comput. Sci. Eng., vol. 6, no. 10, pp. 74–78, 2018, doi: 10.26438/ijcse/v6i10.7478.

[20] M. Hasnain, M. F. Pasha, I. Ghani, M. Imran, M. Y. Alzahrani, and R. Budiarto, “Evaluating Trust Prediction and Confusion Matrix Measures for Web Services Ranking,” IEEE Access, vol. 8, pp. 90847–90861, 2020, doi: 10.1109/ACCESS.2020.2994222.

[21] V. S. Ginting, Kusrini, and E. Taufiq, “Implementasi algoritma c4.5 untuk memprediksi keterlambatan pembayaran sumbangan pembangunan pendidikan sekolah menggunakan python,” vol. 10, pp. 36–44, 2020.
Published
2022-06-30
How to Cite
FEBIANI, Neni; FAUZAN, Abd. Charis; HUDA, Muhamat Maariful. IMPLEMENTASI ALGORITMA DECISION TREE C4.5 DENGAN IMPROVISASI MEAN DAN MEDIAN PADA DATASET NUMERIK. Jurnal Tekinkom (Teknik Informasi dan Komputer), [S.l.], v. 5, n. 1, p. 105-114, june 2022. ISSN 2621-3079. Available at: <https://jurnal.murnisadar.ac.id/index.php/Tekinkom/article/view/435>. Date accessed: 18 apr. 2024. doi: https://doi.org/10.37600/tekinkom.v5i1.435.
Section
Articles