Prediksi Penyakit Jantung Berbasis CRISP-DM Menggunakan Algoritma Random Forest
DOI:
https://doi.org/10.61722/jssr.v4i4.11287Keywords:
Heart Disease, Random Forest, CRISP-DM, Machine learning, PredictionAbstract
Heart disease is one of the leading causes of death worldwide and is influenced by various risk factors, such as age, cholesterol levels, blood pressure, diabetes, and unhealthy lifestyles. The increasing prevalence of heart disease highlights the need for a prediction system that can support early detection quickly and accurately. This study aims to develop a heart disease prediction model using the Random Forest algorithm based on the Cross Industry Standard Process for Data mining (CRISP-DM) methodology. The dataset consisted of 1,000 patient records with 15 predictor variables and one target variable (heart_disease). The research stages included Business Understanding, Data Understanding, Data Preparation, Modeling, Evaluation, and Deployment. Model performance was evaluated using Accuracy, Precision, Recall, F1-Score, and Area Under Curve (AUC). The results showed that the Random Forest model achieved an Accuracy of 99%, Precision of 100%, Recall of 97.44%, F1-Score of 98.70%, and an AUC of 1.00. Feature importance analysis revealed that age and cholesterol level were the most influential factors in predicting heart disease. The findings indicate that the CRISP-DM-based Random Forest algorithm can produce an accurate prediction model and has the potential to support early detection of heart disease.
References
R. Diyah Ruswanti, Dahlan Susilo, “Implementasi CRISP-DM pada Data Mining untuk Melakukan Prediksi Pendapatan dengan Algoritma C.45,” GO INFOTECH J. Ilm. STMIK AUB, vol. 30, no. 1, pp. 111–121, 2024, doi: 10.36309/goi.v30i1.266.
F. S. Rakha Gusti Wardhana, Gunawan Wang, “PENERAPAN MACHINE LEARNING DALAM PREDIKSI TINGKAT KASUS PENYAKIT DI INDONESIA,” J. Inf. Syst. Manag., vol. 5, no. 1, pp. 40–45, 2023, doi: https://doi.org/10.24076/joism.2023v5i1.1136.
M. A. Faradeya and E. R. Subhiyakto, “Klasifikasi Penyakit Gagal Jantung Menggunakan Algoritma Naive Bayes,” J. Algoritm., vol. 11, no. 1, pp. 115–127, 2025, doi: 10.33364/algoritma/v.22-1.2178.
D. I. J. Ihsanuddin Akbar, Fidi Supriadi, “PEMANFAATAN MACHINE LEARNING DI BIDANG KESEHATAN,” JATI (Jurnal Mhs. Tek. Inform., vol. 9, no. 1, pp. 1744–1749, 2025, doi: https://doi.org/10.36040/jati.v9i1.12663.
R. Julkarnaen Karo-Karo, Adryan Raihan Syakir, Raihan, Sumanto, Imam Budiawan and A. C. Pakpahan, “Heart Disease Prediction Using Machine Learning Algorithms Based on Health Indicators with Random Forest,” J. Ilm. Sist. Inf., vol. 5, no. 1, pp. 84–95, 2026, doi: https://doi.org/10.51903/1tgz3234.
M. F. R. M. Arya Ardhi Baskara, Nurul Maharani Piranti, “FRAMEWORK DATA MINING : SEBUAH SURVEI,” JATI (Jurnal Mhs. Tek. Inform., vol. 9, no. 3, pp. 4886–4895, 2025, doi: https://doi.org/10.36040/jati.v9i3.13803.
A. S. B. Maulidina Cahaya Rani, Farah Diba Azkia, Revinta Arrova Dewi, Mochamad Wahyudi, Sumanto, “PERBANDINGAN ALGORITMA RANDOM FOREST, NAIVE BAYES, DAN NEURAL NETWORK DALAM KLASIFIKASI PENYAKIT JANTUNG,” J. Sains Inform. Terap., vol. 4, no. 2, pp. 77–84, 2025, doi: https://doi.org/10.62357/jsit.v4i2.609.
A. P. B. K. Muhammad Fauzani Akbar, Ita Damaris Br Munthe, Albert Julian Marito Silaban, Ahmad Dwi Abiyyu, “Literature Reveiew Penerapan Machine Learning Pada Supply Chain Management,” J. Sist. Informasi, Tek. Komput. dan Teknol. Pendidik., vol. 5, no. Ml, pp. 302–310, 2026, doi: https://doi.org/10.55338/justikpen.v5i2.427.
N. D. S. Irfan Sazali Nasution, Arini Dwi Rahmadani, Windi Audina, Dinda Purnama Sari, “Systematic Review: Pengaruh Gaya Hidup dan Pengetahuan Masyarakat terhadap Risiko Penyakit Jantung Koroner,” SEHATRAKYAT (Jurnal Kesehat. Masyarakat), vol. 4, no. 2, pp. 287–298, 2025, doi: 10.54259/sehatrakyat.v4i2.4337.
K. C. J. S. Prasetyo2, “Prediksi Penyakit Jantung Koroner Menggunakan Metode K-NN dan Regresi Logistik Berdasarkan Kerangka Kerja CRISP-DM,” Semin. Nas. Univ. MA CHUNG 2024, vol. 4, no. 2020, pp. 241–248, 2024.
I. Y. Maulana and B. Irawan, “Rancangan Sistem Prediksi Penyakit Jantung Berbasis Framingham Risk Score: Konsistensi Teoritis dan Implementasi Web,” JCBD J. Comput. Digit. Bus., vol. 4, no. 3, pp. 146–151, 2025, doi: https://doi.org/10.56427/jcbd.v4i3.775.
A. Y. Yoga Adi Prasetyo, Ema Utami, “Pengaruh Komposisi Split Data Terhadap Performa Akurasi Analisis Sentimen Algoritma Naïve Bayes dan SVM,” J. Electr. Eng. Comput., vol. 6, no. 2, pp. 382–390, 2024, doi: 10.33650/jeecom.v4i2.
A. S. R. Ergi Putra Febtiawan, Lalu A. Syamsul Irfan Akbar, “Forecasting Produksi Energi Photovoltaic Menggunakan Algoritma Random Forest Classification,” J. Inf. Syst. Res., vol. 5, no. 4, pp. 1053–1062, 2024, doi: 10.47065/josh.v5i4.5514.
N. S. Adinda Dwi Putri, Fitriana Sholekhah, Eric Dadynata, Lusiana Efrizoni, Rahmaddeni, “The Application of C4 . 5 Decision Tree Algorithm for Predicting the Survival Rate of Thyroid Cancer Patients Penerapan Algoritma Decesion Tree C4 . 5 untuk Memprediksi Tingkat Kelangsungan Hidup Pasien Kanker Tiroid,” MALCOM Indones. J. Mach. Learn. Comput. Sci., vol. 4, no. October, pp. 1485–1495, 2024, doi: https://doi.org/10.57152/malcom.v4i4.1532.
M. Muhammad, J. Samodro, E. M. Sakti, and B. M. Kumudasmoro, “Analisis Pengaruh Ketidakseimbangan Data terhadap Kinerja Model Klasifikasi Penyakit Jantung,” J. Softw. Eng. Inf. Syst. e-ISSN, vol. 6, no. 1, pp. 56–62, 2026.
D. B. Saputra et al., “PENERAPAN MODEL CRISP-DM PADA PREDIKSI NASABAH KREDIT MENGGUNAKAN ALGORITMA RANDOM FOREST,” Idealis Indones. J. Inf. Syst., vol. 7, no. 2021, pp. 240–247, 2024.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 JOURNAL SAINS STUDENT RESEARCH

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.











