Fuzzy-driven gradient boosting for interpretable cardiovascular disease risk prediction
Fadhillah Azmi1, Amir Saleh2, Nurul Khairina3, Surya Alpacino Manurung1 and Muhammad Fadhil1
1 Department of Computer Engineering, Politeknik Negeri Medan, Jl. Almamater No. 1, Padang Bulan, Medan 20155, North Sumatra, Indonesia
2 Department of Informatics Engineering, Universitas Medan Area, Jl. Kolam No. 1, Medan Estate, Percut Sei Tuan, Medan 20223, North Sumatra, Indonesia
Corresponding Author : Fadhillah Azmi
Received : 30-June-2025; Revised : 15-February-2026; Accepted : 17-February-2026
Abstract
Cardiovascular disease (CVD) remains the leading cause of mortality worldwide, which underscores the need for predictive models that are both precise and clinically interpretable. However, most machine learning (ML) models operate as “black boxes,” which limits their utility for practical medical decision-making. This paper presents a fuzzy-driven gradient boosting (GB) framework that incorporates fuzzy logic into the ML workflow to enhance interpretability while maintaining predictive performance. Using the fuzzy c-means (FCM) method, triangular membership functions (MFs) are constructed to convert five principal clinical variables, namely age, height, weight, systolic blood pressure (ap_hi), and diastolic blood pressure (ap_lo), into linguistically interpretable categories. Each data point is transformed into a set of fuzzy membership values derived from the corresponding MFs, which express its partial association with the different linguistic categories. The framework is evaluated using 10-fold cross-validation (CV) on three feature types: raw data, fuzzy-linguistic labels, and fuzzy MF values, the last of which is the core fuzzy representation. The experimental results indicate that the hybrid model achieves the best performance, with an accuracy of 73.37% and an area under the receiver operating characteristic curve (AUROC) of 0.800 on Dataset I, and an accuracy of 98.83% with an AUROC of 0.996 on Dataset II. The fuzzy transformation was shown to significantly improve model transparency and to provide clinically meaningful explanations. Overall, the proposed framework offers a promising direction for the development of interpretable, reliable, and high-performance artificial intelligence (AI)-based clinical decision support systems (CDSS) for predicting CVD risk.
Keywords
Cardiovascular disease (CVD), Fuzzy logic, Gradient boosting, Fuzzy c-means (FCM), Clinical decision support systems (CDSS).
Cite this article
Azmi F, Saleh A, Khairina N, Manurung SA, Fadhil M. Fuzzy-driven gradient boosting for interpretable cardiovascular disease risk prediction. International Journal of Advanced Technology and Engineering Exploration. 2026;13(135):225-244. DOI : 10.19101/IJATEE.2025.121220881
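The fuzzification step summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`fcm_1d`, `fuzzify`), the choice of three categories, the labels, and the sample ages are all hypothetical. It also assumes one plausible construction of the MFs, in which each FCM cluster center becomes the peak of a triangle whose feet sit at the neighboring centers, with the two outer categories flattened into shoulders. The paper itself may derive the MF parameters differently.

```python
import numpy as np


def fcm_1d(x, n_clusters=3, m=2.0, n_iter=100, tol=1e-6):
    """Minimal 1-D fuzzy c-means; returns the cluster centers in sorted order."""
    x = np.asarray(x, dtype=float)
    # Deterministic start (an assumption): spread centers over quantiles of x.
    centers = np.quantile(x, np.linspace(0.1, 0.9, n_clusters))
    for _ in range(n_iter):
        # Distance of every point to every center, shape (n_points, n_clusters).
        dist = np.abs(x[:, None] - centers[None, :]) + 1e-12
        # Standard FCM membership update: u_ik proportional to d_ik^(-2/(m-1)).
        inv = dist ** (-2.0 / (m - 1.0))
        u = inv / inv.sum(axis=1, keepdims=True)
        # Center update: mean of x weighted by u^m.
        w = u ** m
        new_centers = (w * x[:, None]).sum(axis=0) / w.sum(axis=0)
        shift = np.max(np.abs(new_centers - centers))
        centers = new_centers
        if shift < tol:
            break
    return np.sort(centers)


def fuzzify(x, peaks):
    """Triangular membership values of x for each category.

    Category k peaks at peaks[k] and falls linearly to zero at the neighboring
    peaks. np.interp holds the end values constant outside the range, which
    turns the first and last triangles into shoulders.
    Returns an array of shape (n_points, n_categories) whose rows sum to 1.
    """
    x = np.asarray(x, dtype=float)
    peaks = np.asarray(peaks, dtype=float)
    unit = np.eye(len(peaks))
    return np.stack([np.interp(x, peaks, unit[k]) for k in range(len(peaks))], axis=1)


if __name__ == "__main__":
    # Synthetic ages in years (made up for illustration, not from either dataset).
    rng = np.random.default_rng(0)
    ages = np.concatenate([rng.normal(40, 2, 200),
                           rng.normal(55, 2, 200),
                           rng.normal(70, 2, 200)])
    peaks = fcm_1d(ages, n_clusters=3)
    labels = np.array(["low", "medium", "high"])  # hypothetical linguistic labels
    mu = fuzzify([38.0, 47.0, 66.0], peaks)
    print("MF values:\n", mu)
    print("Linguistic labels:", labels[mu.argmax(axis=1)])
```

In this sketch, the rows returned by `fuzzify` correspond to the "fuzzy MF values" feature type, while taking the category with the largest membership yields the "fuzzy-linguistic labels" feature type. Either representation could then be passed to a gradient boosting classifier in place of the raw variable.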
