ACCENTS Journals

Download PDF

Enhancing transparency in breast cancer diagnosis through LIME-driven machine learning models

Zulfikar Ali Ansari¹, Md Shamsul Haque Ansari², Alka Singh³, Naziya Hussain⁴, Sandeep Keshav³, Shaik Sanheera⁵ and Anwar Ahamed Shaikh⁶

Artificial Intelligence and Machine Learning Department,Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune,Maharashtra,India¹
School of Computer Science,UPES Dehradun, Misraspatti,Uttarakhand,India²
School of Computer Applications,Noida Institute of Engineering and Technology, Greater Noida,Uttar Pradesh,India³
Department of Computer Science and Engineering,SVKM's NMIMS Deemed to be University, Shirpur Campus, Mumbai,Maharashtra,India⁴
Department of Computer Science and Engineering,Koneru Lakshmaiah Education Foundation, Vaddeswaram,Guntur, 522302,India⁵
Department of Computer Science and Engineering,School of Engineering and Technolog, Sanjivani University, Kopergaon,Maharashtra,India⁶

Corresponding Author : Zulfikar Ali Ansari

Recieved : 03-October-2024; Revised : 12-March-2026; Accepted : 24-March-2026

Abstract

Breast cancer is a serious global health concern. There is a critical need to develop diagnostic tools with enhanced predictive capability and interpretability. This study proposes a diagnostic framework based on machine learning (ML) algorithms, integrated with the local interpretable model-agnostic explanations (LIME) technique, to improve the interpretability of model predictions. Six ML models are considered for breast cancer diagnosis: random forest (RF), decision tree (DT), naïve Bayes (NB), extra trees (ET), extreme gradient boosting (XGBoost), and CatBoost (CB). The proposed models are evaluated on two benchmark datasets obtained from the University of California, Irvine (UCI), ML repository. The performance of the models is assessed using metrics such as accuracy, precision, recall, F1-score, Matthews correlation coefficient (MCC), and receiver operating characteristic-area under curve (ROC-AUC). In addition, a fidelity score is computed to compare the agreement between model predictions and their explanations. The results indicate that the proposed ensemble models exhibit strong predictive performance. Among all models, the ET classifier achieves the best results on both datasets, with accuracies of 97.66% on Dataset 1 and 97.62% on Dataset 2. Although the ensemble models demonstrate superior predictive capability, the DT model attains the highest fidelity score, indicating greater interpretability. Overall, the study highlights the importance of balancing predictive performance with interpretability in ML-based diagnostic systems.

Keywords

Breast cancer diagnosis, Machine learning, Explainable AI (XAI), LIME, Ensemble learning, Model interpretability.

Cite this article

Ansari ZA, Ansari MSH, Singh A, Hussain N, Keshav S, Sanheera S, Shaikh AA. Enhancing transparency in breast cancer diagnosis through LIME-driven machine learning models. International Journal of Advanced Technology and Engineering Exploration. 2026;13(136):410-427. DOI : 10.19101/IJATEE.2024.111101806

References

[1] Mahmood T, Saba T, Rehman A, Alamri FS. Harnessing the power of radiomics and deep learning for improved breast cancer diagnosis with multiparametric breast mammography. Expert Systems with Applications. 2024; 249:123747.

[Crossref] [Google Scholar]

[2] Obeagu EI, Obeagu GU. Breast cancer: a review of risk factors and diagnosis. Medicine. 2024; 103(3):1-6.

[Crossref] [Google Scholar]

[3] Oyeniyi J, Oluwaseyi P. Emerging trends in AI-powered medical imaging: enhancing diagnostic accuracy and treatment decisions. International Journal of Enhanced Research in Science Technology & Engineering. 2024;13(4):81-94

[Google Scholar]

[4] Abeelh EA, Abu AZ. Comparative effectiveness of mammography, ultrasound, and MRI in the detection of breast carcinoma in dense breast tissue: a systematic review. Cureus. 2024; 16(4):1-10.

[Crossref] [Google Scholar]

[5] Wang J, Li B, Luo M, Huang J, Zhang K, Zheng S, et al. Progression from ductal carcinoma in situ to invasive breast cancer: molecular features and clinical significance. Signal Transduction and Targeted Therapy. 2024; 9(1):1-28.

[Crossref] [Google Scholar]

[6] Darbandi MR, Darbandi M, Darbandi S, Bado I, Hadizadeh M, Khorshid HR. Artificial intelligence breakthroughs in pioneering early diagnosis and precision treatment of breast cancer: a multimethod study. European Journal of Cancer. 2024; 209:114227.

[Crossref] [Google Scholar]

[7] Botlagunta M, Botlagunta MD, Myneni MB, Lakshmi D, Nayyar A, Gullapalli JS, et al. Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms. Scientific Reports. 2023; 13(1):1-17.

[Crossref] [Google Scholar]

[8] Albahri AS, Duhaim AM, Fadhel MA, Alnoor A, Baqer NS, Alzubaidi L, et al. A systematic review of trustworthy and explainable artificial intelligence in healthcare: assessment of quality, bias risk, and data fusion. Information Fusion. 2023; 96:156-91.

[Crossref] [Google Scholar]

[9] Hager P, Jungmann F, Holland R, Bhagat K, Hubrecht I, Knauer M, et al. Evaluation and mitigation of the limitations of large language models in clinical decision-making. Nature Medicine. 2024; 30(9):2613-22.

[Crossref] [Google Scholar]

[10] Ramkumar G. A robust breast cancer classification model using extra-trees classifier for histopathological image. In international conference on advances in computing, communication and applied informatics (ACCAI) 2023 (pp. 1-7). IEEE.

[Crossref] [Google Scholar]

[11] Ahmed U, Mahmood A, Tunio MA, Hafeez G, Khan AR, Razzaq S. Investigating boosting techniques’ efficacy in feature selection: a comparative analysis. Energy Reports. 2024; 11:3521-32.

[Crossref] [Google Scholar]

[12] Lemons K. A comparison between naïve bayes and random forest to predict breast cancer. International Journal of Undergraduate Research and Creative Activities. 2023; 12(1):1-8.

[Crossref] [Google Scholar]

[13] Yaqoob A, Verma NK, Aziz RM, Shah MA. Optimizing cancer classification: a hybrid RDO-XGBoost approach for feature selection and predictive insights. Cancer Immunology, Immunotherapy. 2024; 73(12):1-14.

[Crossref] [Google Scholar]

[14] Ali AZ, Madhava TM, Ahmed R. Quantifying breast cancer: radiomics, machine learning, and dimensionality reduction for enhanced image-based diagnosis. International Journal of Computing and Digital Systems. 2024; 16(1):1535-52.

[Crossref] [Google Scholar]

[15] Kallah-dagadu G, Mohammed M, Nasejje JB, Mchunu NN, Twabi HS, Batidzirai JM, et al. Breast cancer prediction based on gene expression data using interpretable machine learning techniques. Scientific Reports. 2025; 15(1):1-19.

[Crossref] [Google Scholar]

[16] Abdullakutty F, Akbari Y, Al-maadeed S, Bouridane A, Talaat IM, Hamoudi R. Histopathology in focus: a review on explainable multi-modal approaches for breast cancer diagnosis. Frontiers in Medicine. 2024; 11:1-23.

[Crossref] [Google Scholar]

[17] Swiderski B, Gielata L, Olszewski P, Osowski S, Kołodziej M. Deep neural system for supporting tumor recognition of mammograms using modified GAN. Expert Systems with Applications. 2021; 164:113968.

[Crossref] [Google Scholar]

[18] Rajbahadur GK, Wang S, Oliva GA, Kamei Y, Hassan AE. The impact of feature importance methods on the interpretation of defect classifiers. IEEE Transactions on Software Engineering. 2021; 48(7):2245-61.

[Crossref] [Google Scholar]

[19] Afify HM, Mohammed KK, Hassanien AE. Novel prediction model on OSCC histopathological images via deep transfer learning combined with grad-CAM interpretation. Biomedical Signal Processing and Control. 2023; 83:104704.

[Crossref] [Google Scholar]

[20] Khater T, Hussain A, Bendardaf R, Talaat IM, Tawfik H, Ansari S, et al. An explainable artificial intelligence model for the classification of breast cancer. IEEE Access. 2023; 13:5618-33.

[Crossref] [Google Scholar]

[21] Jaiswal V, Saurabh P, Lilhore UK, Pathak M, Simaiya S, Dalal S. A breast cancer risk predication and classification model with ensemble learning and big data fusion. Decision Analytics Journal. 2023; 8:1-13.

[Crossref] [Google Scholar]

[22] Iqbal S, Qureshi AN, Aurangzeb K, Alhussein M, Haider SI, Rida I. AMIAC: adaptive medical image analyzes and classification, a robust self-learning framework. Neural Computing and Applications. 2025; 37(25):20451-79.

[Crossref] [Google Scholar]

[23] Carriero A, Groenhoff L, Vologina E, Basile P, Albera M. Deep learning in breast cancer imaging: state of the art and recent advancements in early. Diagnostics. 2024; 14(8):1-34.

[Crossref] [Google Scholar]

[24] Nasser M, Yusof UK. Deep learning based methods for breast cancer diagnosis: a systematic review and future direction. Diagnostics. 2023; 13(1):1-26.

[Crossref] [Google Scholar]

[25] Hoque R, Das S, Hoque M, Haque E. Breast cancer classification using XGBoost. World Journal of Advanced Research and Reviews. 2024; 21(2):1985-94.

[Crossref] [Google Scholar]

[26] Klauschen F, Dippel J, Keyl P, Jurmeister P, Bockmayr M, Mock A, et al. Toward explainable artificial intelligence for precision pathology. Annual Review of Pathology: Mechanisms of Disease. 2024; 19(1):541-70.

[Crossref] [Google Scholar]

[27] Raptis S, Ilioudis C, Theodorou K. From pixels to prognosis: unveiling radiomics models with SHAP and LIME for enhanced interpretability. Biomedical Physics & Engineering Express. 2024; 10(3):1-10.

[Crossref] [Google Scholar]

[28] Hasan AH, Khalid UA, Abid MA, Khalid UA. Leveraging artificial intelligence in breast cancer screening and diagnosis. Cureus. 2025; 17(2):1-6.

[Crossref] [Google Scholar]

[29] Martínez-ramírez JM, Carmona C, Ramírez-expósito MJ, Martínez-martos JM. Extracting knowledge from machine learning models to diagnose breast cancer. Life. 2025; 15(2):1-29.

[Crossref] [Google Scholar]

[30] Asfaw BB, Tegaw EM. Explainable machine learning to compare the overall survival status between patients receiving mastectomy and breast conserving surgeries. Scientific Reports. 2025; 15(1):1-14.

[Crossref] [Google Scholar]

[31] Dou J, Yunus AP, Bui DT, Merghadi A, Sahana M, Zhu Z, et al. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-oshima volcanic Island, Japan. Science of the Total Environment. 2019; 662:332-46.

[Crossref] [Google Scholar]

[32] Salvador J, Perez-pellitero E. Naive bayes super-resolution forest. In proceedings of the international conference on computer vision 2015 (pp. 325-33). IEEE.

[Google Scholar]

[33] Joshi A, Saggar P, Jain R, Sharma M, Gupta D, Khanna A. CatBoost—an ensemble machine learning model for prediction and classification of student academic performance. Advances in Data Science and Adaptive Analysis. 2021; 13(03n04):2141002.

[Crossref] [Google Scholar]

[34] Hussain SM, Buongiorno D, Altini N, Berloco F, Prencipe B, Moschetta M, et al. Shape-based breast lesion classification using digital tomosynthesis images: the role of explainable artificial intelligence. Applied Sciences. 2022; 12(12):1-27.

[Crossref] [Google Scholar]

[35] Elsadig MA, Altigani A, Elshoush HT. Breast cancer detection using machine learning approaches: a comparative study. International Journal of Electrical & Computer Engineering. 2023; 13(1):736-45.

[Crossref] [Google Scholar]

[36] Maouche I, Terrissa LS, Benmohammed K, Zerhouni N. An explainable AI approach for breast cancer metastasis prediction based on clinicopathological data. IEEE Transactions on Biomedical Engineering. 2023; 70(12):3321-9.

[Crossref] [Google Scholar]

[37] Islam T, Sheakh MA, Tahosin MS, Hena MH, Akash S, Bin JYA, et al. Predictive modeling for breast cancer classification in the context of Bangladeshi patients by use of machine learning approach with explainable AI. Scientific Reports. 2024; 14(1):1-17.

[Crossref] [Google Scholar]

[38] Raghavan K, B S, V K. Attention guided grad-CAM: an improved explainable artificial intelligence model for infrared breast cancer detection. Multimedia Tools and Applications. 2024; 83(19):57551-78.

[Crossref] [Google Scholar]

[39] Dutta M, Hasan KM, Akter A, Rahman MH, Assaduzzaman M. An interpretable machine learning-based breast cancer classification using XGBoost, SHAP, and LIME. Bulletin of Electrical Engineering and Informatics. 2024; 13(6):4306-15.

[Crossref] [Google Scholar]