International Journal of Advanced Technology and Engineering Exploration ISSN (Print): 2394-5443    ISSN (Online): 2394-7454 Volume-13 Issue-136 March-2026
  1. 4037
    Citations
  2. 2.7
    CiteScore
Enhancing transparency in breast cancer diagnosis through LIME-driven machine learning models

Zulfikar Ali Ansari1, Md Shamsul Haque Ansari2, Alka Singh3, Naziya Hussain4, Sandeep Keshav3, Shaik Sanheera5 and Anwar Ahamed Shaikh6

Artificial Intelligence and Machine Learning Department,Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune,Maharashtra,India1
School of Computer Science,UPES Dehradun, Misraspatti,Uttarakhand,India2
School of Computer Applications,Noida Institute of Engineering and Technology, Greater Noida,Uttar Pradesh,India3
Department of Computer Science and Engineering,SVKM's NMIMS Deemed to be University, Shirpur Campus, Mumbai,Maharashtra,India4
Department of Computer Science and Engineering,Koneru Lakshmaiah Education Foundation, Vaddeswaram,Guntur, 522302,India5
Department of Computer Science and Engineering,School of Engineering and Technolog, Sanjivani University, Kopergaon,Maharashtra,India6
Corresponding Author : Zulfikar Ali Ansari

Recieved : 03-October-2024; Revised : 12-March-2026; Accepted : 24-March-2026

Abstract

Breast cancer is a serious global health concern. There is a critical need to develop diagnostic tools with enhanced predictive capability and interpretability. This study proposes a diagnostic framework based on machine learning (ML) algorithms, integrated with the local interpretable model-agnostic explanations (LIME) technique, to improve the interpretability of model predictions. Six ML models are considered for breast cancer diagnosis: random forest (RF), decision tree (DT), naïve Bayes (NB), extra trees (ET), extreme gradient boosting (XGBoost), and CatBoost (CB). The proposed models are evaluated on two benchmark datasets obtained from the University of California, Irvine (UCI), ML repository. The performance of the models is assessed using metrics such as accuracy, precision, recall, F1-score, Matthews correlation coefficient (MCC), and receiver operating characteristic-area under curve (ROC-AUC). In addition, a fidelity score is computed to compare the agreement between model predictions and their explanations. The results indicate that the proposed ensemble models exhibit strong predictive performance. Among all models, the ET classifier achieves the best results on both datasets, with accuracies of 97.66% on Dataset 1 and 97.62% on Dataset 2. Although the ensemble models demonstrate superior predictive capability, the DT model attains the highest fidelity score, indicating greater interpretability. Overall, the study highlights the importance of balancing predictive performance with interpretability in ML-based diagnostic systems.

Keywords

Breast cancer diagnosis, Machine learning, Explainable AI (XAI), LIME, Ensemble learning, Model interpretability.

Cite this article

Ansari ZA, Ansari MSH, Singh A, Hussain N, Keshav S, Sanheera S, Shaikh AA. Enhancing transparency in breast cancer diagnosis through LIME-driven machine learning models. International Journal of Advanced Technology and Engineering Exploration. 2026;13(136):410-427. DOI : 10.19101/IJATEE.2024.111101806

References

[1] Mahmood T, Saba T, Rehman A, Alamri FS. Harnessing the power of radiomics and deep learning for improved breast cancer diagnosis with multiparametric breast mammography. Expert Systems with Applications. 2024; 249:123747.

[2] Obeagu EI, Obeagu GU. Breast cancer: a review of risk factors and diagnosis. Medicine. 2024; 103(3):1-6.

[3] Oyeniyi J, Oluwaseyi P. Emerging trends in AI-powered medical imaging: enhancing diagnostic accuracy and treatment decisions. International Journal of Enhanced Research in Science Technology & Engineering. 2024;13(4):81-94

[4] Abeelh EA, Abu AZ. Comparative effectiveness of mammography, ultrasound, and MRI in the detection of breast carcinoma in dense breast tissue: a systematic review. Cureus. 2024; 16(4):1-10.

[5] Wang J, Li B, Luo M, Huang J, Zhang K, Zheng S, et al. Progression from ductal carcinoma in situ to invasive breast cancer: molecular features and clinical significance. Signal Transduction and Targeted Therapy. 2024; 9(1):1-28.

[6] Darbandi MR, Darbandi M, Darbandi S, Bado I, Hadizadeh M, Khorshid HR. Artificial intelligence breakthroughs in pioneering early diagnosis and precision treatment of breast cancer: a multimethod study. European Journal of Cancer. 2024; 209:114227.

[7] Botlagunta M, Botlagunta MD, Myneni MB, Lakshmi D, Nayyar A, Gullapalli JS, et al. Classification and diagnostic prediction of breast cancer metastasis on clinical data using machine learning algorithms. Scientific Reports. 2023; 13(1):1-17.

[8] Albahri AS, Duhaim AM, Fadhel MA, Alnoor A, Baqer NS, Alzubaidi L, et al. A systematic review of trustworthy and explainable artificial intelligence in healthcare: assessment of quality, bias risk, and data fusion. Information Fusion. 2023; 96:156-91.

[9] Hager P, Jungmann F, Holland R, Bhagat K, Hubrecht I, Knauer M, et al. Evaluation and mitigation of the limitations of large language models in clinical decision-making. Nature Medicine. 2024; 30(9):2613-22.

[10] Ramkumar G. A robust breast cancer classification model using extra-trees classifier for histopathological image. In international conference on advances in computing, communication and applied informatics (ACCAI) 2023 (pp. 1-7). IEEE.

[11] Ahmed U, Mahmood A, Tunio MA, Hafeez G, Khan AR, Razzaq S. Investigating boosting techniques’ efficacy in feature selection: a comparative analysis. Energy Reports. 2024; 11:3521-32.

[12] Lemons K. A comparison between naïve bayes and random forest to predict breast cancer. International Journal of Undergraduate Research and Creative Activities. 2023; 12(1):1-8.

[13] Yaqoob A, Verma NK, Aziz RM, Shah MA. Optimizing cancer classification: a hybrid RDO-XGBoost approach for feature selection and predictive insights. Cancer Immunology, Immunotherapy. 2024; 73(12):1-14.

[14] Ali AZ, Madhava TM, Ahmed R. Quantifying breast cancer: radiomics, machine learning, and dimensionality reduction for enhanced image-based diagnosis. International Journal of Computing and Digital Systems. 2024; 16(1):1535-52.

[15] Kallah-dagadu G, Mohammed M, Nasejje JB, Mchunu NN, Twabi HS, Batidzirai JM, et al. Breast cancer prediction based on gene expression data using interpretable machine learning techniques. Scientific Reports. 2025; 15(1):1-19.

[16] Abdullakutty F, Akbari Y, Al-maadeed S, Bouridane A, Talaat IM, Hamoudi R. Histopathology in focus: a review on explainable multi-modal approaches for breast cancer diagnosis. Frontiers in Medicine. 2024; 11:1-23.

[17] Swiderski B, Gielata L, Olszewski P, Osowski S, Kołodziej M. Deep neural system for supporting tumor recognition of mammograms using modified GAN. Expert Systems with Applications. 2021; 164:113968.

[18] Rajbahadur GK, Wang S, Oliva GA, Kamei Y, Hassan AE. The impact of feature importance methods on the interpretation of defect classifiers. IEEE Transactions on Software Engineering. 2021; 48(7):2245-61.

[19] Afify HM, Mohammed KK, Hassanien AE. Novel prediction model on OSCC histopathological images via deep transfer learning combined with grad-CAM interpretation. Biomedical Signal Processing and Control. 2023; 83:104704.

[20] Khater T, Hussain A, Bendardaf R, Talaat IM, Tawfik H, Ansari S, et al. An explainable artificial intelligence model for the classification of breast cancer. IEEE Access. 2023; 13:5618-33.

[21] Jaiswal V, Saurabh P, Lilhore UK, Pathak M, Simaiya S, Dalal S. A breast cancer risk predication and classification model with ensemble learning and big data fusion. Decision Analytics Journal. 2023; 8:1-13.

[22] Iqbal S, Qureshi AN, Aurangzeb K, Alhussein M, Haider SI, Rida I. AMIAC: adaptive medical image analyzes and classification, a robust self-learning framework. Neural Computing and Applications. 2025; 37(25):20451-79.

[23] Carriero A, Groenhoff L, Vologina E, Basile P, Albera M. Deep learning in breast cancer imaging: state of the art and recent advancements in early. Diagnostics. 2024; 14(8):1-34.

[24] Nasser M, Yusof UK. Deep learning based methods for breast cancer diagnosis: a systematic review and future direction. Diagnostics. 2023; 13(1):1-26.

[25] Hoque R, Das S, Hoque M, Haque E. Breast cancer classification using XGBoost. World Journal of Advanced Research and Reviews. 2024; 21(2):1985-94.

[26] Klauschen F, Dippel J, Keyl P, Jurmeister P, Bockmayr M, Mock A, et al. Toward explainable artificial intelligence for precision pathology. Annual Review of Pathology: Mechanisms of Disease. 2024; 19(1):541-70.

[27] Raptis S, Ilioudis C, Theodorou K. From pixels to prognosis: unveiling radiomics models with SHAP and LIME for enhanced interpretability. Biomedical Physics & Engineering Express. 2024; 10(3):1-10.

[28] Hasan AH, Khalid UA, Abid MA, Khalid UA. Leveraging artificial intelligence in breast cancer screening and diagnosis. Cureus. 2025; 17(2):1-6.

[29] Martínez-ramírez JM, Carmona C, Ramírez-expósito MJ, Martínez-martos JM. Extracting knowledge from machine learning models to diagnose breast cancer. Life. 2025; 15(2):1-29.

[30] Asfaw BB, Tegaw EM. Explainable machine learning to compare the overall survival status between patients receiving mastectomy and breast conserving surgeries. Scientific Reports. 2025; 15(1):1-14.

[31] Dou J, Yunus AP, Bui DT, Merghadi A, Sahana M, Zhu Z, et al. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-oshima volcanic Island, Japan. Science of the Total Environment. 2019; 662:332-46.

[32] Salvador J, Perez-pellitero E. Naive bayes super-resolution forest. In proceedings of the international conference on computer vision 2015 (pp. 325-33). IEEE.

[33] Joshi A, Saggar P, Jain R, Sharma M, Gupta D, Khanna A. CatBoost—an ensemble machine learning model for prediction and classification of student academic performance. Advances in Data Science and Adaptive Analysis. 2021; 13(03n04):2141002.

[34] Hussain SM, Buongiorno D, Altini N, Berloco F, Prencipe B, Moschetta M, et al. Shape-based breast lesion classification using digital tomosynthesis images: the role of explainable artificial intelligence. Applied Sciences. 2022; 12(12):1-27.

[35] Elsadig MA, Altigani A, Elshoush HT. Breast cancer detection using machine learning approaches: a comparative study. International Journal of Electrical & Computer Engineering. 2023; 13(1):736-45.

[36] Maouche I, Terrissa LS, Benmohammed K, Zerhouni N. An explainable AI approach for breast cancer metastasis prediction based on clinicopathological data. IEEE Transactions on Biomedical Engineering. 2023; 70(12):3321-9.

[37] Islam T, Sheakh MA, Tahosin MS, Hena MH, Akash S, Bin JYA, et al. Predictive modeling for breast cancer classification in the context of Bangladeshi patients by use of machine learning approach with explainable AI. Scientific Reports. 2024; 14(1):1-17.

[38] Raghavan K, B S, V K. Attention guided grad-CAM: an improved explainable artificial intelligence model for infrared breast cancer detection. Multimedia Tools and Applications. 2024; 83(19):57551-78.

[39] Dutta M, Hasan KM, Akter A, Rahman MH, Assaduzzaman M. An interpretable machine learning-based breast cancer classification using XGBoost, SHAP, and LIME. Bulletin of Electrical Engineering and Informatics. 2024; 13(6):4306-15.