(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Computer Research (IJACR)

ISSN (Print):2249-7277    ISSN (Online):2277-7970
Volume-14 Issue-66 March-2024
Full-Text PDF
Paper Title : A hybrid random forest and k-nearest neighbors approach for breast cancer detection
Author Name : Om Prakash Kumar and Animesh Kumar Dubey
Abstract :

In this paper, a novel hybrid approach was presented combining random forest (RF) and k-nearest neighbors (kNN) for the classification of breast cancer data. RF is selected for its robustness against overfitting and its ability to handle high-dimensional data effectively, providing a measure of feature importance and generalizing well due to its ensemble nature. kNN is chosen for its simplicity and effectiveness in capturing local data patterns. Our hybrid RF-kNN approach involves feature importance weighting in kNN, dynamic k selection, polynomial feature expansion, and ensemble output combination. The Wisconsin breast cancer database (WBCD) is used for experimentation, evaluated using 10-fold cross-validation. Performance metrics include accuracy, precision, recall, and F1-score. The results demonstrate that the hybrid RF-kNN model outperforms individual models, achieving superior performance across all metrics and data splits. This highlights the robustness and effectiveness of the hybrid model in reducing false positives and correctly identifying patients with breast cancer, making it a reliable model for breast cancer detection.

Keywords : Breast cancer detection, Random forest, k-nearest neighbors, Hybrid model, Machine learning.
Cite this article : Kumar OP, Dubey AK. A hybrid random forest and k-nearest neighbors approach for breast cancer detection . International Journal of Advanced Computer Research. 2024; 14(66):42-49. DOI:10.19101/IJACR.2024.1466003.
References :
[1]Ng AY, Oberije CJ, Ambrózay É, Szabó E, Serfőző O, Karpati E, et al. Prospective implementation of AI-assisted screen reading to improve early detection of breast cancer. Nature Medicine. 2023; 29(12):3044-9.
[Crossref] [Google Scholar]
[2]Stergiopoulou D, Markou A, Strati A, Zavridou M, Tzanikou E, Mastoraki S, et al. Comprehensive liquid biopsy analysis as a tool for the early detection of minimal residual disease in breast cancer. Scientific Reports. 2023; 13(1):1258.
[Crossref] [Google Scholar]
[3]Singh L, Alam A. An efficient hybrid methodology for an early detection of breast cancer in digital mammograms. Journal of Ambient Intelligence and Humanized Computing. 2024; 15(1):337-60.
[Crossref] [Google Scholar]
[4]Abhisheka B, Biswas SK, Purkayastha B. A comprehensive review on breast cancer detection, classification and segmentation using deep learning. Archives of Computational Methods in Engineering. 2023; 30(8):5023-52.
[Crossref] [Google Scholar]
[5]Gago A, Aguirre JM, Wong L. Machine learning system for the effective diagnosis and survival prediction of breast cancer patients. International Journal of Online & Biomedical Engineering. 2024; 20(2): 95-113.
[Crossref] [Google Scholar]
[6]Hekal AA, Moustafa HE, Elnakib A. Ensemble deep learning system for early breast cancer detection. Evolutionary Intelligence. 2023; 6(3):1045-54.
[Crossref] [Google Scholar]
[7]Khalid A, Mehmood A, Alabrah A, Alkhamees BF, Amin F, Alsalman H, et al. Breast cancer detection and prevention using machine learning. Diagnostics. 2023; 13(19):1-21.
[Crossref] [Google Scholar]
[8]Dubey AK, Gupta U, Jain S. Analysis of k-means clustering approach on the breast cancer wisconsin dataset. International journal of computer assisted radiology and surgery. 2016; 11:2033-47.
[Crossref] [Google Scholar]
[9]Abunasser BS, Al-hiealy MR, Zaqout IS, Abu-naser SS. Convolution neural network for breast cancer detection and classification using deep learning. Asian Pacific Journal of Cancer Prevention. 2023; 24(2):531-44.
[Crossref] [Google Scholar]
[10]Prodan M, Paraschiv E, Stanciu A. Applying deep learning methods for mammography analysis and breast cancer detection. Applied Sciences. 2023; 13(7):1-18.
[Crossref] [Google Scholar]
[11]Rachna, Choudhary C, Thakur J. A robust machine learning model for breast cancer prediction. Optimized Predictive Models in Healthcare Using Machine Learning. 2024:117-34.
[Crossref] [Google Scholar]
[12]Kawina I, Amarendra K, Marapelli B. Deep learning and machine learning approach to breast cancer classification with random search hyperparameter tuning. International Journal of Intelligent Systems and Applications in Engineering. 2024; 12(16s):264-75.
[Google Scholar]
[13]Saroğlu HE, Shayea I, Saoud B, Azmi MH, El-saleh AA, Saad SA, et al. Machine learning, IoT and 5g technologies for breast cancer studies: a review. Alexandria Engineering Journal. 2024; 89:210-23.
[Crossref] [Google Scholar]
[14]Dianati-nasab M, Salimifard K, Mohammadi R, Saadatmand S, Fararouei M, Hosseini KS, et al. Machine learning algorithms to uncover risk factors of breast cancer: insights from a large case-control study. Frontiers in Oncology. 2024; 13:1276232.
[Crossref] [Google Scholar]
[15]Lomboy KE, Hernandez RM. A comparative performance of breast cancer classification using hyper-parameterized machine learning models. International Journal of Advanced Technology and Engineering Exploration. 2021; 8(82):1080-101.
[Crossref] [Google Scholar]
[16]Neelima G, Kanchanamala P, Misra A, Nugraha RA. Detection of breast cancer based on fuzzy logic. In international conference on advancement in data science, e-learning and information system 2023 (pp. 1-6). IEEE.
[Crossref] [Google Scholar]
[17]Jain R, Kukreja V, Chattopadhyay S, Verma A, Sharma R. Radial basis function integrated with support vector machine model for breast cancer detection. In 2nd international conference on artificial intelligence and machine learning applications theme: healthcare and internet of things 2024 (pp. 1-5). IEEE.
[Crossref] [Google Scholar]
[18]Khan RH, Miah J, Rahman MM, Tayaba M. A comparative study of machine learning algorithms for detecting breast cancer. In 13th annual computing and communication workshop and conference 2023 (pp. 647-52). IEEE.
[Crossref] [Google Scholar]
[19]Sun F, Yang X. Advancing breast cancer diagnosis: a comprehensive study of machine learning algorithms on histological tumor characteristics. In international conference on data science & informatics 2023 (pp. 302-6). IEEE.
[Crossref] [Google Scholar]
[20]Sawant A, Patil D, Khuman D, Pingle Y, Shinde V. Enhancing breast cancer detection: a machine learning approach for early diagnosis and classification. In 11th international conference on computing for sustainable global development 2024 (pp. 235-9). IEEE.
[Crossref] [Google Scholar]
[21]Vasista VL, Sona K, Pedarla J, Sahithi B, Rao TK, Prakash KB. Predicting breast cancer using classical machine learning and deep learning algorithms. In international conference on intelligent and innovative technologies in computing, electrical and electronics 2023 (pp. 988-91). IEEE.
[Crossref] [Google Scholar]
[22]Tinao MM, Rodriguez RB, Calibara ER. Breast cancer detection in the Philippines using machine learning approaches. In international conference on electronics, information, and communication 2024 (pp. 1-4). IEEE.
[Crossref] [Google Scholar]
[23]Kumar R, Chaudhry M, Patel HK, Prakash N, Dogra A, Kumar S. An analysis of ensemble machine learning algorithms for breast cancer detection: performance and generalization. In 11th international conference on computing for sustainable global development 2024 (pp. 366-70). IEEE.
[Crossref] [Google Scholar]
[24]Tripathi RP, Khatri SK, Van GD, Ather D. Unleashing the power of machine learning: a precision paradigm for breast cancer subtype classification using open-source data, with caution on dataset size and interpretability. In 6th international conference on contemporary computing and informatics 2023 (pp. 1004-8). IEEE.
[Crossref] [Google Scholar]
[25]Latha DU, Mahesh TR. Analysis of deep learning and machine learning methods for breast cancer detection. In international conference on computer science and emerging technologies 2023 (pp. 1-6). IEEE.
[Crossref] [Google Scholar]
[26]Manjunathan N, Gomathi N, Muthulingam S. Early detection of breast cancer using machine learning. In international conference on sustainable computing and smart systems 2023 (pp. 165-9). IEEE.
[Crossref] [Google Scholar]
[27]https://archive.ics.uci.edu/dataset/17/breast+cancer+wisconsin+diagnostic. Accessed 27 October 2023.