Enhanced YOLOv5 for superior object detection: a comprehensive study
Dhrgam AL Kafaf1 and Noor Thamir2
Ministry of Education,General Directorate of Education of Rusafa II, Baghdad,Iraq2
Corresponding Author : Dhrgam AL Kafaf
Recieved : 20-May-2024; Revised : 21-February-2025; Accepted : 23-February-2025
Abstract
This study presents an improved methodology for object detection by introducing modifications to the front end of the you only look once (YOLOv5) model. Utilizing the Microsoft common objects in context (COCO) dataset, it features key enhancements in data preprocessing, feature extraction, and the integration of attention mechanisms within a feature pyramid network (FPN). Major modifications include improved bounding box prediction and optimized non-maximum suppression (NMS) to reduce false positives, resulting in more accurate detection. Comprehensive evaluation results demonstrate that the modified YOLOv5 significantly outperforms the baseline model across multiple metrics. Notably, the enhanced model achieves a 15.5% increase in average precision at an intersection over union (IoU) threshold of 0.5 to 0.95, improving localization and classification accuracy. Additionally, it exhibits substantial gains in precision and recall, particularly in scenarios involving small or overlapping objects. These findings highlight the effectiveness of the proposed modifications in enhancing real-time object detection capabilities. The optimized model is expected to be valuable for various practical applications, including autonomous driving, medical imaging, and security systems. By providing an efficient model with robust detection capabilities across diverse environments, this research advances the state of the art in object detection
Keywords
YOLOv5, Object detection, Feature pyramid network (FPN), Non-maximum suppression (NMS), Attention mechanisms, Precision and Recall optimization.
Cite this article
AL Kafaf D, Thamir N. Enhanced YOLOv5 for superior object detection: a comprehensive study. International Journal of Advanced Technology and Engineering Exploration. 2025;12(123):276-286. DOI : 10.19101/IJATEE.2024.111100842
