Monocular camera-based 3D point cloud reconstruction and traffic sign detection using vision transformers and YOLOv8
Luis Alberto Chavarría Zamora1 and Pablo Soto- Quiros2
Escuela de Matemática,Instituto Tecnológico de Costa Rica,Cartago 30101,Costa Rica2
Corresponding Author : Luis Alberto Chavarría Zamora
Recieved : 23-January-2025; Revised : 22-October-2025; Accepted : 25-October-2025
Abstract
This study presents a novel method for extracting point clouds of traffic signs using a single monocular camera sensor. Traditional light detection and ranging (LiDAR) techniques, although highly accurate, are expensive, require integration with cameras for segmentation tasks, and increase overall system complexity. The proposed approach is significant as it enables the generation of accurately segmented point clouds without relying on a LiDAR sensor, which was not available to the research group. The solution is flexible, allowing substitution with equivalent algorithms for monocular depth estimation, image segmentation, camera calibration, and global positioning system (GPS) association. Furthermore, the integration of machine learning techniques is proposed for traffic sign classification.
Keywords
Monocular vision, Point cloud extraction, Traffic sign detection, Depth estimation, Image segmentation, Machine learning.
Cite this article
Zamora LAC, Quiros PS. Monocular camera-based 3D point cloud reconstruction and traffic sign detection using vision transformers and YOLOv8. International Journal of Advanced Technology and Engineering Exploration. 2025;12(131):1473-1485. DOI : 10.19101/IJATEE.2025.121220113
