A Comprehensive Review of YOLOv5: Advances in Real-Time Object Detection
Keywords:
YOLOv5, YOLOv4, Object Detection, Real-time, Performance EvaluationAbstract
YOLOv5 represents a significant advancement in the field of real-time object detection, building upon the YOLO (You Only Look Once) series' legacy. This paper provides a comprehensive review of YOLOv5, examining its architecture, innovations, performance benchmarks, and applications. We also compare YOLOv5 with previous YOLO versions and other state-of-the-art object detection models, highlighting its strengths and limitations. Through this review, we aim to offer insights into the evolution of YOLOv5 and its impact on the field of computer vision.
References
R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2014, pp. 580-587.
A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems, vol. 25, 2012, pp. 1097-1105.
J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2016, pp. 779-788.
J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv preprint arXiv:1804.02767, 2018.
Glenn, "YOLOv5," 2020. [Online]. Available: https://github.com/ultralytics/yolov5. [Accessed: 15-May-2024].
Balwante, S. S., Kolhe, R., Pingale, N. K., & Chandel, D. S. (2024). Drowsiness Detection System: Integrating YOLOv5 Object Detection with Arduino Hardware for Real-Time Monitoring. International Journal of Innovative Research in Computer Science & Technology, 12(2), 59-66.
Xu, R., Lin, H., Lu, K., Cao, L., & Liu, Y. (2021). A forest fire detection system based on ensemble learning. Forests, 12(2), 217.
T.-Y. Lin, M. Maire, S. Belongie, et al., "Microsoft COCO: Common Objects in Context," in European Conference on Computer Vision, 2014, pp. 740-755.
M. Everingham, L. Van Gool, C. K. I. Williams, et al., "The PASCAL Visual Object Classes (VOC) Challenge," International Journal of Computer Vision, vol. 88, no. 2, pp. 303-338, 2010.
W. Liu, D. Anguelov, D. Erhan, et al., "SSD: Single Shot MultiBox Detector," in European Conference on Computer Vision, 2016, pp. 21-37.
A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv preprint arXiv:2004.10934, 2020.