A Comprehensive Review of YOLOv5: Advances in Real-Time Object Detection

Authors

  • Sandeep Kumar Jaiswal M. Tech Scholar, Department of Computer Science and Engineering, BN College of Engineering and Technology, Lucknow, India
  • Rohit Agrawal Assistant Professor, Department of Computer Science and Engineering, BN College of Engineering and Technology, Lucknow, India

Keywords:

YOLOv5, YOLOv4, Object Detection, Real-time, Performance Evaluation

Abstract

YOLOv5 represents a significant advancement in the field of real-time object detection, building upon the YOLO (You Only Look Once) series' legacy. This paper provides a comprehensive review of YOLOv5, examining its architecture, innovations, performance benchmarks, and applications. We also compare YOLOv5 with previous YOLO versions and other state-of-the-art object detection models, highlighting its strengths and limitations. Through this review, we aim to offer insights into the evolution of YOLOv5 and its impact on the field of computer vision.

 

References

R. Girshick, J. Donahue, T. Darrell, and J. Malik, "Rich feature hierarchies for accurate object detection and semantic segmentation," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2014, pp. 580-587.

A. Krizhevsky, I. Sutskever, and G. E. Hinton, "ImageNet classification with deep convolutional neural networks," in Advances in Neural Information Processing Systems, vol. 25, 2012, pp. 1097-1105.

J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You Only Look Once: Unified, Real-Time Object Detection," in Proc. IEEE Conf. Computer Vision and Pattern Recognition, 2016, pp. 779-788.

J. Redmon and A. Farhadi, "YOLOv3: An Incremental Improvement," arXiv preprint arXiv:1804.02767, 2018.

Glenn, "YOLOv5," 2020. [Online]. Available: https://github.com/ultralytics/yolov5. [Accessed: 15-May-2024].

Balwante, S. S., Kolhe, R., Pingale, N. K., & Chandel, D. S. (2024). Drowsiness Detection System: Integrating YOLOv5 Object Detection with Arduino Hardware for Real-Time Monitoring. International Journal of Innovative Research in Computer Science & Technology, 12(2), 59-66.

Xu, R., Lin, H., Lu, K., Cao, L., & Liu, Y. (2021). A forest fire detection system based on ensemble learning. Forests, 12(2), 217.

T.-Y. Lin, M. Maire, S. Belongie, et al., "Microsoft COCO: Common Objects in Context," in European Conference on Computer Vision, 2014, pp. 740-755.

M. Everingham, L. Van Gool, C. K. I. Williams, et al., "The PASCAL Visual Object Classes (VOC) Challenge," International Journal of Computer Vision, vol. 88, no. 2, pp. 303-338, 2010.

W. Liu, D. Anguelov, D. Erhan, et al., "SSD: Single Shot MultiBox Detector," in European Conference on Computer Vision, 2016, pp. 21-37.

A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, "YOLOv4: Optimal Speed and Accuracy of Object Detection," arXiv preprint arXiv:2004.10934, 2020.

Downloads

Published

2024-05-01

How to Cite

[1]
S. K. Jaiswal and R. Agrawal, “A Comprehensive Review of YOLOv5: Advances in Real-Time Object Detection”, IJIRCST, vol. 12, no. 3, pp. 75–80, May 2024.

Issue

Section

Articles