A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS

IF 6 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Machine learning and knowledge extraction Pub Date : 2023-11-20 DOI:10.3390/make5040083

Juan R. Terven, Diana-Margarita Córdova-Esparza, J. Romero-González

引用次数: 0

Abstract

YOLO has become a central real-time object detection system for robotics, driverless cars, and video monitoring applications. We present a comprehensive analysis of YOLO’s evolution, examining the innovations and contributions in each iteration from the original YOLO up to YOLOv8, YOLO-NAS, and YOLO with transformers. We start by describing the standard metrics and postprocessing; then, we discuss the major changes in network architecture and training tricks for each model. Finally, we summarize the essential lessons from YOLO’s development and provide a perspective on its future, highlighting potential research directions to enhance real-time object detection systems.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

全面回顾计算机视觉中的 YOLO 架构：从 YOLOv1 到 YOLOv8 和 YOLO-NAS

YOLO 已成为机器人、无人驾驶汽车和视频监控应用的核心实时物体检测系统。我们对 YOLO 的演变进行了全面分析，研究了从最初的 YOLO 到 YOLOv8、YOLO-NAS 和 YOLO with transformers 的每次迭代中的创新和贡献。我们首先介绍了标准指标和后处理方法，然后讨论了每个模型在网络架构和训练技巧方面的主要变化。最后，我们总结了 YOLO 开发过程中的基本经验，并展望了 YOLO 的未来，强调了增强实时目标检测系统的潜在研究方向。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊