基于改进YOLOv5算法的复杂场景交通目标检测

doi:10.12068/j.issn.1005-3026.2022.08.002

东北大学学报（自然科学版） ›› 2022, Vol. 43 ›› Issue (8): 1073-1079.DOI: 10.12068/j.issn.1005-3026.2022.08.002

基于改进YOLOv5算法的复杂场景交通目标检测

顾德英，罗聿伦，李文超

(东北大学秦皇岛分校控制工程学院，河北秦皇岛066004)

修回日期:2021-08-27 接受日期:2021-08-27 发布日期:2022-08-11
通讯作者: 顾德英
作者简介:顾德英(1964 -)，男，辽宁新民人，东北大学秦皇岛分校教授.
基金资助:
河北省自然科学基金资助项目(F2019501044).

Traffic Target Detection in Complex Scenes Based on Improved YOLOv5 Algorithm

GU De-ying， LUO Yu-lun， LI Wen-chao

School of Control Engineering， Northeastern University at Qinhuangdao， Qinhuangdao 066004， China.

Revised:2021-08-27 Accepted:2021-08-27 Published:2022-08-11
Contact: LUO Yu-lun
About author:-
Supported by:
-

摘要/Abstract

摘要： 实时的交通场景目标检测是实现电子监控、自动驾驶等功能的先决条件.针对现有的目标检测算法检测效率不高，以及大多数轻量化目标检测算法模型精度较低，容易误检、漏检目标的问题，本文通过改进YOLOv5目标检测算法来进行模型训练，再使用伪标签策略对训练过程进行优化，然后在KITTI交通目标数据集上将标签合并为3类，对训练出的模型进行测试.实验结果表明，改进的YOLOv5最终模型在该所有类别上的mAP达到了92.5％，对比原YOLOv5训练的模型提高了3％.最后将训练的模型部署到Jetson Nano嵌入式平台上进行推理测试，并通过TensorRT加速推理，测得平均每帧图像的推理时间为77ms，可以实现实时检测的目标.

关键词: 深度学习;目标检测;YOLOv5算法;伪标签训练;嵌入式平台

Abstract: Real-time target detection in traffic scenarios is the prerequisite of electronic monitoring， automatic driving， and other functions. In view of the low detection efficiency of existing target detection algorithms and the low accuracy of most light target detection algorithms， which are easy to obtain wrong or insufficient target detection， this paper adopts the improved YOLOv5 target detection algorithm for model training， and the pseudo-label strategy for training process optimization. Then， the KITTI traffic target dataset tags are merged into three categories for model training and testing. Through the experimental comparison， the improved YOLOv5 model in this paper achieves 92.5% mAP in all categories， which is 3% higher than the original YOLOv5 training model. Finally， the three categories of the trained models are deployed on the Jetson Nano embedded platform for inference testing， and TensorRT is used to accelerate inference. The average inference time per frame of image is 77ms， which meets the goal of real-time detection.

Key words: deep learning; target detection; YOLOv5 algorithm; pseudo-label training; embedded platform

中图分类号:

TP391

顾德英，罗聿伦，李文超. 基于改进YOLOv5算法的复杂场景交通目标检测[J]. 东北大学学报（自然科学版）, 2022, 43(8): 1073-1079.

GU De-ying， LUO Yu-lun， LI Wen-chao. Traffic Target Detection in Complex Scenes Based on Improved YOLOv5 Algorithm[J]. Journal of Northeastern University(Natural Science), 2022, 43(8): 1073-1079.

参考文献

[1]Hinton G E，Salakhutdinov R.Reducing the dimensionality of data with neural networks［J］.Science，2006，313(5786):504-507.
[2]Krizhevsky A，Sutskever I，Hinton G E.ImageNet classification with deep convolutional neural networks［C］//Proceedings of the 25th International Conference on Neural Information Processing Systems.New York:ACM，2012:1097-1105.
[3]Girshick R，Donahue J，Darrell T，et al.Rich feature hierarchies for accurate object detection and semantic segmentation［C］//IEEE Conference on Computer Vision and Pattern Recognition.Columbus:IEEE，2014:580-587.
[4]Ren S，He K，Girshick R，et al.Faster R-CNN:towards real time object detection with region proposal networks［J］.IEEE Transactions on Pattern Analysis and Machine Intelligence，2015，39(6):1137-1149.
[5]Tian Y，Gelernter J，Wang X，et al.Lane marking detection via deep convolutional neural network［J］.Neurocomputing，2018，280:46-55.
[6]Redmon J，Divvala S，Girshick R，et al.You only look once:unified，real-time object detection［C］//IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE，2016:779-788.
[7]Liu W，Anguelov D，Erhan D，et al.SSD:single shot multi-box detector［M］.Berlin:Springer，2016.
[8]周攀，徐向荣，朱永飞，等.基于改进YOLOv3的道路目标检测［J］.龙岩学院学报，2021，39(2):21-30.(Zhou Pan，Xu Xiang-rong，Zhu Yong-fei，et al.Road target detection based on improved YOLOv3［J］.Journal of Longyan University，2021，39(2):21-30.)
[9]刘力冉，曹杰，杨磊，等.一种改进YOLOv3-Tiny的行车检测算法［J］.计算机与现代化，2020(3):108-114.(Liu Li-ran，Cao Jie，Yang Lei，et al.An improved YOLOv3-Tiny traffic detection algorithm［J］.Computer and Modernization，2020(3):108-114.)
[10]Ma L Z，Chen Y，Zhang J L.Vehicle and pedestrian detection based on improved YOLOv4-Tiny model［J］.Journal of Physics:Conference Series，2021，1920(1):012034.
[11]秦丹峰，尹相辉，龚学余.基于改进YOLOv4-Tiny算法的移动端实时司机违章行为检测［J］.计算机科学与应用，2021，11(5):1291-1300.(Qin Dan-feng，Yin Xiang-hui，Gong Xue-yu.Real-time drivers violation detection on mobile terminal based on improved YOLOv4-Tiny［J］.Computer Science and Applications，2021，11(5):1291-1300.)
[12]Jocher G，Nishimura K，Mineeva T，et al.YOLOv5［EB/OL］.(2020-06-26)［2021-06-02］.https://github.com/ultralytics/yolov5.
[13]He K，Zhang X，Ren S，et al.Deep residual learning for image recognition［EB/OL］.(2015-12-10)［2021-06-02］.https://arxiv.org/abs/1512.03385.
[14]Trinh H C，Lee D H，Kwon Y K.PANET:a GPU based tool for fast parallel analysis of robustness dynamics and feed-forward/ feedback loop structures in large-scale biological networks［J］.PLoS ONE，2014，9(7):e103010.
[15]Rezztofighi H，Tsoi N，Gwak J Y，et al.Generalized intersection over union:a metric and a loss for bounding box regression［EB/OL］.(2019-04-15)［2021-06-02］.https://arxiv.org/pdf/1902.09630.
[16]Jocher G，Nishimura K，Mineeva T，et al.v5.0-YOLOv5-P6.1280 models，AWS，supervise.ly and YouTube integrations［EB/OL］.(2021-04-12)［2021-06-02］.http://github.com/ultralytics/yolov5releases/tag/v5.0.
[17]Geiger A，Lenz P，Urtasun R，et al.Are we ready for autonomous driving? The KITTI vision benchmark suite［C］// IEEE Conference on Computer Vision and Pattern Recognition.Las Vegas:IEEE，2012:3354-3361.

基于改进YOLOv5算法的复杂场景交通目标检测

Traffic Target Detection in Complex Scenes Based on Improved YOLOv5 Algorithm

RichHTML

PDF (PC)

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价

[1]	庞彦伟，苏畅，龙涛. 自适应构造与聚合多尺度代价体的双目立体匹配[J]. 东北大学学报（自然科学版）, 2023, 44(4): 457-468.
[2]	丁其川，王力，刘成. 融合长距离信道注意力与病理特征的肺结节分类[J]. 东北大学学报（自然科学版）, 2023, 44(4): 476-485.
[3]	李海燕，熊立昌，郭磊，李海江. 基于U-net边缘生成和超图卷积的两阶段修复算法[J]. 东北大学学报（自然科学版）, 2023, 44(3): 331-339.
[4]	杨丹，刘国如，任梦成，裴宏杨. 多尺度卷积核U-Net模型的视网膜血管分割方法[J]. 东北大学学报（自然科学版）, 2021, 42(1): 7-14.
[5]	韩东红，张宏亮，朱帅伟，齐孝龙. 面向新浪微博的情感社区检测算法[J]. 东北大学学报（自然科学版）, 2021, 42(1): 21-31.
[6]	原培新，陈鼎夫. 双能X射线高动态范围安检图像压缩算法[J]. 东北大学学报（自然科学版）, 2021, 42(1): 96-101.
[7]	魏颖, 徐楚翘, 刁兆富, 李伯群. 基于生成对抗网络的多目标行人跟踪算法[J]. 东北大学学报（自然科学版）, 2020, 41(12): 1673-1680.
[8]	李稷，徐安军. 炼钢车间多天车动态调度仿真方案[J]. 东北大学学报:自然科学版, 2020, 41(12): 1699-1707.
[9]	李占山，吕艾娜. 基于新冗余度的特征选择方法[J]. 东北大学学报:自然科学版, 2020, 41(11): 1550-1556.
[10]	王鑫，王翠荣，王聪，苑迎. 双通道多感知卷积神经网络图像超分辨率重建[J]. 东北大学学报:自然科学版, 2020, 41(11): 1564-1570.
[11]	张春雷，戴丽，刘宇，李鹤. 基于三点法和ICP算法的手术导航系统患者配准[J]. 东北大学学报:自然科学版, 2020, 41(11): 1584-1590.
[12]	陈剑，何涛，闻英友，马林涛. 基于BERT模型的司法文书实体识别方法[J]. 东北大学学报:自然科学版, 2020, 41(10): 1382-1387.
[13]	张田，田勇，王子，王昭东. 基于清晰度评价的自适应阈值图像分割法[J]. 东北大学学报:自然科学版, 2020, 41(9): 1231-1238.
[14]	张丽杰，刘建昌，谭树彬. 复杂建筑火灾中的人员疏散路径多目标规划[J]. 东北大学学报:自然科学版, 2020, 41(6): 761-766.
[15]	赵海，周冰玲，朱宏博，窦圣昶. 基于连续最大流的三维肺实质快速分割算法[J]. 东北大学学报:自然科学版, 2020, 41(4): 470-474.