复杂背景下基于YOLOv7-tiny的图像目标检测算法

薛珊; 安宏宇; 吕琼莹; 曹国华

doi:10.3788/IRLA20230472

复杂背景下基于YOLOv7-tiny的图像目标检测算法

doi: 10.3788/IRLA20230472

薛珊^{1, 2,},
安宏宇¹,
吕琼莹¹,
曹国华²

1.
长春理工大学机电工程学院，吉林长春 130022
2.
长春理工大学重庆研究院，重庆 400000

基金项目: 吉林省科技厅重点科技研发项目(20210203055SF)；吉林省教育厅科学技术研究项目

详细信息

作者简介:
安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

中图分类号: TP391

Image target detection algorithm based on YOLOv7-tiny in complex background

1.
College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China
2.
Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

Funds: Key R & D projects of Jilin Provincial Science and Technology Department (20210203055SF); Science and Technology Research Project of Jilin Provincial Department of Education

摘要: “黑飞”无人机一旦带有炸弹等物品，会对人们带来威胁。对在公园、游乐场、学校等复杂背景下“黑飞”的无人机进行目标检测是十分必要的。前沿算法YOLOv7-tiny属于轻量级网络，具有更小的网络结构和参数，更适合检测小目标，但在识别小目标无人机时出现特征提取能力弱、回归损失大、检测精度低的问题；针对此问题，提出了一种基于YOLOv7-tiny改进的无人机图像目标检测算法YOLOv7-drone。首先，建立无人机图像数据集；其次，设计一种新的注意力机制模块SMSE嵌入到特征提取网络中，增强对复杂背景下无人机目标的关注度；然后，在主干网络中融入RFB结构，扩大特征层的感受野，丰富特征信息以增强特征提取的鲁棒性；然后，改进网络中的特征融合机制，通过新增小目标检测层，增加对小尺度目标的检测精度；然后，改变损失函数提高模型的收敛速度，减少损失以增强模型的鲁棒性；最后，引入可变形卷积(Deformable convolution, DCN)，更好的根据目标本身形状进行特征提取，提升了检测精度。在PASCAL VOC公共数据集上进行对比实验，结果表明改进后的算法YOLO7-drone相比于YOLOv7-tiny，平均精度(map@0.5)提升了6%；在自制无人机数据集上进行实验，结果表明YOLOv7-drone与原算法相比，平均精度(map@0.5)提高了6.1%，并且检测速度为72帧/s；与YOLOv5l、YOLOv7目标检测算法进行对比实验，结果表明改进后的算法在平均精度(map@0.5)上分别高于对比算法4%、3.1%，验证了文中算法的可行性。
- 目标检测 /
- 复杂背景 /
- 注意力机制 /
- 小目标检测
Abstract: Objective Once the "black flying" drone carries items such as bombs, it can pose a threat to people. Target detection of "black flying" drones in complex backgrounds such as parks, amusement parks, and schools is the key to anti-drone systems in public areas. This paper aims to detect small-scale targets in complex background. Because the traditional manual image feature extraction methods are not targeted, time complexity is high, windows are redundant, the detection effect is poor, and the average accuracy is low. The problems of false detection and missing detection will occur when detecting small-scale UAVs in complex background. Therefore, this paper aims to develop a black flying UAV detection model based on deep learning, which is essential for the detection of unmanned aerial vehicles. Methods YOLOv7 is a stage target detection algorithm without anchor frame, with high detection accuracy and good inference speed. YOLOv7-tiny belongs to the grain grabbing memory model, with fewer parameters and fast operation, making it widely used in industry. In the backbone network, the built multi-scale channel attention module SMSE (Fig.5) is introduced to enhance the attention of UAVs in complex backgrounds. Between the backbone network and the feature fusion layer, the RFB feature extraction module (Fig.6) is introduced to increase the Receptive field and expand the feature information extraction. In the feature fusion, the small target detection layer is added to improve the detection ability of small UAV targets. In terms of calculating losses, the introduction of SIoU Loss function redefines the penalty index, which significantly improves the speed of training and the accuracy of reasoning. Finally, the ordinary convolution is replaced by the deformable convolution (Fig.7), making the detection closer to the shape and size of the object. Results and Discussions The dataset selected in this article is a combination of the self-made dataset (Fig.1) and the Dalian University of Technology drone dataset (Fig.2). The mainly used evaluation indicators are mAP (mean accuracy) and FPS (detection speed), Params (parameter quantity) and GFLOPS (computational quantity) as secondary indicators. Each module was compared with the original algorithm, including attention comparison experiment (Tab.1), RFB module comparison experiment (Tab.2), small target detection layer comparison experiment (Tab.3), Loss function comparison experiment (Tab.4), and deformable convolution comparison experiment (Tab.5). And ablation experiments were conducted (Tab.6), which confirmed the effectiveness and feasibility of the proposed algorithm through mAP comparison, improving accuracy by 6.1%. On this basis, the detection performance of different algorithms was compared (Tab.7), and the generalization of the algorithm was verified on the VOC public dataset (Tab.8). Conclusions This article proposes an improved object detection algorithm for anti-drone systems. Through the multi-scale channel attention module, the attention of small targets is enhanced, the fusion RFB increases the Receptive field, adds a small target detection layer to improve the detection ability, and improves the Loss function to improve the training speed and reasoning accuracy. Finally, deformable convolution is introduced to better fit the target size. The improved algorithm has achieved good detection results on different datasets.
- target detection /
- complex background /
- attention mechanism /
- small target detection
图 1 无人机数据集部分图片

Figure 1. Partial picture of drone dataset

下载: 全尺寸图片幻灯片

图 2 DUT-ANTI-UAV数据集

Figure 2. DUT-ANTI-UAV dataset

下载: 全尺寸图片幻灯片

图 3 YOLOv7-tiny算法结构图

Figure 3. Network chart of YOLOv7-tiny

下载: 全尺寸图片幻灯片

图 4 YOLOv7-drone算法结构图

Figure 4. Network chart of YOLOv7-drone

下载: 全尺寸图片幻灯片

图 5 多尺度通道注意力机制模块结构图

Figure 5. Multi-scale channel attentional mechanism module

下载: 全尺寸图片幻灯片

图 6 RFB网络结构图

Figure 6. RFB network structure diagram

下载: 全尺寸图片幻灯片

图 7 可变形卷积示意图

Figure 7. Deformable convolution diagram

下载: 全尺寸图片幻灯片

图 8 加入注意力机制前后检测结果对比图

Figure 8. Comparison chart of test results before and after adding attention mechanism

下载: 全尺寸图片幻灯片

图 9 加入RFB结构前后检测结果图

Figure 9. Before and after adding RFB structure

下载: 全尺寸图片幻灯片

图 10 引入小目标检测层前后检测结果图

Figure 10. Before and after the introduction of small target detection layer detection results map

下载: 全尺寸图片幻灯片

图 11 加入SIoU结构前后检测结果图

Figure 11. Before and after adding SIoU structure

下载: 全尺寸图片幻灯片

图 12 加入可变形卷积结构前后检测结果图

Figure 12. Before and after adding DCN structure

下载: 全尺寸图片幻灯片

图 13 不同算法检测性能对比图

Figure 13. Comparison of detection performance of different algorithms

下载: 全尺寸图片幻灯片

图 14 算法改进前后CAM对比图

Figure 14. Before and after the algorithm improvement CAM contrast chart

下载: 全尺寸图片幻灯片

表 1 引入不同注意力机制算法检测性能对比表

Table 1. Comparison of detection performance of different attention mechanism algorithms

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
+SE 6.05 84.5 75 13.2
+CBAM 6.02 84.9 73 13.3
+EMA 6.06 85.9 75 13.5
+SMSE 8.99 86.8 71 15.6

下载: 导出CSV

表 2 引入RFB结构算法检测性能对比表

Table 2. The RFB structure algorithm is introduced to detect the performance comparison

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
+XMB 6.30 85.6 77 14.7

下载: 导出CSV

表 3 引入小目标检测层算法检测性能对比表

Table 3. The small target detection layer algorithm is introduced to detect the performance comparison

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
+XMB 6.10 85.1 70 15.5

下载: 导出CSV

表 4 改进损失函数算法检测性能对比表

Table 4. Improved loss function algorithm detection performance comparison

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
+SIoU 6.02 85.0 78 13.2

下载: 导出CSV

表 5 引入可变形卷积前后检测性能对比表

Table 5. A comparison of detection performance before and after deformable convolution is introduced

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
+DCN 6.08 86.6 71 14.8

下载: 导出CSV

表 6 逐步加入各个模块算法检测性能对比表

Table 6. Gradually add each module algorithm detection performance comparison

SMSE RFB XMB SIoU DCN Params/
M mAP@0.5 FPS/
frame·s^-1 GFLOPS

6.02 84.3 74 13.2
√ 8.99 86.8 71 15.6
√ √ 9.29 87.4 73 17.1
√ √ √ 9.39 88.2 69 20.1
√ √ √ √ 9.39 88.7 73 20.1
√ √ √ √ √ 9.45 90.4 72 21.7

下载: 导出CSV

表 7 不同目标检测算法检测性能对比表

Table 7. Comparison of detection performance of different target detection algorithms

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 84.3 74 13.2
YOLOv7 37.2 87.3 57 104.8
YOLOv5l 46.1 86.4 42 107.9
YOLOv7-drone 9.45 90.4 72 21.7

下载: 导出CSV

表 8 PASCAL VOC数据集检测性能对比表

Table 8. PASCAL VOC dataset detection performance comparison

Model Params/M mAP@0.5 FPS/frame·s^-1 GFLOPS

YOLOv7-tiny 6.02 65.0 74 13.2
YOLOv7-drone 9.45 71.0 72 21.7

下载: 导出CSV

[1]	Xue Shan, Chen Yuchao, Lv Qiongying, et al. Image recognition method of anti drone system based on coordinate attention mechanism [J]. Infrared and Laser Engineering, 2022, 51(9): 20211101. (in Chinese)
[2]	Xue Shan, Lu Tao, Lv Qiongying, et al. The drone target detection algorithm based on multiscale fusion and lightweight network[J]. Journal of Hunan University (Natural Sciences) , 2023, 50(08): 82-93. (in Chinese)
[3]	Xue Shan, Zhang Yaliang, Lv Qiongying, et al. Anti-UAV system object detection algorithm under complex back ground [J]. Journal of Jilin University (Engineering and Technology Edition), 2023, 53(3): 891-901. (in Chinese)
[4]	Xue Shan, Wang Yabo, Lv Qiongying, et al. Anti-occlusion target detection algorithm for anti-UAV system based on YOLOX-drone[J]. Chinese Journal of Engineering, 2023, 45(9): 1539-1549. (in Chinese)
[5]	Xue Shan, Wei Liwei, Gu Chenyu, et al. Drone identificati- on method based on mixed domain attention mechanism [J]. Journal of Xi'an Jiao Tong University, 2022, 56(10): 141-150. (in Chinese)
[6]	Dai Xinxue, Fan Songtao, Zhou Yan. Speech enhancement method for laser microphone based on ResUnet and TFGAN networks [J/OL]. Infrared and Laser Engineering: 1-10[2023-09-20]. (in Chinese)
[7]	Gedmon J, Donahue J, Darrell T, et al. Region based convolutional Networks for accurate object detection and segmentation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 38(1): 142-158. doi: 10.1109/TPAMI.2015.2437384
[8]	Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, Real-Time object detection[C]//IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 2015: 779-788.
[9]	Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector[C]//European Conference on Computer Vision, Amsterdam, The Netherlands, 2016: 21-37.
[10]	Wang C Y, Bochkovskiy A, Liao H Y M. YOLOv7: Train-able bag-of-freebies sets new state-of-the-art for real-time object detectors [DB/OL]. (2022-06-06) [2023-07-30]. https://arxiv.org/abs/2207.02696.
[11]	Zhao J, Zhang J H, Li D D, et al. Vision-based anti-UAV detection and tracking[DB/OL]. (2022-05-22) [2023-07-30]. https://arxiv.org/abs/2205.10851
[12]	Bochkovskiy A, Wang C Y, Liao H Y M. Yolov4: Optimal speed and accuracy of object detection [DB/OL]. (2020-04-23) [2023-07-30]. https://arxiv.org/abs/2004.10934.
[13]	Zhang H Y, Cisse M, Dauphin Y N, et al. mixup: Beyond Empirical Risk Minimization[DB/OL]. (2018-04-27) [2023-07-30]. https://arxiv.org/abs/1710.09412.
[14]	Zhang X, Zeng H, Guo S, et al. Efficient longrange attention network for image superresolution [DB/OL]. (2022-03-13) [2023-07-30]. https://arxiv.org/abs/2203.06697.
[15]	He K, Zhang X, Ren S, et al. Spatial pyramid pooling in deep convolutional networks for visual recognition[C]//IEEE Trans Pattern Anal Mach Intell, 2015, 37(9): 1904.
[16]	Liu S, Qi L, Qin H, et al. Path aggregation network for instance segmentation[C]//Proceedings of the IEEE Conference on Comp Uter Vision and Pattern Recognition, Salt Lake City, 2018: 8759.
[17]	Lin T Y, Dollar P, Girshick R, et al. Feature pyramid networks for object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, 2017: 2117.
[18]	Hu J, Shen L G. Squeeze-and-excitation networks [C]//2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. June 18-23, 2018, Salt Lake City, UT, USA. IEEE, 2018: 7132-7141.
[19]	Ling Qiang, Liu Yu, Wang Chunju, et al. DN-YOLOv5 surface defect detection algorithm for metal bipolar plate [J]. Acta Harbin Institute of Technology, 2023, 55(12): 104-112. (in Chinese)
[20]	Woo S, Park J, Lee J Y, et al. Cbam: Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision (ECCV), Munich: ECCV, 2018: 3-19.

[1]	张学志, 赵红东, 刘伟娜, 赵一鸣, 关松. 基于改进YOLOv5的红外车辆检测方法 . 红外与激光工程, 2023, 52(8): 20230245-1-20230245-10. doi: 10.3788/IRLA20230245
[2]	李岳楠, 徐浩宇, 董浩. 频域内面向目标检测的领域自适应 . 红外与激光工程, 2022, 51(7): 20210638-1-20210638-9. doi: 10.3788/IRLA20210638
[3]	蒋昕昊, 蔡伟, 杨志勇, 徐佩伟, 姜波. 基于YOLO-IDSTD算法的红外弱小目标检测 . 红外与激光工程, 2022, 51(3): 20210106-1-20210106-10. doi: 10.3788/IRLA20210106
[4]	韩金辉, 魏艳涛, 彭真明, 赵骞, 陈耀弘, 覃尧, 李楠. 红外弱小目标检测方法综述 . 红外与激光工程, 2022, 51(4): 20210393-1-20210393-24. doi: 10.3788/IRLA20210393
[5]	袁帅, 延翔, 张昱赓, 秦翰林. 双邻域差值放大的高动态红外弱小目标检测方法（特邀） . 红外与激光工程, 2022, 51(4): 20220171-1-20220171-11. doi: 10.3788/IRLA20220171
[6]	蔡仁昊, 程宁, 彭志勇, 董施泽, 安建民, 金钢. 基于深度学习的轻量化红外弱小车辆目标检测算法研究 . 红外与激光工程, 2022, 51(12): 20220253-1-20220253-11. doi: 10.3788/IRLA20220253
[7]	南天章, 耿建君, 陈旭, 陈颖. 基于邻域特征的红外低慢小目标检测 . 红外与激光工程, 2019, 48(S1): 174-180. doi: 10.3788/IRLA201948.S128002
[8]	唐聪, 凌永顺, 郑科栋, 杨星, 郑超, 杨华, 金伟. 基于深度学习的多视窗SSD目标检测方法 . 红外与激光工程, 2018, 47(1): 126003-0126003(9). doi: 10.3788/IRLA201847.0126003
[9]	吴天舒, 张志佳, 刘云鹏, 裴文慧, 陈红叶. 基于改进SSD的轻量化小目标检测算法 . 红外与激光工程, 2018, 47(7): 703005-0703005(7). doi: 10.3788/IRLA201847.0703005
[10]	陈卫, 孙晓兵, 乔延利, 陈震庭, 殷玉龙. 海面耀光背景下的目标偏振检测 . 红外与激光工程, 2017, 46(S1): 63-68. doi: 10.3788/IRLA201746.S117001
[11]	张祥越, 丁庆海, 罗海波, 惠斌, 常铮, 张俊超. 基于改进LCM的红外小目标检测算法 . 红外与激光工程, 2017, 46(7): 726002-0726002(7). doi: 10.3788/IRLA201746.0726002
[12]	刘峰, 奚晓梁, 沈同圣. 基于最大值投影和快速配准的空间小目标检测 . 红外与激光工程, 2016, 45(11): 1104002-1104002(6). doi: 10.3788/IRLA201645.1104002
[13]	孙照蕾, 惠斌, 秦莫凡, 常铮, 罗海波, 夏仁波. 红外图像显著目标检测算法 . 红外与激光工程, 2015, 44(9): 2633-2637.
[14]	韩艳丽, 刘峰. 基于三角形匹配的空间小目标检测算法 . 红外与激光工程, 2014, 43(9): 3134-3140.
[15]	陈宇, 霍富荣, 刘洪志, 郑丽芹. 基于改进MACH算法的畸变目标识别 . 红外与激光工程, 2014, 43(12): 4186-4191.
[16]	黄浩, 陶华敏, 陈尚锋. 基于混合融合策略的双波段红外小目标检测方法 . 红外与激光工程, 2014, 43(9): 2827-2831.
[17]	刘志刚, 卢云龙, 魏一苇. 有监督的高光谱图像伪装目标检测方法 . 红外与激光工程, 2013, 42(11): 3076-3081.
[18]	黎志华, 李新国. 基于OpenCV的红外弱小运动目标检测与跟踪 . 红外与激光工程, 2013, 42(9): 2561-2565.
[19]	林建粦, 平西建, 马德宝. 采用DBT的漂移扫描星图小目标检测方法 . 红外与激光工程, 2013, 42(12): 3440-3446.
[20]	刘运龙, 薛雨丽, 袁素真, 毛峡. 基于局部均值的红外小目标检测算法 . 红外与激光工程, 2013, 42(3): 814-822.

点击查看大图

图(14) / 表(8)

计量

文章访问数: 213
HTML全文浏览量: 62
PDF下载量: 67
被引次数: 0

全文HTML

0. 引　言

随着科技发展，民用无人机给人们生活带来便利，但是无人机 “黑飞”事件的发生^[1-5]，给人们也带来了极大的危害。尤其是在公园、游乐场、学校等公共区域，如果“黑飞”无人机携带炸弹等危险品，将对公共安全区域带来极大威胁。在公园、大型游乐场等公共区域进行无人机的检测和跟踪是十分必要的。目标检测是跟踪的前提。无人机属于小目标；公园、游乐场等属于复杂背景；在复杂背景下进行无人机小目标的检测，并且希望能够更准确和更快速，成为亟待解决的问题。传统的无人机目标检测方法存在窗口冗余，特征鲁棒性较差等问题。自深度学习出现之后^[6]，目标检测取得了巨大的突破，包括以Faster R-CNN^[7]为代表的两阶段算法和以YOLO^[8]、SSD^[9]系列为代表的单阶段目标检测算法。在实际目标检测中，运用传统方法对在复杂场景下的无人机检测会出现无法检测、漏检的问题。采用基于深度学习的目标检测算法进行无人机的检测成为研究热点。例如，吴曼佳根据近距离空域背景复杂以及无人机目标小的特点，提出了一种基于改进的YOLOv3的小目标检测网络，在复杂背景下的小目标检测性能上有明显提升；刘朋飞等人鉴于低空复杂背景下无人机尺度多变、背景复杂的特点，提出一种基于SSD的算法。虽然上述基于深度学习的检测算法有了很大提升，但是在复杂背景下小目标检测仍然有漏检、无法检测、检测精度不高的问题。

针对无人机在复杂背景下目标较小存在难检测，易漏检的问题和无人机样本较小的问题，文中基于YOLOv7-tiny^[10]的目标检测算法，将注意力机制、特征融合、扩大感受野的思想融入到YOLOv7-tiny网络中，提出一种改进的无人机目标检测算法YOLOv7-drone算法。此算法可以解决在复杂背景下检测小目标，无人机小样本的问题，并且具有更高的准确率和速度。

5. 结束语

1）针对公园、游乐场、体育场等复杂背景下小尺度无人机的检测会出现误检、漏检、检测精度不高和检测速度不够快的问题，设计了一种基于多尺度通道注意力机制和小目标特征融合层的YOLOv7-drone无人机目标检测算法。

2）采用自制光学设备采集无人机图片，并与DUT-Anti-UAV数据集合并，共同构建文中数据集；引入多尺度通道注意力机制模块获得多尺度特征信息，并着重关注局部特征信息以便检测小尺度无人机；引入RFB结构增大感受野，提高浅层网络特征提取能力，以便在浅层结构获得高等语义信息；新增小目标特征融合检测层以适应更小尺度无人机的检测；将算法原有的损失函数替换为SIoU损失函数，改善回归损失精度，降低误差，加快网络的收敛速度；将普通卷积替换为可变形卷积以适应不同形状和大小的目标。

3）将改进后的算法YOLOv7-drone与原算法YOLOv7-tiny进行对比实验，实验结果表明，对于在复杂背景下小尺度目标情况下的无人机检测效果更好；将其与不同目标检测算法(YOLOv7、YOLOv5l、YOLOv7-tiny)相比，具有更好的检测性能。同时，在视觉公共数据集PASCAL VOC上进行对比实验，实验结果表明，改进后的YOLOv7-drone算法相较于原算法仍有更优异的检测效果。

参考文献 (20)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

复杂背景下基于YOLOv7-tiny的图像目标检测算法

doi: 10.3788/IRLA20230472

作者简介:
安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

Image target detection algorithm based on YOLOv7-tiny in complex background

计量

复杂背景下基于YOLOv7-tiny的图像目标检测算法

doi: 10.3788/IRLA20230472

1. 长春理工大学机电工程学院，吉林长春 130022

2. 长春理工大学重庆研究院，重庆 400000

作者简介:
安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

English Abstract

Image target detection algorithm based on YOLOv7-tiny in complex background

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China

2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

全文HTML

3.1. 多尺度通道注意力机制的改进

3.2. 融合RFB结构的YOLOv7-tiny模型

3.3. 基于特征融合的小目标检测

3.4. 损失函数的优化

3.5. 可变形卷积

4.1. 数据集的建立及实验环境配置

4.2. 评价指标

4.3. 改进注意力机制实验

4.4. 引入RFB结构实验

4.5. 引入小目标检测层实验

4.6. 改进损失函数实验

4.7. 引入可变形卷积实验

4.8. 消融实验

4.9. 不同目标检测算法的对比实验

4.10. VOC公共数据集实验

目录

Model	Params/M	mAP@0.5	FPS/frame·s^-1	GFLOPS
YOLOv7-tiny	6.02	84.3	74	13.2
+SE	6.05	84.5	75	13.2
+CBAM	6.02	84.9	73	13.3
+EMA	6.06	85.9	75	13.5
+SMSE	8.99	86.8	71	15.6

SMSE	RFB	XMB	SIoU	DCN	Params/ M	mAP@0.5	FPS/ frame·s^-1	GFLOPS
					6.02	84.3	74	13.2
√					8.99	86.8	71	15.6
√	√				9.29	87.4	73	17.1
√	√	√			9.39	88.2	69	20.1
√	√	√	√		9.39	88.7	73	20.1
√	√	√	√	√	9.45	90.4	72	21.7

留言板

复杂背景下基于YOLOv7-tiny的图像目标检测算法

doi: 10.3788/IRLA20230472

作者简介: 安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

Image target detection algorithm based on YOLOv7-tiny in complex background

计量

出版历程

复杂背景下基于YOLOv7-tiny的图像目标检测算法

doi: 10.3788/IRLA20230472

1. 长春理工大学 机电工程学院，吉林 长春 130022 2. 长春理工大学 重庆研究院，重庆 400000

作者简介: 安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

English Abstract

Image target detection algorithm based on YOLOv7-tiny in complex background

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China 2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

全文HTML

3.1. 多尺度通道注意力机制的改进

3.2. 融合RFB结构的YOLOv7-tiny模型

3.3. 基于特征融合的小目标检测

3.4. 损失函数的优化

3.5. 可变形卷积

4.1. 数据集的建立及实验环境配置

4.2. 评价指标

4.3. 改进注意力机制实验

4.4. 引入RFB结构实验

4.5. 引入小目标检测层实验

4.6. 改进损失函数实验

4.7. 引入可变形卷积实验

4.8. 消融实验

4.9. 不同目标检测算法的对比实验

4.10. VOC公共数据集实验

目录

作者简介:
安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

1. 长春理工大学机电工程学院，吉林长春 130022

2. 长春理工大学重庆研究院，重庆 400000

作者简介:
安宏宇，男，硕士生，主要从事现代检测理论与技术方面的研究

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China

2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China