基于坐标注意力机制融合的反无人机系统图像识别方法

薛珊; 陈宇超; 吕琼莹; 曹国华

doi:10.3788/IRLA20211101

基于坐标注意力机制融合的反无人机系统图像识别方法

doi: 10.3788/IRLA20211101

薛珊^{1, 2,},
陈宇超^1,,
吕琼莹^1,,
曹国华²

1.
长春理工大学机电工程学院，吉林长春 130022
2.
长春理工大学重庆研究院，重庆 400000

基金项目: 吉林省科技厅重点科技研发项目 (20180201058 SF)；吉林省教育厅科学技术研究项目 (JJKH20210812 KJ)

详细信息

作者简介:
薛珊，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

通讯作者: 吕琼莹，男，教授，博士生导师，博士，主要研究方向为现代检测理论与技术

中图分类号: TP391

Image recognition method of anti drone system based on coordinate attention mechanism

Xue Shan^{1, 2
,},
Chen Yuchao^1
,,
Lv Qiongying^1
,,
Cao Guohua²

1.
College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China
2.
Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

Funds: Key scientific and technological research and development projects of Jilin Provincial Science and Technology Department (20180201058 SF)；Science and technology research projects of Jilin Provincial Department of Education (JJKH20210812 KJ)

摘要: 反无人机系统是识别和打击“黑飞”无人机的有效手段，图像识别无人机是反无人机系统的关键之一。针对采集的无人机样本属于小样本、提取特征不够多，识别准确率不够高的问题，提出了一种基于迁移学习、密集卷积网络和坐标注意力机制融合的反无人机系统图像识别方法。首先，运用自制设备采集了多种无人机在不同背景下的图片，建立数据样本；其次，设计针对无人机小样本识别的基于迁移学习、坐标注意力机制和密集卷积网络融合的网络TL-CA4-DenseNet-121、基于通道注意力机制融合的网络TL-SE4-DenseNet-121等网络，运用设计的网络对小样本进行识别，并进行对比，然后分别进行了基于不同位置和不同个数的坐标注意力模块和通道注意力模块的网络识别实验；最后，将识别效果最优的网络与经典卷积神经网络模型进行对比实验。实验结果表明，提出的TL-CA4-DenseNet-121网络识别效果优于其他网络，识别的平均准确率为97.93%，F1-Score为0.9826，网络训练时间为6832 s。结果表明了该网络在识别小样本无人机方面的优越性和可行性。
- 无人机 /
- 图像识别 /
- 坐标注意力机制 /
- 密集卷积网络
Abstract: Anti drone system is an effective way to identify and attack the "black flying" drone. Image recognition drone is one of the keys of anti drone system. Aiming at the problems that the samples collected from drones are small samples, the features are not enough and the recognition accuracy is not high enough, an image recognition method of anti drone system based on transfer learning, dense convolutional network and coordinate attention mechanism was proposed. Firstly, a variety of drone images in different backgrounds were collected by using self-made device, and data samples were set up; Secondly, the network TL-CA4-DenseNet-121 based on transfer learning, coordinate attention mechanism and dense convolutional network, the network TL-SE4-DenseNet-121 based on channel attention mechanism were designed to identify small samples. The designed network was used to identify small samples and compare. The network recognition experiment of coordinate attention module and channel attention module based on different positions and different numbers were carried out respectively; Finally, the network with the best recognition effect was compared with the classical convolutional neural network models. The experimental results show that the proposed TL-CA4-DenseNet-121 network has better recognition effect than other networks, and the average accuracy of recognition is 97.93%, F1-Score is 0.9826 and training time is 6832 s. It shows the superiority and feasibility of this network in identifying small sample drones.
- drone /
- image recognition /
- coordinate attention mechanism /
- dense convolutional network

图 1 图像预处理效果图

Figure 1. Effect drawing of image preprocessing

下载: 全尺寸图片幻灯片

图 2 无人机图像数据集图片展示

Figure 2. Images display of drone image dataset

下载: 全尺寸图片幻灯片

图 3 密集块结构示意图

Figure 3. Diagram sketch of Dense Block

下载: 全尺寸图片幻灯片

图 4 TL-DenseNet-121结构图

Figure 4. Structure diagram of TL-DenseNet-121

下载: 全尺寸图片幻灯片

图 5 不同注意力机制模块。 (a) SE Block； (b) CA Block

Figure 5. Different attention mechanism modules. (a) SE Block; (b) CA Block

下载: 全尺寸图片幻灯片

图 6 融合不同个数、不同位置通道注意力模块的密集卷积网络。 (a) TL-SE1-DenseNet-121； (b) TL-SE4-DenseNet-121

Figure 6. Dense convolutional network incorporating different number of SE Blocks in different locations. (a) TL-SE1-DenseNet-121; (b) TL-SE4-DenseNet-121

下载: 全尺寸图片幻灯片

图 7 融合不同个数、不同位置坐标注意力模块的密集卷积网络。 (a) TL-CA1-DenseNet-121； (b) TL-CA4-DenseNet-121

Figure 7. Dense convolutional network incorporating different number of CA Blocks in different locations. (a) TL-CA1-DenseNet-121; (b) TL-CA4-DenseNet-121

下载: 全尺寸图片幻灯片

图 8 实验现场图片

Figure 8. Experimental scene pictures

下载: 全尺寸图片幻灯片

图 9 TL-DenseNet-121的初始训练结果图

Figure 9. Initial training results of TL-DenseNet-121

下载: 全尺寸图片幻灯片

图 10 TL-DenseNet-121的全训练结果图

Figure 10. Complete training results of TL-DenseNet-121

下载: 全尺寸图片幻灯片

图 11 DenseNet-121的训练结果图

Figure 11. Training results of DenseNet-121

下载: 全尺寸图片幻灯片

图 12 TL-CA4-DenseNet-121的全训练结果图

Figure 12. Complete training results of TL-CA4-DenseNet-121

下载: 全尺寸图片幻灯片

图 13 多种网络的对比曲线。 (a) 加入SE Block的网络对比曲线； (b) 加入CA Block的网络对比曲线

Figure 13. Comparison curves of various networks. (a) Network comparison curves introduced by SE Block; (b) Network comparison curves introduced by CA Block

下载: 全尺寸图片幻灯片

图 14 多模型的对比曲线

Figure 14. Comparison curves of various models

下载: 全尺寸图片幻灯片

表 1 无人机图像数据集分布情况

Table 1. Distribution of drone image dataset

	Simple background	Complex background
DJI mavic 2	1256	960
DJI mavic air	1088	960
Homemade Quad-Rotor drone	976	624
Caltech-UCSD-Birds-200-2011	2400
Total	8264

下载: 导出CSV

表 2 不同网络的对比结果

Table 2. Comparison results of different networks

	Average accuracy	F1-Score	Training time/s
TL-DenseNet-121	95.24%	0.9345	4985
TL-SE1-DenseNet-121	95.85%	0.9375	4984
TL-SE4-DenseNet-121	96.84%	0.9686	6362
TL-CA1-DenseNet-121	95.70%	0.9490	5602
TL-CA4-DenseNet-121	97.93%	0.9826	6832

下载: 导出CSV

表 3 多模型的对比结果

Table 3. Comparison results of different models

	Average accuracy	F1-Score	Training time/s
TL-AlexNet	94.06%	0.9151	1219
TL-VGG-16	94.49%	0.9244	6002
TL-ResNet-152	95.42%	0.9345	9023
TL-EfficientNet-B0	92.59%	0.8853	3388
TL-CA4-DenseNet-121	97.93%	0.9826	6832

下载: 导出CSV

[1]	Zhu Mengzhen, Chen Xia, Liu Xu, et al. Situation and key technology of tactical laser anti-UAV [J]. Infrared and Laser Engineering, 2021, 50(7): 20200230. (in Chinese)
[2]	Xue Shan, Zhang Zhen, Lv Qiongying, et al. Image recognition method of anti UAV system based on convolutional neural network [J]. Infrared and Laser Engineering, 2020, 49(7): 20200154. (in Chinese)
[3]	Krizhevsky A, Sutskever I, Hinton G. Imagenet classification with deep convolutional neural networks [J]. Advances in Neural Information Processing Systems, 2012, 25: 1097-1105.
[4]	Szegedy C, Liu Wei, Jia Yangqing, et al. Going deeper with convolutions[C]//IEEE Computer Society, 2014.
[5]	Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition [J]. arXiv, 2014: 1409.1556.
[6]	He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep residual learning for image recognition [C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 770-778.
[7]	Zheng Yili, Zhang Lu. Plant leaf image recognition method based on transfer learning with convolutional neural networks [J]. Transactions of the Chinese Society for Agricultural Machinery, 2018, 49(S1): 354-359. (in Chinese)
[8]	Wah C, Branson S, Welinder P, et al. The Caltech-UCSD birds-200-2011 dataset[C]//Computation & Neural Systems Technical Report, CNS-TR, 2011: 001.
[9]	Huang Gao, Liu Zhuang, Laurens V, et al. Densely connected convolutional networks[C]//IEEE Computer Society. IEEE Computer Society, 2016.
[10]	Zhu Siqi, Wang Jue, Cai Yufang. Low-dose CT denoising algorithm based on improved CycleGAN [J]. Acta Optica Sinica, 2020, 40(22): 2210002. (in Chinese)
[11]	Zhuang Fuzhen, Luo Ping, He Qing, et al. Survey on transfer learning research [J]. Journal of Software, 2015, 26(1): 26-39. (in Chinese)
[12]	Zhang Ruiqing, Li Zhangwei, Hao Jianjun, et al. Image recognition of peanut pod grades based on transfer learning with convolutional neural network [J]. Transactions of the Chinese Society of Agricultural Engineering, 2020, 36(23): 171-180. (in Chinese)
[13]	Gong Renjie, Zheng Zhihui, Cong Longjian, et al. Infrared target detection and recognition based on transfer learning with small samples [J]. Journal of Northwestern Polytechnical University, 2021, 39(S1): 84-88. (in Chinese)
[14]	Ren Huan, Wang Xuguang. Review of attention mechanism [J]. Journal of Computer Applications, 2021, 41(S1): 1-6. (in Chinese) doi: 10.11772/j.issn.1001-9081.2020101634
[15]	Zhu Zhangli, Rao Yuan, Wu Yuan, et al. Research progress of attention mechanism in deep learning [J]. Journal of Chinese Information Processing, 2019, 33(6): 1-11. (in Chinese)
[16]	Liu Gang, Guo Jiabao. Bidirectional LSTM with attention mechanism and convolutional layer for text classification [J]. Neurocomputing, 2019, 337: 325-338. doi: 10.1016/j.neucom.2019.01.078
[17]	Zhang Yu, Zhang Pengyuan, Yan Yonghong. Long short-term memory with attention and multitask learning for distant speech recognition [J]. Journal of Tsinghua University (Science and Technology), 2018, 58(3): 249-253. (in Chinese)
[18]	Hu Jie, Shen Li, Albanie S, et al. Squeeze-and-excitation networks [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020, 42(8): 2011-2023. doi: 10.1109/TPAMI.2019.2913372
[19]	Hou Qibin, Zhou Daquan, Feng Jiashi. Coordinate attention for efficient mobile network design [C]//2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021: 13708-13717.
[20]	Li Xiukun, Xu Tianyang, Ji Shoucong. Posture recognition of underwater target based on deep learning method [J]. Journal of Harbin Engineering University, 2021, 42(10): 1503-1509. (in Chinese) doi: 10.11990/jheu.202007107

[1]	王子辰, 王东鹤, 朱纬. 无人机载光电平台隔振性能测试与分析 . 红外与激光工程, 2024, 53(1): 20230432-1-20230432-7. doi: 10.3788/IRLA20230432
[2]	张骏, 朱标, 沈玉真, 张鹏. 基于引导滤波的多分支注意力残差红外图像去噪网络 . 红外与激光工程, 2022, 51(11): 20220060-1-20220060-11. doi: 10.3788/IRLA20220060
[3]	王向军, 欧阳文森. 多尺度循环注意力网络运动模糊图像复原方法 . 红外与激光工程, 2022, 51(6): 20210605-1-20210605-9. doi: 10.3788/IRLA20210605
[4]	谢冰, 万淑慧, 殷云华. 基于改进稀疏表示正则化的SR重建算法 . 红外与激光工程, 2022, 51(3): 20210468-1-20210468-10. doi: 10.3788/IRLA20210468
[5]	蔡仁昊, 程宁, 彭志勇, 董施泽, 安建民, 金钢. 基于深度学习的轻量化红外弱小车辆目标检测算法研究 . 红外与激光工程, 2022, 51(12): 20220253-1-20220253-11. doi: 10.3788/IRLA20220253
[6]	林丽, 刘新, 朱俊臻, 冯辅周. 基于CNN的金属疲劳裂纹超声红外热像检测与识别方法研究 . 红外与激光工程, 2022, 51(3): 20210227-1-20210227-9. doi: 10.3788/IRLA20210227
[7]	樊宽刚, 雷爽, 别同. 智能化无人机入侵检测与跟踪拦截系统设计与实现 . 红外与激光工程, 2022, 51(8): 20210750-1-20210750-10. doi: 10.3788/IRLA20210750
[8]	周国清, 胡皓程, 徐嘉盛, 周祥, 农学勤. 机载单频水深测量LiDAR光机系统设计 . 红外与激光工程, 2021, 50(4): 20200297-1-20200297-15. doi: 10.3788/IRLA20200297
[9]	朱孟真, 陈霞, 刘旭, 谭朝勇, 黎伟. 战术激光武器反无人机发展现状和关键技术分析 . 红外与激光工程, 2021, 50(7): 20200230-1-20200230-13. doi: 10.3788/IRLA20200230
[10]	牛得清, 伍友利, 徐洋, 许瑞. 点源红外诱饵干扰下环境复杂度量化建模 . 红外与激光工程, 2020, 49(2): 0204003-0204003. doi: 10.3788/IRLA202049.0204003
[11]	薛珊, 张振, 吕琼莹, 曹国华, 毛逸维. 基于卷积神经网络的反无人机系统图像识别方法 . 红外与激光工程, 2020, 49(7): 20200154-1-20200154-8. doi: 10.3788/IRLA20200154
[12]	张钟毓, 刘云鹏, 王思奎, 刘天赐, 林智远. 基于DRFP网络的无人机对地车辆目标识别算法 . 红外与激光工程, 2019, 48(S2): 125-133. doi: 10.3788/IRLA201948.S226001
[13]	秦玉鑫, 陈宇, 乔恒恒, 车子琪, 张公平. 灾情侦测无人机动态航迹规划算法设计 . 红外与激光工程, 2019, 48(10): 1026003-1026003(6). doi: 10.3788/IRLA201948.1026003
[14]	谢冰, 段哲民, 郑宾, 殷云华. 基于迁移学习SAE的无人机目标识别算法研究 . 红外与激光工程, 2018, 47(6): 626001-0626001(7). doi: 10.3788/IRLA201847.0626001
[15]	曲蕴杰, 莫宏伟, 王常虹. 一种用于无人机的目标颜色核相关跟踪算法研究 . 红外与激光工程, 2018, 47(3): 326001-0326001(7). doi: 10.3788/IRLA201847.0326001
[16]	黄章斌, 李晓霞, 郭宇翔, 马德跃, 赵亮. 长航时UAV蒙皮红外辐射强度的工程计算 . 红外与激光工程, 2017, 46(3): 304001-0304001(7). doi: 10.3788/IRLA201746.0304001
[17]	刘连伟, 杨淼淼, 邹前进, 姚梅, 王敏, 许振领. 无人机红外辐射建模与图像仿真 . 红外与激光工程, 2017, 46(6): 628002-0628003(7). doi: 10.3788/IRLA201746.0628002
[18]	黄楠楠, 刘贵喜, 张音哲, 姚李阳. 无人机视觉导航算法 . 红外与激光工程, 2016, 45(7): 726005-0726005(9). doi: 10.3788/IRLA201645.0726005
[19]	孙占久, 聂宏, 黄伟. 无人机红外辐射特性计算与分析 . 红外与激光工程, 2014, 43(4): 1037-1046.
[20]	何思远, 刘刚, 王玲, 唐延东. 基于无人机的输电线路设备识别方法研究 . 红外与激光工程, 2013, 42(7): 1940-1944.

点击查看大图

图(14) / 表(3)

计量

文章访问数: 196
HTML全文浏览量: 50
PDF下载量: 58
被引次数: 0

全文HTML

0. 引　言

随着无人机的普及和发展，无人机的“黑飞”行为也越来越威胁到公共安全，尤其在大型公园和游乐场等公共场所，抵制“黑飞”的反无人机系统研发迫在眉睫^[1]。识别无人机是反无人机系统的首要和关键问题之一，运用图像识别无人机是重要的识别方法之一。图像识别的关键在于图像特征的提取，使用显著对象作为图像内容的表示。根据特征提取的方式不同，分为传统图像特征手动提取方法和神经网络自动提取特征两种图像识别方法^[2]。

随着技术的发展，手动提取特征已经被神经网络自动提取所取代。图像识别的神经网络模型发展很快。2012年，Krizhevshy等人提出的AlexNet^[3]模型使ImageNet数据集的分类准确率得到明显提高；2014年，由Google团队提出的GoogLeNet^[4]模型通过引入Inception模块增加网络宽度，从而提高模型的表达能力；同年由Simonyan等人提出的VGG^[5]模型通过多个小感受野卷积核堆叠的方式代替大感受野的卷积核，从而减少网络参数；2015年，由He等人提出的ResNet^[6]模型通过引入残差块结构解决了网络因为层数增加导致的梯度消失和梯度爆炸现象。目前，大部分的卷积神经网络对大样本的识别效果好，准确率高；样本越充分，效果越好^[7]。但是对于小样本效果不佳，准确率不高。如何设计一种网络，针对无人机小样本，能够提取更多的特征，提高识别准确率成为了亟待解决的问题。

文中提出了一种基于迁移学习、密集卷积网络和坐标注意力机制融合的图像识别方法。运用迁移学习节省训练时间，并提高准确率；使用密集卷积网络识别小样本，增多样本的提取通道，增多了特征的提取；由于采用了密集卷积网络，在增加特征提取的同时，也增加了无效通道；运用坐标注意力机制突出有效通道，抑制无效通道，突出有效位置，抑制无效位置，从而提高准确率。

4. 结　论

(1) 为了解决无人机样本小、特征提取少的问题，以及提高反无人机系统中无人机图像识别的准确率，提出了一种基于迁移学习、密集卷积网络和坐标注意力机制融合的反无人机系统图像识别方法。此识别方法采用密集卷积网络增加了图像的特征提取；采用迁移学习方法提高网络的识别准确率；采用坐标注意力机制抑制无效通道，关注位置特征，提高网络的识别准确率。

(2) 运用自制设备建立无人机图像数据集。设计了针对小样本的基于迁移学习的密集卷积网络TL-DenseNet-121、设计了基于迁移学习的密集卷积网络分别和不同个数、不同位置、不同注意力机制融合的网络。针对设计的各种网络进行了识别对比实验，同时将识别效果最好的TL-CA4-DenseNet-121网络与经典卷积神经网络模型进行识别了对比实验。实验结果表明了基于迁移学习和坐标注意力机制的密集卷积网络的优越性和可行性。

参考文献 (20)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于坐标注意力机制融合的反无人机系统图像识别方法

doi: 10.3788/IRLA20211101

作者简介:
薛珊，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

通讯作者: 吕琼莹，男，教授，博士生导师，博士，主要研究方向为现代检测理论与技术

Image recognition method of anti drone system based on coordinate attention mechanism

计量

基于坐标注意力机制融合的反无人机系统图像识别方法

doi: 10.3788/IRLA20211101

1. 长春理工大学机电工程学院，吉林长春 130022

2. 长春理工大学重庆研究院，重庆 400000

作者简介:
薛珊，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

通讯作者: 吕琼莹，男，教授，博士生导师，博士，主要研究方向为现代检测理论与技术

English Abstract

Image recognition method of anti drone system based on coordinate attention mechanism

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China

2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

全文HTML

2.1. 基于迁移学习的密集卷积网络设计

2.2. 基于注意力机制融合的密集卷积网络设计

2.2.1. 基于通道注意力机制融合的密集卷积网络设计

2.2.2. 基于坐标注意力机制融合的密集卷积网络设计

3.1. 无人机图片采集

3.2. 实验环境

3.3. 基于迁移学习的密集卷积网络识别实验

3.4. 融合不同个数、不同位置、不同注意力机制模块的网络实验

3.5. 多模型对比实验

目录

留言板

基于坐标注意力机制融合的反无人机系统图像识别方法

doi: 10.3788/IRLA20211101

作者简介: 薛珊 ，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

通讯作者: 吕琼莹，男，教授，博士生导师，博士，主要研究方向为现代检测理论与技术

Image recognition method of anti drone system based on coordinate attention mechanism

计量

出版历程

基于坐标注意力机制融合的反无人机系统图像识别方法

doi: 10.3788/IRLA20211101

1. 长春理工大学 机电工程学院，吉林 长春 130022 2. 长春理工大学 重庆研究院，重庆 400000

作者简介: 薛珊 ，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

通讯作者: 吕琼莹，男，教授，博士生导师，博士，主要研究方向为现代检测理论与技术

English Abstract

Image recognition method of anti drone system based on coordinate attention mechanism

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China 2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China

全文HTML

2.1. 基于迁移学习的密集卷积网络设计

2.2. 基于注意力机制融合的密集卷积网络设计

2.2.1. 基于通道注意力机制融合的密集卷积网络设计

2.2.2. 基于坐标注意力机制融合的密集卷积网络设计

3.1. 无人机图片采集

3.2. 实验环境

3.3. 基于迁移学习的密集卷积网络识别实验

3.4. 融合不同个数、不同位置、不同注意力机制模块的网络实验

3.5. 多模型对比实验

目录

作者简介:
薛珊，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

1. 长春理工大学机电工程学院，吉林长春 130022

2. 长春理工大学重庆研究院，重庆 400000

作者简介:
薛珊，女，副教授，硕士生导师，博士，主要研究方向为现代检测理论与技术

1. College of Mechanical and Electrical Engineering, Changchun University of Science and Technology, Changchun 130022, China

2. Chongqing Research Institute, Changchun University of Science and Technology, Chongqing 400000, China