基于跨模态数据增强的红外时敏目标检测技术

王思宇; 杨小冈; 卢瑞涛; 李清格; 范继伟; 朱正杰

doi:10.3788/IRLA20220876

基于跨模态数据增强的红外时敏目标检测技术

doi: 10.3788/IRLA20220876

火箭军工程大学导弹工程学院，陕西西安 710025

基金项目: 国家自然科学基金项目（62276274）；航空科学基金项目（201851U8012）

详细信息

作者简介:
王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

中图分类号: TP391

Infrared time-sensitive target detection technology based on cross-modal data augmentation

Missile Engineering Institute, PLA Rocket Force University of Engineering, Xi'an 710025, China

Funds: National Natural Science Foundation of China (62276274); Aviation Science Foundation (201851U8012)

摘要: 目前红外时敏目标检测技术在无人巡航、精确打击、战场侦察等领域应用广泛，但有些高价值目标图像的获取难度高且成本昂贵。针对红外时敏目标图像数据匮乏、缺少用于训练的多场景多目标数据、检测效果不佳等问题，文中提出一种基于跨模态数据增强的红外时敏目标检测技术，跨模态数据增强方法为两阶段模型。首先在第一阶段通过基于CUT网络的模态转换模型将包含时敏目标的可见光图像转换为红外图像，其次在第二阶段模型中引入coordinate attention注意力机制，随机生成大量红外目标图像，实现了数据增强效果。最后提出一种基于SE模块和CBAM模块改进的Yolov5目标检测架构，实验结果表明，文中提出的Yolov5（CSP-A）目标检测技术与原网络相比，准确率提升了7.36%，召回率提升了5.43%，平均精度提升了2.74%。有效提高了红外时敏目标的检测精度，实现了红外时敏目标精确检测。
- 红外时敏目标 /
- 数据增强 /
- 模态转换 /
- 目标检测
Abstract: Objective Infrared time-sensitive targets refer to infrared targets such as ships and aircraft, which have high military value and the opportunity of attack is limited by the time window. Infrared time-sensitive target detection technology is widely used in military and civilian fields such as unmanned cruise, precision strike, battlefield reconnaissance, etc. The target detection algorithm based on deep learning has made great progress in the field of target detection due to its powerful computing power, deep network structure and a large number of labeled data. However, the acquisition of some high-value target images is difficult and costly. Therefore, the infrared time-sensitive target image data is scarce, and the multi-scene and multi-target data for training is lacking, which makes it difficult to ensure the detection effect. Based on this, this paper proposes an infrared time-sensitive target detection technology based on cross-modal data enhancement, which generates "new data" by processing the data, expands the infrared time-sensitive target data set, and improves the model detection accuracy and generalization ability. Methods We propose an infrared time-sensitive target detection technology based on cross-modal data enhancement. The cross-modal data enhancement method is a two-stage model (Fig.1). First, in the first stage, the visible light image containing time-sensitive targets is converted into infrared images through the mode conversion model based on the CUT network, and then the coordinate attention mechanism is introduced into the second stage model to randomly generate a large number of infrared target images, realizing the data enhancement effect. Finally, an improved Yolov5 target detection architecture based on SE module and CBAM module is proposed (Fig.3). Results and Discussions The proposed cross-modal infrared time-sensitive target data enhancement method combines the style migration model with the target generation model, and uses the visible light image data set to achieve infrared time-sensitive target data enhancement. We can convert remote sensing visible image into infrared image without losing size, structure and field of view, without distortion, noise, distortion and other problems. It can be seen from Fig.6 that the generated infrared time-sensitive target has good texture details and infrared characteristics, and is clearly distinguished from the background. An improved Yolov5 target detection model is proposed. SE and CBAM attention mechanisms are added to the CSP network to enhance the feature expression of the network and better achieve infrared time-sensitive target detection. It can be seen from the analysis of Tab.2 that compared with using the original data to train the deep learning detection network, the data enhancement algorithm proposed in this paper has significantly improved the detection ability of positive samples, the detection accuracy rate, the recall rate, and the average accuracy have increased by 14.57%, 5.99%, and 8.82% respectively. It can be seen from Tab.3 that compared with SSD, Fast R-CNN and Yolov5, the algorithm in this paper has a great improvement in accuracy, average accuracy and F1 index. Compared with the original Yolov5 network, the accuracy rate, the recall rate, the average accuracy, and the F1 index have increased by 7.36%, 5.43%, 2.74%, and 6.45% respectively. Some test results are shown (Fig.9). Conclusion Due to the lack of infrared time-sensitive target data and poor detection effect, we proposes a cross-modal data enhancement infrared time-sensitive target detection technology. In the aspect of two-stage model data enhancement, firstly, the visible light remote sensing image containing time-sensitive targets is converted into the target image with infrared characteristics using the mode conversion network. Secondly, the coordinate attention mechanism is introduced into the sample random generation model. Finally, the Yolov5 detection technology based on the improved CSP module is proposed. Multiple sets of experimental results show that the detection accuracy of the algorithm in this paper is up to 98.06% in the infrared time-sensitive target data set, which solves the problem of the lack of infrared time-sensitive target data and has good target detection ability.
- infrared time-sensitive targets /
- data augmentation /
- modal transformation /
- target detection

图 1 红外时敏目标数据增强两阶段模型概述

Figure 1. Overview of two-stage model for IR time-sensitive target data augmentation

下载: 全尺寸图片幻灯片

图 2 单尺度生成模型结构

Figure 2. Single-scale generative model structure

下载: 全尺寸图片幻灯片

图 3 Yolov5s目标检测总体架构

Figure 3. Overall structure of Yolov5s target detection

下载: 全尺寸图片幻灯片

图 4 添加注意力机制的CSP-A架构

Figure 4. The CSP-A structure of adding attention mechanism

下载: 全尺寸图片幻灯片

图 5 模态转换训练数据集

Figure 5. Modality transfer partial training dataset

下载: 全尺寸图片幻灯片

图 6 可见光红外图像转换结果

Figure 6. Visible infrared image conversion results

下载: 全尺寸图片幻灯片

图 7 部分生成图像数据

Figure 7. Partial generated image data

下载: 全尺寸图片幻灯片

图 8 典型红外时敏目标检测结果对比

Figure 8. Comparison of typical IR time-sensitive target detection results

下载: 全尺寸图片幻灯片

图 9 部分图像检测结果

Figure 9. Test results on partial image

下载: 全尺寸图片幻灯片

表 1 模态转换效果对比

Table 1. Comparison of modal transformation effects

	Mean	Standard deviation	Variance	Information entropy	Contrast ratio	Mean gradient
Original IR images	126.049 31	43.281 21	0.029 37	7.231 47	147.147 56	5.135 38
Transfer IR images	137.884 15	45.599 76	0.032 64	7.188 54	43.930 82	3.158 52

下载: 导出CSV

表 2 数据增强前后性能对比

Table 2. Performance comparison before and after data augmentation

Dataset	Precision	Recall	mAP@0.5	F₁
Origin	0.786 3	0.910 5	0.892 4	0.843 9
Augmentation	0.932 0	0.970 4	0.980 6	0.950 8

下载: 导出CSV

表 3 不同检测方法的对比实验

Table 3. Comparison experiments of different detection methods

Method	Precision rate	Recall rate	mAP@0.5	mAP@0.5 (ship)	mAP@0.5 (aircraft)	F₁
SSD	0.3564	0.8423	0.8271	0.7693	0.8848	0.5009
Fast R-CNN	0.4328	0.8534	0.8327	0.8564	0.8180	0.5743
Yolov5	0.8584	0.9161	0.9532	0.9687	0.9376	0.8863
Yolov5 (CSP-A)	0.9320	0.9704	0.9806	0.9807	0.9805	0.9508

下载: 导出CSV

表 4 消融实验结果

Table 4. Ablation experiment results

Number	SE	CBAM	mAP@0.5
1	-	-	0.937 6
2	-	√	0.956 8
3	√	-	0.962 4
4	√	√	0.980 5

下载: 导出CSV

[1]	Redmon J, Divvala S, Girshick R, et al. You only look once: Unified, real-time object detection[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779-788.
[2]	Yu X, Hong S, Yu J, et al. Research on a ship target data augmentation method of visible remote sensing image [J]. Chinese Journal of Scientific Instrument, 2020, 41(11): 261-269. (in Chinese)
[3]	Ma Y, Tang P, Zhao L, et al. Review of data augmentation for image in deep learning [J]. Image Graphics, 2021, 26(3): 487-502. (in Chinese) doi: 10.11834/jig.200089
[4]	Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks [J]. Communications of the ACM, 2017, 60(6): 84-90. doi: 10.1145/3065386
[5]	Taylor L, Nitschke G. Improving deep learning with generic data augmentation[C]//2018 IEEE Symposium Series on Computational Intelligence (SSCI), IEEE, 2018, 1542-1547.
[6]	Zhong Z, Zheng L, Kang G, et al. Random erasing data augmentation[C]//Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 13001-13008.
[7]	Ma D, Tang P, Zhao L. SiftingGAN: Generating and sifting labeled samples to improve the remote sensing image scene classification baseline in vitro [J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(7): 1046-1050. doi: 10.1109/LGRS.2018.2890413
[8]	Gulrajani I, Ahmed F, Arjovsky M, et al. Improved training of wasserstein gans[EB/OL]. (2017-12-25) [2022-12-06]. https://arxiv.org/abs/1704.00028.
[9]	Zheng Z, Zheng L, Yang Y. Unlabeled samples generated by gan improve the person re-identification baseline in vitro[C]//Proceedings of the IEEE International Conference on Computer Vision, 2017: 3754-3762.
[10]	Zhong Z, Zheng L, Zheng Z, et al. Camera style adaptation for person re-identification[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 5157-5166.
[11]	Liu W, Anguelov D, Erhan D, et al. SSD: Single shot multibox detector[C]//Proceedings of the IEEE European Conference on Computer Vision, 2016: 21-37.
[12]	Redmon J, Divvala S, Girshick R, et al. You only look once: unified, real-time object detection[C]//2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016: 779-788.
[13]	Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]// 2014 IEEE Comference on Computer Vision and Pattern Recognition, 2014: 580-587.
[14]	Girshick R. Fast r-cnn[C]//Proceedings of the IEEE International Conference on Computer Vision, 2015: 1440-1448.
[15]	Ju M, Luo J, Liu G, et al. ISTDet: An efficient end-to-end neural network for infrared small target detection [J]. Infrared Physics & Technology, 2021, 114: 103659. doi: 10.1016/J.INFRARED.2021.103659
[16]	Yao S, Zhu Q, Zhang T, et al. Infrared image small-target detection based on improved FCOS and spatio-temporal features [J]. Electronics, 2022, 11(6): 933. doi: 10.3390/electronics11060933
[17]	Lu X F, Bai X F, Li S X, et al. Infrared small target detection method based on the improved weighted enhanced local contrast measurement [J]. Infrared and Laser Engineering, 2022, 51(8): 20210914. (in Chinese) doi: 10.3788/IRLA20210914
[18]	Jiang R Q, Peng Y P, Xie W X, et al. Improved YOLOv4 small target detection algorithm with embedded scSE module [J]. Journal of Graphics, 2021, 42(4): 546-555. (in Chinese) doi: 10.11996/JG.j.2095-302X.2021040546
[19]	Owens A, Wu J, McDermott J H, et al. Ambient sound provides supervision for visual learning[C]//European conference on computer vision. Springer, Cham, 2016: 801-816.
[20]	Goodfellow I, Pouget-abadie J, Mirza M, et al. Generative adversarial networks [J]. Communications of the ACM, 2020, 63(11): 139-144. doi: 10.1145/3422622
[21]	Hou Q, Zhou D, Feng J. Coordinate attention for efficient mobile network design[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2021: 13713-13722.
[22]	Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 7132-7141.
[23]	Woo S, Park J, Lee J Y, et al. Cbam: Convolutional block attention module[C]//Proceedings of the European Conference on Computer Vision (ECCV), 2018: 3-19.
[24]	Li K, Wan G, Cheng G, et al. Object detection in optical remote sensing images: A survey and a new benchmark [J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2020, 159: 296-307. doi: 10.1016/j.isprsjprs.2019.11.023
[25]	Xia G S, Bai X, Ding J, et al. DOTA: A large-scale dataset for object detection in aerial images[C]//Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2018: 3974-3983.
[26]	Chen H, Qi Z, Shi Z. Remote sensing image change detection with transformers [J]. IEEE Transactions on Geoscience and Remote Sensing, 2021, 60: 1-14. doi: 10.1109/TGRS.2021.3095166

[1]	翟光, 胡圣冉, 孙一勇. 面向天基红外预警的高动态弱小目标LSTM检测方法研究 . 红外与激光工程, 2023, 52(10): 20230010-1-20230010-11. doi: 10.3788/IRLA20230010
[2]	刘芬, 孙杰, 张帅, 桑宏强, 孙秀军. 基于YOLOv5的红外船舶目标检测算法 . 红外与激光工程, 2023, 52(10): 20230006-1-20230006-12. doi: 10.3788/IRLA20230006
[3]	蔡仁昊, 程宁, 彭志勇, 董施泽, 安建民, 金钢. 基于深度学习的轻量化红外弱小车辆目标检测算法研究 . 红外与激光工程, 2022, 51(12): 20220253-1-20220253-11. doi: 10.3788/IRLA20220253
[4]	李岳楠, 徐浩宇, 董浩. 频域内面向目标检测的领域自适应 . 红外与激光工程, 2022, 51(7): 20210638-1-20210638-9. doi: 10.3788/IRLA20210638
[5]	高凡, 杨小冈, 卢瑞涛, 王思宇, 高久安, 夏海. Anchor-free轻量级红外目标检测方法（特邀） . 红外与激光工程, 2022, 51(4): 20220193-1-20220193-9. doi: 10.3788/IRLA20220193
[6]	张景程, 乔新博, 赵永强. 红外偏振摄像机动目标检测跟踪系统（特邀） . 红外与激光工程, 2022, 51(4): 20220233-1-20220233-10. doi: 10.3788/IRLA20220233
[7]	蒋昕昊, 蔡伟, 杨志勇, 徐佩伟, 姜波. 基于YOLO-IDSTD算法的红外弱小目标检测 . 红外与激光工程, 2022, 51(3): 20210106-1-20210106-10. doi: 10.3788/IRLA20210106
[8]	韩金辉, 魏艳涛, 彭真明, 赵骞, 陈耀弘, 覃尧, 李楠. 红外弱小目标检测方法综述 . 红外与激光工程, 2022, 51(4): 20210393-1-20210393-24. doi: 10.3788/IRLA20210393
[9]	黄攀, 杨小冈, 卢瑞涛, 常振良, 刘闯. 基于空间联合的红外舰船目标数据增强方法 . 红外与激光工程, 2021, 50(12): 20210281-1-20210281-10. doi: 10.3788/IRLA20210281
[10]	魏豪, 张凯, 郑磊, 曹源, 张丁文. 基于HOG-RCNN的电力巡检红外图像目标检测 . 红外与激光工程, 2020, 49(S2): 20200411-20200411. doi: 10.3788/IRLA20200411
[11]	南天章, 耿建君, 陈旭, 陈颖. 基于邻域特征的红外低慢小目标检测 . 红外与激光工程, 2019, 48(S1): 174-180. doi: 10.3788/IRLA201948.S128002
[12]	唐聪, 凌永顺, 郑科栋, 杨星, 郑超, 杨华, 金伟. 基于深度学习的多视窗SSD目标检测方法 . 红外与激光工程, 2018, 47(1): 126003-0126003(9). doi: 10.3788/IRLA201847.0126003
[13]	陈卫, 孙晓兵, 乔延利, 陈震庭, 殷玉龙. 海面耀光背景下的目标偏振检测 . 红外与激光工程, 2017, 46(S1): 63-68. doi: 10.3788/IRLA201746.S117001
[14]	孙照蕾, 惠斌, 秦莫凡, 常铮, 罗海波, 夏仁波. 红外图像显著目标检测算法 . 红外与激光工程, 2015, 44(9): 2633-2637.
[15]	彭志勇, 王向军, 卢进. 窗口热辐射下基于视觉显著性的红外目标检测方法 . 红外与激光工程, 2014, 43(6): 1772-1776.
[16]	吴明军, 许建铮, 周桢, 张亚涛. 针对运动摄像机的快速低存储开销运动目标检测算法 . 红外与激光工程, 2013, 42(8): 2275-2280.
[17]	刘志刚, 卢云龙, 魏一苇. 有监督的高光谱图像伪装目标检测方法 . 红外与激光工程, 2013, 42(11): 3076-3081.
[18]	黎志华, 李新国. 基于OpenCV的红外弱小运动目标检测与跟踪 . 红外与激光工程, 2013, 42(9): 2561-2565.
[19]	杨亚威, 李俊山, 杨威, 赵方舟. 利用稀疏化生物视觉特征的多类多视角目标检测方法 . 红外与激光工程, 2012, 41(1): 267-272.
[20]	何莲, 蔡敬菊, 张启衡. 改进的基于弦切变换的目标检测方法 . 红外与激光工程, 2012, 41(1): 239-247.

点击查看大图

图(9) / 表(4)

计量

文章访问数: 158
HTML全文浏览量: 43
PDF下载量: 64
被引次数: 0

全文HTML

0. 引　言

红外时敏目标是指打击机会受限于时间窗口，且具有极高军事价值的舰船飞机等红外目标。红外时敏目标检测技术在无人巡航、精确打击、战场侦察等领域应用广泛。为满足红外时敏目标检测精度需求，基于深度学习的方法得益于强大的算力、深层网络结构以及大量的标注数据在目标检测领域^[1]取得了巨大进展。由于具备大量的可见光遥感数据集，当前的时敏目标检测研究主要集中在可见光领域^[2]，受限于数据获取难度较大、标注成本较高，针对红外时敏目标检测的研究较少，而通过对数据进行处理生成“新数据”，则成为扩大数据集同时提高模型泛化能力的一项重要手段^[3]。

研究人员通过设计合理的神经网络模型结构，利用大量已标注的数据集计算损失函数，从而实现对目标任务的数据挖掘，通过对模型参数进行迭代优化，最终得出基于任务的深度学习模型。数据作为深度学习的驱动力，在目标检测模型训练中起到至关重要的作用，数据增强作为一种常规的增加训练数据的手段，可以有效防止模型在训练过程中的过拟合问题，并且在一定程度上提高了模型的检测精度以及泛化能力。

目前较多领域存在数据集规模较小、分布不均匀等情况。有些高价值目标图像的获取难度高且成本昂贵，为解决此类问题，部分学者对原始样本进行数据增强，从而扩充数据集目标多样性及丰富度^[4]。在图像数据增强技术中，如何在扩充宏观数据集数量的同时丰富其目标微观特征数量，则成为了研究的主要关注点。

传统的数据增强方法主要有几何变换、颜色变换等有监督的数据增强方法。通过平移、旋转、缩放、裁剪、噪声、模糊、填充等方式实现数据集中的样本增强。Taylor等人^[5]将图像裁剪运用到包含101类目标的数据集中，将精度提升了13.82%。Zhong等人^[6]提出了一种基于随机擦除的数据增强方法，使得深度学习模型学习更深层次的特征。Ma等人^[7]将椒盐、高斯等噪声加入到训练集中进行图像分类训练，结果表明，该方法对遥感分类任务并没有取得明显的精度提升。因此可以发现基于数据变形的数据增强方法虽然操作简单，但是在复杂任务场景下对深度学习模型的效能提升有限。

基于深度学习的智能数据增强方法主要体现在生成对抗网络，通过模型学习生成新的训练数据，从而产生更好的模型。Gulrajani等人提出了WGAN^[8]从而解决了模型训练过程中的梯度消失问题，使得生成的图像更加真实。为了去除数据集所带的不平衡性，Zheng等人^[9]提出DCGAN作为一种数据增强工具，模型从大多数类中学习有效特征并为少数类生成图像。通过神经风格迁移进行数据增强，可以通过选择一组k个风格，并将它们应用于训练集中的所有图像。Zhong等人^[10]利用CycleGAN将标记的训练图像进行风格转换，并与原始训练样本一起形成增强训练集。

传统的目标检测算法主要利用人工特征提取的方法，因此获取的图像信息较为片面，针对背景复杂的场景检测效果不佳，自从Alexnet在计算机视觉领域取得成功，卷积神经网络在图像目标检测领域取得了巨大的进展。现阶段基于深度学习的目标检测算法主要包括两类：单阶段检测和两阶段检测，其中单阶段检测算法主要为SSD^[11]和Yolo系列^[12]算法，两阶段算法主要包括R-CNN^[13]、Fast R-CNN^[14]等。

目前基于深度学习的红外目标检测算法相对较少，时敏目标为金属壳体，表面具有附加涂层，通过反射太阳光产生辐射，目标表面温度会高于背景温度，因此红外目标具有较强的红外特性。由于成像质量受天气影响，成像视角较高，红外图像中的时敏目标具有外观模糊、细节信息丢失严重、边界不清晰等特性。因此，常规的深度学习目标检测算法效果较差。Ju等人^[15]提出了一种端到端网络ISTDet，该方法将图像滤波与目标检测相结合，在抑制背景的同时增强目标的响应。Yao等人^[16]以图像序列的形式加入时域特征，使网络能够学习图像序列中的时空相关特征，从而实现了红外目标实时检测。Lu等人^[17]将局部对比度机制与信杂比的计算相结合，在增强图像中疑似红外弱小目标区域的同时也提高图像的信杂比。Jiang等人^[18]提出一种scSE-IYOLOv4的目标检测算法，通过在YOLOv4主干网络中嵌入scSE模块提高了目标检测精度。

针对红外时敏目标图像数据匮乏、缺乏颜色和纹理特征导致检测效果较差等问题。文中提出了一种基于跨模态数据增强的红外时敏目标检测技术，首先跨模态数据增强两阶段模型的第一阶段将包含多种时敏目标的可见光图像迁移为红外图像数据，第二阶段则在此基础上对单张红外图像进行生成式模型训练，实现样本随机生成。然后在Yolov5模型中引入SE模块和CBAM模块，增强红外时敏目标的特征提取。与同类模型相比可以发现，文中算法有效提升了红外时敏目标的检测准确率。

文中的主要贡献有：

1）提出了一种跨模态红外时敏目标数据增强方法，通过将风格迁移模型与目标生成式模型相结合，利用可见光图像数据集实现红外时敏目标数据增强。

2）提出一种基于coordinate attention注意力机制的生成器结构，增强图像目标的特征提取，同时丰富了目标的细节纹理，从而实现随机红外时敏目标样本生成。

3）提出了一种改进的Yolov5目标检测模型，在CSP网络中增加SE和CBAM注意力机制，增强了网络的特征表达，更好的实现红外时敏目标检测。

4. 结　论

针对红外时敏目标数据匮乏和检测效果不佳的问题，文中提出了一种跨模态数据增强的红外时敏目标检测技术。在两阶段模型数据增强方面，首先利用模态转换网络将包含时敏目标的可见光遥感图像转换为具备红外特性的目标图像，其次在样本随机生成模型中引入coordinate attention注意力机制，最后提出基于改进CSP模块的Yolov5检测技术。多组实验结果表明，文中算法在红外时敏目标数据集中检测准确率高达98.06%，解决红外时敏目标数据匮乏的问题的同时具有较好的目标检测能力，下一步拟对不同光谱条件下的红外图像数据进行分析实验，提升算法的准确性以及适应能力。

参考文献 (26)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于跨模态数据增强的红外时敏目标检测技术

doi: 10.3788/IRLA20220876

作者简介:
王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

Infrared time-sensitive target detection technology based on cross-modal data augmentation

计量

基于跨模态数据增强的红外时敏目标检测技术

doi: 10.3788/IRLA20220876

火箭军工程大学导弹工程学院，陕西西安 710025

作者简介:
王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

English Abstract

Infrared time-sensitive target detection technology based on cross-modal data augmentation

Missile Engineering Institute, PLA Rocket Force University of Engineering, Xi'an 710025, China

全文HTML

1.1. 可见光红外图像模态转换模型

1.1.1. 模态转换网络架构设计

1.1.2. 损失函数设计与分析

1.2. 对抗性随机样本生成模型

1.2.1. 多尺度生成对抗网络架构

1.2.2. 生成器模型改进

1.2.3. 模型训练

2.1. 目标检测网络架构

2.2. 改进特征提取层网络结构

3.1. 实验环境及数据集

3.2. 实验评估指标

3.3. 实验结果及分析

3.3.1. 模态转换实验

3.3.2. 随机样本生成实验

3.3.3. 不同模型性能对比测试

3.3.4. 消融实验

目录

留言板

基于跨模态数据增强的红外时敏目标检测技术

doi: 10.3788/IRLA20220876

作者简介: 王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

Infrared time-sensitive target detection technology based on cross-modal data augmentation

计量

出版历程

基于跨模态数据增强的红外时敏目标检测技术

doi: 10.3788/IRLA20220876

火箭军工程大学 导弹工程学院，陕西 西安 710025

作者简介: 王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

English Abstract

Infrared time-sensitive target detection technology based on cross-modal data augmentation

Missile Engineering Institute, PLA Rocket Force University of Engineering, Xi'an 710025, China

全文HTML

1.1. 可见光红外图像模态转换模型

1.1.1. 模态转换网络架构设计

1.1.2. 损失函数设计与分析

1.2. 对抗性随机样本生成模型

1.2.1. 多尺度生成对抗网络架构

1.2.2. 生成器模型改进

1.2.3. 模型训练

2.1. 目标检测网络架构

2.2. 改进特征提取层网络结构

3.1. 实验环境及数据集

3.2. 实验评估指标

3.3. 实验结果及分析

3.3.1. 模态转换实验

3.3.2. 随机样本生成实验

3.3.3. 不同模型性能对比测试

3.3.4. 消融实验

目录

作者简介:
王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究

火箭军工程大学导弹工程学院，陕西西安 710025

作者简介:
王思宇，男，博士生，主要从事视觉导航、目标检测、图像处理等方面的研究