卷积神经网络结合NSST的红外与可见光图像融合

宦克为; 李向阳; 曹宇彤; 陈笑

doi:10.3788/IRLA20210139

卷积神经网络结合NSST的红外与可见光图像融合

doi: 10.3788/IRLA20210139

长春理工大学物理学院，吉林长春 130022

基金项目: 吉林省科技发展计划（20210101158JC）

详细信息

作者简介:
宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

中图分类号: TP391

Infrared and visible image fusion of convolutional neural network and NSST

College of Physics, Changchun University of Science and Technology, Changchun 130022, China

Funds: Jilin Province Science and Technology Development Plan（20210101158JC）

摘要: 传统的多尺度红外与可见光图像融合方法，所提取的图像特征固定，并不能很好的应用于各类复杂的图像环境，而深度学习可以自主选择合适图像特征，改良特征提取单一性问题，因此提出一种基于卷积神经网络与非下采样剪切波变换（NSST）相结合的红外与可见光图像融合方法。首先，用卷积神经网络提取红外目标与背景的二分类图，利用调频（FT）显著性检测算法对分类图进行精准分割，同时，利用NSST将源图像多尺度、多方向进行分解；其次，利用目标显著性结合自适应模糊逻辑算法进行低频子带融合，利用高频系数局部方差对比度方法进行高频子带融合；最后，通过NSST逆变换得到融合后图像。实验结果表明：相比于传统图像融合算法，该方法在信息熵、平均梯度、空间频率、互信息和交叉熵等多个客观评价指标上至少分别提高了0.01%、0.30%、1.43%、2.32%、1.14%。一定程度提高了融合图像对比度，丰富了背景细节信息，更有利于人眼识别，可以广泛的应用于光电侦察、光电告警、多传感器信息融合等光电信息领域。
- 图像融合 /
- 卷积神经网络 /
- 显著性提取 /
- 非下采样剪切波变换 /
- 模糊逻辑
Abstract: Traditional multi-scale infrared and visible image fusion methods couldnot be well applied to all kinds of complex image environments, because the extracted image features were fixed. However, deep learning could independently select appropriate image features to solve the unicity in feature extraction of multi-scale methods. Therefore, an infrared and visible image fusion method based on the combination of convolutional neural network and non-subsampled shear wave transform (NSST) was proposed. Firstly, the binary classification map of the infrared target and background was extracted by convolutional neural network, and the classification map was accurately segmented by frequency-tuned (FT) saliency detection algorithm. At the same time, the NSST was used to decompose the source image in multiple scales and directions; Secondly, the target saliency combined with adaptive fuzzy logic algorithm was used for the fusion of low frequency sub-bands, and the high frequency coefficient local variance contrast method was used for the fusion of high frequency sub-bands; Finally, the fused image was obtained through the inverse transformation of NSST. The experiment results show that compared with the traditional image fusion algorithm, this method improves objective evaluation indicators such as information entropy, average gradient, spatial frequency, mutual information and cross entropy at least increased by 0.01%, 0.30%, 1.43%, 2.32%, 1.14%, respectively. The contrast of fusion image is greatly improved, and the background details are enriched, which is more conducive to human eye recognition. It can be widely used in electro-optical reconnaissance, electro-optical warning, multi-sensor information fusion and other electro-optical information fields.
- image fusion /
- convolutional neural network /
- saliency extraction /
- NSST /
- fuzzy logic

图 1 卷积神经网络结构

Figure 1. Convolutional neural network structure

下载: 全尺寸图片幻灯片

图 2 交叉熵损失函数

Figure 2. Cross entropy loss function

下载: 全尺寸图片幻灯片

图 3 红外图像“UN Camp”及多种方法显著性提取后图像。 (a) 红外图像“UN Camp”；(b) 图像“UN Camp”标准分割图；(c) AC方法；(d) SR方法；(e) LC方法；(f) FT方法；(g) CNN方法；(h) CNN+FT方法

Figure 3. Infrared image ''UN Camp''and images after saliency extraction by various methods. (a) Infrared image "UN Camp"; (b) Standard segmentation of image "UN Camp"; (c) AC method; (d) SR method; (e) LC method; (f) FT method; (g) CNN method; (h) CNN+FT method

下载: 全尺寸图片幻灯片

图 4 红外图像“dune”及多种方法显著性提取后图像。 (a) 红外图像“dune”；(b) 图像“dune”标准分割图；(c) AC方法(d) SR方法；(e) LC方法；(f) FT方法；(g) CNN方法；(h) CNN+FT方法

Figure 4. Infrared image ''dune'' and images after saliency extraction by various methods. (a) Infrared image "dune"; (b) Standard segmentation of image "dune"; (c) AC method; (d) SR method; (e) LC method; (f) FT method; (g) CNN method; (h) CNN+FT method

下载: 全尺寸图片幻灯片

图 5 基于卷积神经网络与NSST的图像融合模型

Figure 5. Image fusion model based on convolutional neural network and NSST

下载: 全尺寸图片幻灯片

图 6 “UN Camp”红外和可见光图像以及融合结果。(a) 红外图像；(b) 可见光图像；(c) DWT方法；(d) CS方法；(e) BEMD方法；(f) NSCT+FL方法；(g) NSST+FL方法；(h) 文中方法；(i) 显著区域融合图像

Figure 6. ''UN Camp'' infrared and visible images and fusion results. (a) Infrared image; (b) Visible image; (c) DWT method; (d) CS method; (e) BEMD method; (f) NSCT+FL method; (g) NSST+FL method; (h) Proposed method; (i) Significant area fusion image

下载: 全尺寸图片幻灯片

图 7 “dune”红外和可见光图像以及融合结果。（a) 红外图像；(b) 可见光图像；(c) DWT方法；(d) CS方法；(e) BEMD方法；(f) NSCT+FL方法；(g) NSST+FL方法；(h) 文中方法；(i) 显著区域融合图像

Figure 7. ''dune'' infrared and visible images and fusion results. (a) Infrared image; (b) Visible image; (c) DWT method; (d) CS method; (e) BEMD method; (f) NSCT+FL method; (g) NSST+FL method; (h) Proposed method; (i) Significant area fusion image

下载: 全尺寸图片幻灯片

图 8 “iron”红外和可见光图像以及融合结果。（a) 红外图像；(b) 可见光图像；(c) DWT方法；(d) CS方法；(e) BEMD方法；(f) NSCT+FL方法；(g) NSST+FL方法；(h) 文中方法；(i) 显著区域融合图像

Figure 8. ''iron'' infrared and visible images and fusion results. (a) Infrared image; (b)Visible image; (c) DWT method; (d) CS method; (e) BEMD method; (f) NSCT+FL method; (g) NSST+FL method; (h) Proposed method; (i) Significant area fusion image

下载: 全尺寸图片幻灯片

表 1 目标显著性提取评价指标MAE

Table 1. Target significance extraction evaluation index MAE

Method	AC	SR	LC	FT	CNN	CNN+FT
MAE₁	15.64	1.00	8.59	9.34	0.50	0.27
MAE₂	8.45	0.55	2.91	2.91	0.63	0.35

下载: 导出CSV

表 2 红外与可见光图像融合效果评价

Table 2. Infrared and visible image fusion effect evaluation

Image	Method	E	AG	SF	MI	CE
UN	DWT	6.934 2	7.049 2	13.921 7	2.668 3	0.356 3
Camp	CS	6.252 7	4.965 9	10.300 5	1.593 3	0.599 9
	BEMD	6.603 8	6.146 2	12.007 7	1.573 8	0.575 9
	NSCT+FL	6.810 5	7.110 9	14.126 5	2.432 8	0.326 4
	NSST+FL	6.855 5	7.051 6	14.475 8	2.355 0	0.279 8
	Proposed method	7.116 3	8.008 2	16.106 9	2.991 9	0.264 8
Dune	DWT	6.657 4	6.507 5	12.371 8	2.594 3	0.307 2
	CS	5.903 8	4.694 8	9.790 9	1.188 4	0.654 5
	BEMD	6.156 6	5.426 7	10.391 4	1.196 5	0.576 0
	NSCT+FL	6.675 8	6.578 3	15.540 9	2.222 0	0.292 5
	NSST+FL	6.666 7	7.365 3	13.954 2	2.637 0	0.301 4
	Proposed method	6.701 1	7.386 8	14.153 9	2.815 7	0.289 2
Iron	DWT	6.677 8	12.647 6	33.730 4	3.617 3	0.550 3
	CS	6.537 2	7.647 5	20.025 4	3.177 3	0.498 7
	BEMD	6.677 7	9.090 6	23.349 1	3.392 5	0.539 7
	NSCT+FL	6.767 7	14.998 1	38.969 7	3.397 5	0.474 5
	NSST+FL	6.751 1	15.645 4	40.410 4	3.177 4	0.427 9
	Proposed method	6.768 7	16.282 2	41.718 8	3.701 3	0.409 3

下载: 导出CSV

[1]	Lu Jiuming, Meng Weihua. Infrared small target detection based on fully convolutional neural network and visual saliency [J]. Acta Photonica Sinica, 2020, 49(7): 0710003. (in Chinese)
[2]	Yan Ge, Xu Tingfa, Ma Xu, et al. Hyperspectral image com-pression sensing based on dynamic measurement [J]. Chinese Optics, 2018, 11(4): 550-559. (in Chinese)
[3]	Chen Qingjiang, Zhang Yanbo, Chai Yuzhou, et al. Fusionof infrared and visible images based on finite discrete she-arlet domain [J]. Chinese Optics, 2016, 9(5): 523-531. (in Chinese) doi: 10.3788/co.20160905.0523
[4]	Zhou Yuren, Geng Anhui, Zhang Qian, et al. Fusion of in-frared and visible images based on compressive sensing [J]. Optics and Precision Engineering, 2015, 23(3): 855-863. (in Chinese) doi: 10.3788/OPE.20152303.0855
[5]	Wang Wenxiu, Fu Yutian, Dong Feng, et al. Infrared ship T-arget detection method based on deep convolution neural network [J]. Acta Optica Sinica, 2018, 38(7): 0712006. (in Chinese)
[6]	Zeng Hanlin, Meng Xiangyong, Qian Weixian. Image fusion algorithm based on DOG filter [J]. Infrared and Laser Engineering, 2020, 49(S1): 20200091. (in Chinese)
[7]	Wang Xi, Ji Tongbo, Liu Fu. Fusion of infrared and visible images based on target segmentation and compressed sens-ing [J]. Optics and Precision Engineering, 2016, 24(7): 1743-1753. (in Chinese) doi: 10.3788/OPE.20162407.1743
[8]	Dai Jindun, Liu Yadong, Mao Xianyin, et al. Infrared and visi-ble light image fusion based on FDST and dual-channel PCNN [J]. Infrared and Laser Engineering, 2019, 48(2): 0204001. (in Chinese)
[9]	Li Jiao, Yang Yangchun, Dang Jianwu, et al. NSST and gui-ded filtering for multi-focus image fusion algorithm [J]. Journal of Harbin Institute of Technology, 2018, 50(11): 145-152. (in Chinese) doi: 10.11918/j.issn.0367-6234.201805006
[10]	Rahman M A, Liu S, Wong C Y. Multi-focal image fusion using degree of focus and fuzzy logic [J]. Digital Signal Processing, 2017, 1(60): 1-19.
[11]	He Kangjian, Zhou Dongming, Zhang Xuejie, et al. Multi-focus image fusion combining focus region level partition and pulsecoupled neural network [J]. Soft Computing, 2019(23): 4685-4699.
[12]	Li Hui, Wu Xiaojun. Infrared and visible image fusion with ResNet and zero phase component analysis [J]. Infrared Physics and Technology, 2019, 11(102): 103039.
[13]	Yong Yong, Nie Zhipeng, Huang Shuying, et al. Multi-level features convolutional neural network for multi-focus image fusion [J]. IEEE Transactions on Computational Imaging, 2019, 5(2): 262-273. doi: 10.1109/TCI.2018.2889959
[14]	An Wenbo, Wang Hongmei. Infrared and visible image fu-sion with supervised convolutional neural network [J]. Optik, 2020: 165120.
[15]	Liu Yu, Chen Xun, Cheng Juan, et al. Infrared and visible image fusion with convolutional neural networks [J]. International Journal of Wavelets, Multiresolution and Information Processing, 2018, 6(5): 1850018.
[16]	Toet A. TNO Image fusion dataset[DB/OL]. [2014-04-26]. https://figshare.om/articles/TN/Image/Fusion/Data-setYang

[1]	陆建华. 融合CNN和SRC决策的SAR图像目标识别方法 . 红外与激光工程, 2022, 51(3): 20210421-1-20210421-7. doi: 10.3788/IRLA20210421
[2]	齐悦, 董云云, 王溢琴. 基于汇聚级联卷积神经网络的旋转人脸检测方法 . 红外与激光工程, 2022, 51(12): 20220176-1-20220176-8. doi: 10.3788/IRLA20220176
[3]	刘瀚霖, 辛璟焘, 庄炜, 夏嘉斌, 祝连庆. 基于卷积神经网络的混叠光谱解调方法 . 红外与激光工程, 2022, 51(5): 20210419-1-20210419-9. doi: 10.3788/IRLA20210419
[4]	庄子波, 邱岳恒, 林家泉, 宋德龙. 基于卷积神经网络的激光雷达湍流预警 . 红外与激光工程, 2022, 51(4): 20210320-1-20210320-10. doi: 10.3788/IRLA20210320
[5]	李保华, 王海星. 基于增强卷积神经网络的尺度不变人脸检测方法 . 红外与激光工程, 2022, 51(7): 20210586-1-20210586-8. doi: 10.3788/IRLA20210586
[6]	蒋筱朵, 赵晓琛, 冒添逸, 何伟基, 陈钱. 采用传感器融合网络的单光子激光雷达成像方法 . 红外与激光工程, 2022, 51(2): 20210871-1-20210871-7. doi: 10.3788/IRLA20210871
[7]	闵莉, 曹思健, 赵怀慈, 刘鹏飞. 改进生成对抗网络实现红外与可见光图像融合 . 红外与激光工程, 2022, 51(4): 20210291-1-20210291-10. doi: 10.3788/IRLA20210291
[8]	李霖, 王红梅, 李辰凯. 红外与可见光图像深度学习融合方法综述 . 红外与激光工程, 2022, 51(12): 20220125-1-20220125-20. doi: 10.3788/IRLA20220125
[9]	高泽宇, 李新阳, 叶红卫. 流场测速中基于深度卷积神经网络的光学畸变校正技术 . 红外与激光工程, 2020, 49(10): 20200267-1-20200267-10. doi: 10.3788/IRLA20200267
[10]	徐云飞, 张笃周, 王立, 华宝成. 非合作目标局部特征识别轻量化特征融合网络设计 . 红外与激光工程, 2020, 49(7): 20200170-1-20200170-7. doi: 10.3788/IRLA20200170
[11]	裴晓敏, 范慧杰, 唐延东. 多通道时空融合网络双人交互行为识别 . 红外与激光工程, 2020, 49(5): 20190552-20190552-6. doi: 10.3788/IRLA20190552
[12]	薛珊, 张振, 吕琼莹, 曹国华, 毛逸维. 基于卷积神经网络的反无人机系统图像识别方法 . 红外与激光工程, 2020, 49(7): 20200154-1-20200154-8. doi: 10.3788/IRLA20200154
[13]	张秀, 周巍, 段哲民, 魏恒璐. 基于卷积稀疏自编码的图像超分辨率重建 . 红外与激光工程, 2019, 48(1): 126005-0126005(7). doi: 10.3788/IRLA201948.0126005
[14]	郭强, 芦晓红, 谢英红, 孙鹏. 基于深度谱卷积神经网络的高效视觉目标跟踪算法 . 红外与激光工程, 2018, 47(6): 626005-0626005(6). doi: 10.3788/IRLA201847.0626005
[15]	郭全民, 王言, 李翰山. 改进IHS-Curvelet变换融合可见光与红外图像抗晕光方法 . 红外与激光工程, 2018, 47(11): 1126002-1126002(9). doi: 10.3788/IRLA201847.1126002
[16]	张腊梅, 陈泽茜, 邹斌. 基于3D卷积神经网络的PolSAR图像精细分类 . 红外与激光工程, 2018, 47(7): 703001-0703001(8). doi: 10.3788/IRLA201847.0703001
[17]	杨风暴, 蔺素珍. 基于变换域多合成规则的双色中波红外图像融合 . 红外与激光工程, 2014, 43(11): 3663-3669.
[18]	张勇, 金伟其. 夜视融合图像质量主观评价方法 . 红外与激光工程, 2013, 42(2): 528-532.
[19]	杨扬, 戴明, 周箩鱼. 基于均匀离散曲波变换的多聚焦图像融合 . 红外与激光工程, 2013, 42(9): 2547-2552.
[20]	纪超, 刘慧英, 邵刚, 孙景峰. 基于生物激励计算模型在图像显著性提取中的研 . 红外与激光工程, 2013, 42(3): 823-828.

点击查看大图

图(8) / 表(2)

计量

文章访问数: 475
HTML全文浏览量: 203
PDF下载量: 60
被引次数: 0

全文HTML

0. 引　言

红外图像是红外探测器通过测量物体表面的红外辐射并进一步转换后生成的图像，其在分辨率、对比度等方面不如可见光图像，其受天气条件制约较少；可见光图像具有分辨率高、对比度好等优势，但是其受到天气条件等因素影响，不能够全天候进行工作，因此若将红外图像与可见光图像的互补信息进行有效融合，可以形成更加适合人类视觉认知系统的新图像^[1-3]。融合后生成的新图像鲁棒性强，具有更加丰富的细节信息，可以被广泛的应用于光电侦察、光电告警、多传感器信息融合等光电信息领域^[4-5]。

传统的多尺度融合方法如非下采样剪切波变换（Non-subsampled Shearlet Transform，NSST）性能表现良好，从而被广泛的应用到图像融合领域之中，但一般融合方式的图像特征器需要手动设计，运算效率较低，所提取的图像特征固定、单一，并不能很好的应用于各类复杂的图像环境，影响融合图像的质量^[6-10]。近年来，随着深度学习理论的发展，深度学习因其强大的特征学习能力在图像融合领域中取得了优异的成果^[11-12]。Yang等提出了基于卷积神经网络（Convolutional Neural Network，CNN）的图像融合方法，使融合图像特征更为清晰，提升了融合图像质量^[13]。An等提出基于监督卷积神经网络的红外与可见光图像融合算法，该方法能较好地保留红外图像的目标特征^[14]。传统的深度学习图像融合算法，在的训练过程中并没有对图像进行高低频的划分，影响了融合图像的视觉效果。而通过深度学习结合多尺度算法可以更好的提取源图像的低频信息，有效保留源图像的高频信息，从而进一步提升融合图像质量。

综上，文中首次提出了基于卷积神经网络与NSST相结合的红外与可见光图像融合方法。首先，将具有目标、背景二分功能的卷积神经网络模型训练成功，并将红外图像输入训练好的模型之中，区分红外图像的目标与背景；其次，结合调频（Frequency-tuned，FT)显著性检测算法进行目标特征提取，得到目标显著图；最后，利用NSST结合目标显著图，得到红外与可见光融合图像。实验结果表明，相比于传统多尺度图像融合方法，文中方法在融合效果的主客观评价上均有明显提高，融合图像更加清晰，细节更加丰富，更有利于观察者识别。

4. 结　论

提出了一种新型的卷积神经网络结合NSST的红外与可见光图像融合方法。文中首次利用卷积神经网络和调频显著性相结合方法获取红外图像目标显著图，该方法相比于AC、SR、LC、FT、CNN方法明显降低了与标准分割图之间的平均绝对误差，提高了目标提取准确性；同时，首次使用自适应的模糊逻辑方法结合目标显著图的方式进行低频图像融合，突出了源图像的特征。实验结果表明，相比于传统算法，文中方法可以更好的提取源图像的特征信息，提高了融合图像对比度，图像更加清晰，更有利于人眼观察，同时，融合图像在信息熵、平均梯度、空间频率、互信息、交叉熵等多个客观评价指标上至少分别提高了0.01%、0.30%、1.43%、2.32%、1.14%。该方法可以很好的应用于光电侦察告警、红外搜索与跟踪、多传感器信息融合等光电信息领域。

参考文献 (16)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

卷积神经网络结合NSST的红外与可见光图像融合

doi: 10.3788/IRLA20210139

作者简介:
宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

Infrared and visible image fusion of convolutional neural network and NSST

计量

卷积神经网络结合NSST的红外与可见光图像融合

doi: 10.3788/IRLA20210139

长春理工大学物理学院，吉林长春 130022

作者简介:
宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

English Abstract

Infrared and visible image fusion of convolutional neural network and NSST

College of Physics, Changchun University of Science and Technology, Changchun 130022, China

全文HTML

1.1. 卷积网络模型构建与训练

1.2. 卷积神经网络的目标显著性提取

2.1. 图像融合模型

2.2. 融合规则

目录

留言板

卷积神经网络结合NSST的红外与可见光图像融合

doi: 10.3788/IRLA20210139

作者简介: 宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

Infrared and visible image fusion of convolutional neural network and NSST

计量

出版历程

卷积神经网络结合NSST的红外与可见光图像融合

doi: 10.3788/IRLA20210139

长春理工大学 物理学院，吉林 长春 130022

作者简介: 宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

English Abstract

Infrared and visible image fusion of convolutional neural network and NSST

College of Physics, Changchun University of Science and Technology, Changchun 130022, China

全文HTML

1.1. 卷积网络模型构建与训练

1.2. 卷积神经网络的目标显著性提取

2.1. 图像融合模型

2.2. 融合规则

目录

作者简介:
宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究

长春理工大学物理学院，吉林长春 130022

作者简介:
宦克为，男，副教授，硕士生导师，博士，主要从事红外成像技术与近红外光谱分析技术方面的研究