并行多特征提取网络的红外图像增强方法

庞忠祥; 刘勰; 刘桂华; 龚泿军; 周晗; 罗洪伟

doi:10.3788/IRLA20210957

并行多特征提取网络的红外图像增强方法

doi: 10.3788/IRLA20210957

1.
西南科技大学信息工程学院，四川绵阳 621000
2.
深圳市朗驰欣创科技股份有限公司，广东深圳 518000

基金项目: 国家自然科学基金(11602292)；四川省科技支撑计划(2021YFG0380)

详细信息

作者简介:
庞忠祥，男，硕士生，主要从事深度学习、图像处理以及计算机视觉等方面的研究

刘桂华，女，教授，博士生导师，博士，研究方向为机器人场景智能感知、图像处理、机器视觉以及 FPGA 集成电路设计等

中图分类号: TP391

Parallel multifeature extracting network for infrared image enhancement

1.
School of Information Engineering, Southwest University of Science and Technology, Mianyang 621000, China
2.
Shenzhen Launch Digital Technology Co., Ltd, Shenzhen 518000, China

Funds: National Natural Science Foundation of China (11602292)；Science and Technology Support Plan of Sichuan Province(2021YFG0380)

摘要: 为解决低质量红外图像细节模糊、对比度低等问题，提出了并行多特征提取网络的红外图像增强方法，设计了结构特征映射网络和双尺度特征提取网络。结构特征映射网络用于建立全局结构特征权重，以保持原始图像的空间结构信息。双尺度特征提取网络采用多尺度卷积层和融合多空洞卷积的注意力，增强网络对上下文信息的关注力，提升网络对感兴趣区域的特征提取能力，同时学习不同尺度的特征信息，完成双尺度间信息的交换，生成目标增强映射，实现目标区域细节纹理自适应增强。实验证明，所提方法能有效提高对比度，避免过增强，丰富图像细节纹理，减少伪影和光晕现象，在BSD200数据集上的PSNR与SSIM较典型的传统方法和深度学习方法分别提升了约37.35%、2.1%与25.94%、3.15%，在真实红外数据集上分别提升了约30.62%、1.04%与24.83%、2.08%，且对不同对比度因子的低质量图像，文中方法也具有良好的增强效果。
- 红外图像 /
- 图像增强 /
- 深度学习 /
- 空洞卷积 /
- 注意力机制
Abstract: To solve the problems of fuzzy details and low contrast of low-quality infrared images, a parallel multifeature extraction network for infrared image enhancement is proposed, and a structural feature mapping network and a two-scale feature extraction network are designed. The structural feature mapping network is used to establish the global structural feature weight to maintain the spatial structure information of the original images. The two-scale feature extraction network using multiscale convolutional layers and the attention mechanism fused dilated convolutions is applied to enhance the attention on contextual information, improve the feature extraction capability for regions of interest, and simultaneously learn feature information of different scales, complete the exchange of information of the two scales, and then generate a target enhancement map to achieve adaptive enhancement of detailed texture of target areas. Experiments have proven that the proposed method can effectively improve contrast, avoid overenhancement, enrich image details and textures, and reduce artifacts and halos. Compared with typical traditional methods and deep learning methods, the PSNR and SSIM on the BSD200 dataset are increased by approximately 37.35%, 2.1% and 25.94%, 3.15%, and increased by approximately 30.62%, 1.04% and 24.83%, 2.08% on real infrared images. The proposed method also has good generalization performance on low-quality images with different contrast factors as well.
- infrared image /
- image enhancement /
- deep learning /
- dilated convolution /
- attention mechanism

图 1 整体网络结构

Figure 1. Architecture of the overall network

下载: 全尺寸图片幻灯片

图 2 多尺度特征提取模块

Figure 2. Module of the MS-feature extraction

下载: 全尺寸图片幻灯片

图 3 注意力模块

Figure 3. Architecture of attention block

下载: 全尺寸图片幻灯片

图 4 解码块结构

Figure 4. Architecture of decoder block

下载: 全尺寸图片幻灯片

图 5 部分训练样本对

Figure 5. Part of the training sample pairs

下载: 全尺寸图片幻灯片

图 6 $ \alpha \in \left[\mathrm{0.5,0.51}\right] $条件下BSD200图像增强效果

Figure 6. Image enhancement on BSD200 with $ \alpha \in \left[\mathrm{0.5,0.51}\right] $

下载: 全尺寸图片幻灯片

图 7 $ \alpha \in \left[\mathrm{0.5,0.51}\right] $条件下真实红外图像测试结果

Figure 7. Test result on real infrared images with $ \alpha \in \left[\mathrm{0.5,0.51}\right] $

下载: 全尺寸图片幻灯片

图 8 在不同$ \alpha $作用下文中方法在BSD200数据集的图像增强效果

Figure 8. Image enhancement effect on BSD200 with different $ \alpha $ using proposed method

下载: 全尺寸图片幻灯片

表 1 结构特征权重映射块参数

Table 1. Parameters of SFW map blocks

Type	K	s	p	c
Conv	1	1	0	32
conv	3	1	1	32
Conv	5	1	2	32
Conv	3	1	1	32

下载: 导出CSV

表 2 双尺度特征提取块参数

Table 2. Parameters of TSFEB

Path	Type	K	s	p	c
2_1	Conv	3	2	1	48
2_2	Conv	1	1	0	32
2_3	MS-FE	/	/	/	62
2_4	MS-FE	/	/	/	96
2_5	Conv	3	1	1	96
3_1	Conv	4	4	0	96
3_2	Conv	1	1	0	64
3_3	AB	/	/	/	64
3_4	Conv	3	1	2	64
3_5	Conv	3	1	2	64
3_6	Conv	3	1	2	64
3_7	AB	/	/	/	64
3_8	deconv	4	2	1	96

下载: 导出CSV

表 3 ${\boldsymbol{\alpha}} $ ∈ [0.5, 0.51]条件下BSD200数据集测试结果

Table 3. Test result on BSD200 with ${\boldsymbol{\alpha}} $∈ [0.5, 0.51]

Method	HE	CLAHE	SSR	MSR	TEN	TIECNN	IE-GAN	Proposed
PSNR	15.95	22.19	16.51	17.57	25.07	24.60	26.23	35.42
SSIM	0.72	0.93	0.88	0.90	0.82	0.80	0.92	0.95

下载: 导出CSV

表 4 ${\boldsymbol{\alpha}}$∈ [0.5, 0.51]条件下真实红外图像增强效果

Table 4. Test result on real infrared images with ${\boldsymbol{\alpha}}$∈ [0.5, 0.51]

Method	HE	CLAHE	SSR	MSR	TEN	TIECNN	IE-GAN	Proposed
PSNR	13.06	24.78	15.89	18.36	25.77	23.25	26.85	35.72
SSIM	0.53	0.95	0.89	0.93	0.89	0.88	0.94	0.96

下载: 导出CSV

表 5 在不同${\boldsymbol{\alpha}}$作用下文中方法在BSD200数据集的PSNR和SSIM

Table 5. PSNR and SSIM on BSD200 with different ${\boldsymbol{\alpha}}$ using proposed method

$ \alpha $	$ \left[\mathrm{0.1,0.11}\right] $	$ \left[\mathrm{0.2,0.21}\right] $	$ \left[\mathrm{0.3,0.31}\right] $	$ \left[\mathrm{0.4,0.41}\right] $
PSNR	30.5762	31.0370	28.1290	32.0594
SSIM	0.8343	0.8867	0.9074	0.9058

下载: 导出CSV

表 6 消融实验

Table 6. Ablation experiments

Method	PSNR	SSIM	Time/s
Path1(SFW)	27.35	0.92	0.05
Path2 without MS-feature extraction	25.26	0.83	0.09
Path2	27.03	0.87	0.14
Path3	24.58	0.92	0.14
TSFEB	29.15	0.90	0.21
SFW + TSFEB without MS-feature extraction	30.99	0.93	0.22
SFW + TSFEB	35.42	0.95	0.26

下载: 导出CSV

[1]	左岑, 杨秀杰, 张捷, 王璇. 基于轻量级金字塔密集残差网络的红外图像超分辨增强[J]. 红外技术, 2021, 43(03): 251-257. Zuo C, Yang X, Zhang J, et al. Super-resolution enhancement of infrared images using a lightweight dense residual network [J]. Infrared Technology, 2021, 43(3): 251-257. (in Chinese)
[2]	李萍, 刘以安, 徐安林. 基于多尺度耦合的密集残差网络红外图像增强[J]. 电子测量与仪器学报, 2021, 35(07): 148-155. Li P, Liu Y, Xu A. Infrared image enhancement using dense residual network with multi-scale coupling [J]. Journal of Electronic Measurement and Instrumentation, 2021, 35(7): 148-155. (in Chinese)
[3]	Choi Y, Kim N, Hwang S, et al. Thermal image enhancement using convolutional neural network[C]//2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2016.
[4]	王笛, 沈涛, 孙宾宾, 崔晓荣. 基于大气灰度因子的红外图像增强算法[J]. 激光与红外, 2019, 49(09): 1135-1140. Wang D, Shen T, Sun B, et al. Infrared image enhancement algorithm based on atmospheric gray factor [J]. Laser and Infrared, 2019, 49(9): 1135-1140. (in Chinese)
[5]	李牧, 周瑞杰, 田哲嘉. 基于直方图的热红外图像增强方法[J]. 红外技术, 2020, 42(09): 880-885. Li M, Zhou R, Tian Z. A thermal infrared image enhancement method based on histogram [J]. Infrared Technology, 2020, 42(9): 880-885. (in Chinese)
[6]	李佳, 李少娟, 段小虎等. 基于Retinex理论与概率非局部均值的红外图像增强方法[J]. 光子学报, 2020, 49(4): 0410003. doi: 10.3788/gzxb20204904.0410003 Li J, Li S, Duan X, et al. Infrared image enhancement based on retinex and probability nonlocal means filtering [J]. Acta Photonica Sinica, 2020, 49(4): 0410003. (in Chinese) doi: 10.3788/gzxb20204904.0410003
[7]	曹海杰, 刘宁, 许吉等. 红外图像自适应逆直方图增强技术[J]. 红外与激光工程, 2020, 49(4): 0426003. doi: 10.3788/IRLA202049.0426003 Cao H, Liu N, Xu J, et al. Infrared image adaptive inverse histogram enhancement technology [J]. Infrared and Laser Engineering, 2020, 49(4): 0426003. (in Chinese) doi: 10.3788/IRLA202049.0426003
[8]	Li S, Jin W, Li L, et al. An improved contrast enhancement algorithm for infrared images based on adaptive double plateaus histogram equalization [J]. Infrared Physics & Technology, 2018, 90: 164-174.
[9]	Liang X, Tian Y, Yan S, et al. A real-time infrared image enhancement algorithm based on improved CLAHE[C]//2018 International Conference on Image and Video Processing, and Artificial Intelligence, 2018: 10836.
[10]	Lee K, Lee J, Lee J, et al. Brightness-based convolutional neural network for thermal image enhancement [J]. IEEE Access, 2017, 5: 26867-26879. doi: 10.1109/ACCESS.2017.2769687
[11]	Kuang X, Sui X, Liu Y, et al. Single infrared image enhancement using a deep convolutional neural network [J]. Neurocomputing, 2019, 332: 119-128. doi: 10.1016/j.neucom.2018.11.081
[12]	He Z, Tang S, Yang J, et al. Cascaded deep networks with multiple receptive fields for infrared image super-resolution [J]. IEEE Transactions on Circuits and Systems for Video Technology, 2019, 29(8): 2310-2322. doi: 10.1109/TCSVT.2018.2864777
[13]	王向军, 欧阳文森. 多尺度循环注意力网络运动模糊图像复原方法[J]. 红外与激光工程, 2022, 51(6): 20210605. . doi: 10.3788/IRLA20210605 Wang X J, Ouyang W S. Multi-scale recurrent attention network for image motion deblurring [J]. Infrared and Laser Engineering, 2022, 51(6): 20210605. (in Chinese) doi: 10.3788/IRLA20210605
[14]	Tian C, Xu Y, Zuo W. Image denoising using deep CNN with batch renormalization [J]. Neural Networks, 2020, 121: 461-473. doi: 10.1016/j.neunet.2019.08.022
[15]	Deng J, Dong W, Socher R, et al. Imagenet: A large-scale hierarchical image database[C]//2009 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 2009.
[16]	Martin D, Fowlkes C, Tal D, et al. A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics[C]//Proceedings Eighth IEEE International Conference on Computer Vision. ICCV. IEEE, 2001, 2: 416-423.
[17]	Toet A. TNO image fusion dataset. figshare[DB/OL]. (2014)[2021-12-13]. https://doi.org/10.6084/m9.figshare.1008029.v1.
[18]	Davis J W, Keck M A. A two-stage template approach to person detection in thermal imagery[C]//2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05). IEEE, 2005, 1: 364-369.

[1]	李鹏越, 续欣莹, 唐延东, 张朝霞, 韩晓霞, 岳海峰. 基于并行多轴自注意力的图像去高光算法 . 红外与激光工程, 2024, 53(3): 20230538-1-20230538-11. doi: 10.3788/IRLA20230538
[2]	李昭慧, 寇鸽子. 基于改进的Deeplabv3+的红外航拍图像架空导线识别算法 . 红外与激光工程, 2022, 51(11): 20220112-1-20220112-9. doi: 10.3788/IRLA20220112
[3]	张方, 肖辉. 基于三角函数变换与IRDPSO优化的图像增强算法 . 红外与激光工程, 2022, 51(8): 20210709-1-20210709-8. doi: 10.3788/IRLA20210709
[4]	蔡仁昊, 程宁, 彭志勇, 董施泽, 安建民, 金钢. 基于深度学习的轻量化红外弱小车辆目标检测算法研究 . 红外与激光工程, 2022, 51(12): 20220253-1-20220253-11. doi: 10.3788/IRLA20220253
[5]	廖莎莎. 基于筛选深度特征的红外图像目标识别方法 . 红外与激光工程, 2022, 51(5): 20210372-1-20210372-6. doi: 10.3788/IRLA20210372
[6]	王鹏翔, 张兆基, 杨怀. 结合多特征融合和极限学习机的红外图像目标分类方法 . 红外与激光工程, 2022, 51(6): 20210597-1-20210597-6. doi: 10.3788/IRLA20210597
[7]	夏信, 何传亮, 吕英杰, 王守志, 张博, 陈晨, 陈海鹏, 李美萱. 深度学习驱动的智能电网运行图像数据压缩技术 . 红外与激光工程, 2022, 51(12): 20220097-1-20220097-6. doi: 10.3788/IRLA20220097
[8]	李霖, 王红梅, 李辰凯. 红外与可见光图像深度学习融合方法综述 . 红外与激光工程, 2022, 51(12): 20220125-1-20220125-20. doi: 10.3788/IRLA20220125
[9]	王志远, 赖雪恬, 林惠川, 陈福昌, 曾峻, 陈子阳, 蒲继雄. 基于深度学习实现透过浑浊介质图像重构（特邀） . 红外与激光工程, 2022, 51(8): 20220215-1-20220215-10. doi: 10.3788/IRLA20220215
[10]	王向军, 欧阳文森. 多尺度循环注意力网络运动模糊图像复原方法 . 红外与激光工程, 2022, 51(6): 20210605-1-20210605-9. doi: 10.3788/IRLA20210605
[11]	汪伟, 许德海, 任明艺. 一种改进的红外图像自适应增强方法 . 红外与激光工程, 2021, 50(11): 20210086-1-20210086-9. doi: 10.3788/IRLA20210086
[12]	史国军. 深度特征联合表征的红外图像目标识别方法 . 红外与激光工程, 2021, 50(3): 20200399-1-20200399-6. doi: 10.3788/IRLA20200399
[13]	张智, 孙权森, 林栩凌, 韩明亮. 基于临近时空帧间信息的空间目标图像增强方法 . 红外与激光工程, 2019, 48(S1): 193-197. doi: 10.3788/IRLA201948.S128004
[14]	梁欣凯, 宋闯, 赵佳佳. 基于深度学习的序列图像深度估计技术 . 红外与激光工程, 2019, 48(S2): 134-141. doi: 10.3788/IRLA201948.S226002
[15]	姚旺, 刘云鹏, 朱昌波. 基于人眼视觉特性的深度学习全参考图像质量评价方法 . 红外与激光工程, 2018, 47(7): 703004-0703004(8). doi: 10.3788/IRLA201847.0703004
[16]	张秀玲, 侯代标, 张逞逞, 周凯旋, 魏其珺. 深度学习的MPCANet火灾图像识别模型设计 . 红外与激光工程, 2018, 47(2): 203006-0203006(6). doi: 10.3788/IRLA201847.0203006
[17]	耿磊, 梁晓昱, 肖志涛, 李月龙. 基于多形态红外特征与深度学习的实时驾驶员疲劳检测 . 红外与激光工程, 2018, 47(2): 203009-0203009(9). doi: 10.3788/IRLA201847.0203009
[18]	刘雪超, 吴志勇, 王弟男, 杨华, 黄德天. 结合自适应窗口的二维直方图图像增强 . 红外与激光工程, 2014, 43(6): 2027-2034.
[19]	徐利民, 范文慧, 刘佳. 太赫兹图像的降噪和增强 . 红外与激光工程, 2013, 42(10): 2865-2870.
[20]	孙韶媛, 李琳娜, 赵海涛. 采用KPCA和BP神经网络的单目车载红外图像深度估计 . 红外与激光工程, 2013, 42(9): 2348-2352.

点击查看大图

图(8) / 表(6)

计量

文章访问数: 331
HTML全文浏览量: 85
PDF下载量: 81
被引次数: 0

全文HTML

0. 引　言

红外图像是仅反映目标物体红外辐射能量的灰度图像，其受环境干扰较小，已在野外侦察、航空航天及居家看护等军事和民生领域发挥着不可替代的作用^[1]。然而，红外波段的辐射波长比可见光长，导致红外图像的空间分辨力比可见光低，图像细节信息不丰富^[2]。此外，受红外成像器件本身的缺陷和外部环境的影响，红外图像通常呈现低对比度，目标边缘不清晰和人眼视觉效果不佳等缺点，很难完成目标定位识别、人体姿态估计等机器视觉任务^[3]。因此，为了得到适合人眼观察或机器识别的高质量红外图像，有必要对红外图像增强，提高图像对比度、丰富细节特征，区分背景和目标，从而提高上述任务的效率和精度^[4]。

深度学习流行前，针对红外图像增强任务，主要采用基于灰度变换的直方图均衡化（Histogram Equalization, HE）^[5]和基于物理模型的Retinex算法^[6]。HE在低照度图像增强任务上取得了良好的效果。然而，针对低对比度红外图像增强，HE通常会加大红外图像的各种噪声，局部区域呈现过增强，产生非常差的结果^[7]。为了解决这个问题，出现了许多基于直方图均衡化的变体，如DPHE^[8]、CLAHE^[9]等。这些方法虽能够在一定程度上提高对比度和抑制噪声，但会产生光晕现象，并造成边缘模糊，降低人眼视觉效果。模仿人眼视觉系统的Retinex方法在处理红外图像时，能更进一步地保留细节信息，丰富图像纹理，然而其依赖参数选择，模型泛化能力差，不能自适应地优化图像。

深度学习广泛应用于计算机视觉领域后，Choi等人^[3]受SRCNN启发，首次设计了一个相对浅的卷积神经网络TEN用于热图像增强。Lee等人^[10]提出了TIECNN用于红外图像增强，它结合亮度域和残差学习，以提高网络性能和收敛速度。Kuang等人^[11]在卷积神经网络结构中加入生成对抗网络，提出了IE-GAN用于单帧红外图像增强，能有效抑制背景噪声，并增强图像对比度和细节，但需要设计相对复杂的损失函数。He等人^[12]提出了一种具有多个感受野（CDN_MRF）的深度级联网络架构，以解决具有大比例因子的单帧红外图像超分辨率问题。虽然上述基于CNN的方法对红外图像增强做出了贡献，但这些方法在增强对比度的同时会加重图像伪影，造成目标边缘模糊及产生光晕，不能充分展现图像的边缘和纹理。由此，提出了端到端学习的并行多特征提取网络红外图像增强方法解决以上问题。一方面，设计了结构特征权重映射块，用于生成全局特征权重，以保留原始图像的空间结构特征；另一方面，构造了双尺度特征提取块对不同尺度的特征图进行深度特征提取，学习图像的多特征信息，捕获细节和纹理。最后，解码模块对已提取的特征和初始权重进行融合，提升对比度，丰富细节纹理，生成高质量的增强红外图像。