红外与可见光图像融合技术的研究进展

沈英; 黄春红; 黄峰; 李杰; 朱梦娇; 王舒

doi:10.3788/IRLA20200467

红外与可见光图像融合技术的研究进展

doi: 10.3788/IRLA20200467

福州大学机械工程及自动化学院，福建福州 350116

基金项目: 国家自然科学基金（62005049）；福建省自然科学基金（2020J01451）；福建省教育厅中青年教师教育科研项目（JAT190003）

详细信息

作者简介:
沈英，女，教授，博士，主要从事光机电一体化方面的研究

通讯作者: 王舒，男，助理研究员，博士，主要从事光学成像方面的研究。

中图分类号: TP391.4

Research progress of infrared and visible image fusion technology

College of Mechanical Engineering and Automation, Fuzhou University, Fuzhou 350116, China

摘要: 红外与可见光融合图像既具有红外的辐射信息又具有可见光的细节信息，在生产生活、军事监视等场景得到广泛应用，已然成为图像融合领域的重点研究方向。根据图像融合方法的核心思想、融合框架、研究进展对基于多尺度变换、稀疏表示、神经网络等融合方法进行详细阐述对比，并综述了红外与可见光图像融合在各领域内的应用现状，以及常用的评价指标。并选择具有代表性的多种融合方法与评价指标，应用于六个不同场景，验证各方法的优势与不足。最后，实验分析并总结现有红外与可见光图像融合方法存在的问题，对红外与可见光图像融合技术的发展趋势进行展望。
- 红外图像 /
- 可见光图像 /
- 图像融合 /
- 多尺度变换 /
- 稀疏表示 /
- 神经网络
Abstract: Infrared and visible image fusion combines the infrared thermal radiation information and visible detail information. The image fusion technique has facilitated development in numerous fields, including production, life sciences, military surveillance and others, and has become a key research direction in the field of image technology. According to the core idea, fusion framework and research progress of image fusion methods, the fusion methods based on multi-scale transformation, sparse representation, neural network, etc. are elaborated and compared, and the application status of infrared and visible light image fusion in various fields and the commonly used the evaluation index. The most representative methods and evaluation indicators are selected and applied to six different scenes in order to verify the advantages and disadvantages of each one. Finally, the existing problems of infrared and visible image fusion methods are experimentally analyzed and summarized , the development prospects of infrared and visible image fusion technology are presented.
- infrared image /
- visible image /
- image fusion /
- multi-scale transform /
- sparse representation /
- neural nework

图 1 基于多尺度变换的红外与可见光图像融合框架

Figure 1. Multi-scale transform-based infrared and visible image fusion frame

下载: 全尺寸图片幻灯片

图 2 基于稀疏表示的红外与可见光图像融合框架

Figure 2. Sparse representation-based infrared and visible image fusion frame

下载: 全尺寸图片幻灯片

图 3 可见光、红外与偏振进行人脸识别的差异^[3]

Figure 3. Difference between visible light, infrared and polarization for face recognition^[3]

下载: 全尺寸图片幻灯片

图 4 用于水果检测的图像融合处理框架^[9]

Figure 4. Image fusion processing framework for fruit detection^[9]

下载: 全尺寸图片幻灯片

图 5 自然颜色映射在图像融合中的应用^[111]

Figure 5. Application of natural color mapping in image fusion^[111]

下载: 全尺寸图片幻灯片

图 6 九种代表性方法对红外与可见光图像的融合效果

Figure 6. Effect of nine representative methods on the fusion of infrared and visible images

下载: 全尺寸图片幻灯片

图 7 不同方法的客观评价指标

Figure 7. Objective evaluation metrics with different methods

下载: 全尺寸图片幻灯片

表 1 红外与可见光图像融合方法的对比

Table 1. Comparison of infrared and visible image fusion methods

Fusion methods	Specific methods	Fusion strategies	Advantages	Limitations	Applicable scenes
Pyramid transforms	Laplacian pyramid	Fuzzy logic^[9]	Smoothing image edge； Less time consumption； Less artifacts	Losing image details； Block phenomenon； Redundancy of data	Short-distance scenes with sufficient light, such as equipment detection
	Contrast pyramid	Clonal selection algorithm^[17]； Teaching learning based optimization^[88]； Multi-objective evolutionary algorithm^[89]	High image contrast； Abundant characteristic information	Low computing efficiency； Losing image details
	Steerable pyramid	The absolute value maximum selection(AVMS)^[90]; The expectation maximization(EM) algorithm^[91]; PCNN and weighting^[92]	Abundant edge detail; Inhibiting the Gibbs effect effectively; Fusing the geometrical and thematic feature availably	Increasing the complexity of algorithm; Losing the image details
Wavelet transform	Discrete wavelet transform	Regional energy^[93]； Target region segmentation^[21]	Significant texture information； Highly independent scale information； Less blocking artifacts; Higher signal-to-noise ratios	Image aliasing； Ringing artifacts； Strict registration requirements	Short-distance scenes, such as face recognition
	Dual tree discrete wavelet transform	Particle swarm optimization^[22]； Fuzzy logic and population-based optimization^[94]	Less redundant information; Less time consumption	Limited directional information
	Lifting wavelet transform	Local regional energy^[23]； PCNN^[85]	High computing speed； Low space complexity；	Losing image details； Distorting image
Nonsubsampled multi-scale and multi-direction geometrical transform	NSCT	Fuzzy logic^[29]； Region of interest^[30]	Distinct edge features； Eliminating the Gibbs effect； Better visual perception	Losing image details； Low computing efficiency； Poor real-time	Scenes with a complex background, such as rescue scenes
	NSST	Region average energy and local directional contrast^[33]； FNMF^[34]	Superior sparse ability； High real-time performance	Losing luminance information； Strict registration requirement； Losing image details of high frequency	Cases need real-time treatment, such as intelligent traffic monitoring
Sparse representation		Saliency detection^{[44, 86-87]}； PCNN^{[56, 95]}	Better robustness； Less artifacts； Reducing misregistration； Abundant brightness information	Smoothing edge texture information； Complex calculation； Losing edge features of high frequency images	Scenes with little feature points, such as the surface of the sea

下载: 导出CSV

续表 1 Tab.1 Continued
Neural network	PCNN	Multi-scale transform and sparse representation； Multi-scale transform	Superior adaptability； Higher signal-to-noise ratios； High fault tolerance	Model parameters are not easy to set； Complex and time-consuming algorithms	Automatic target detection and localization
	Deep learning	VGG-19 and multi-layer fusion^[69]; VGG-19 and saliency detection^[70]	Less artificial noise; Abundant characteristic information Less artifacts	Requiring the ground truth in advance
	Deep learning	GAN^[71]	Avoiding manually designing complicated activity level measurements and fusion rules	The visual information fidelity and correlation coefficient is not optimal
Hybrid methods	Multi-scale transform and saliency	Weight calculation^[76-80]; Salient object extraction^{[81, 82]}	Maintaining the integrity of the salient object region; Improving the visual quality of the fused image；Reducing the noise	Highlighting saliency area inconsistently; Losing the background information	The surveillance application, such as object detection and tracking
Hybrid methods	Multi-scale transform and SR	The absolute values of coefficient and SR^[38]; The fourth-order correlation coefficients match and SR^[83]	Retaining luminance information; Excellent stability and robustness	Poor real-time Losing the image details

下载: 导出CSV

表 2 无参考图像的评价指标

Table 2. Evaluation index without reference image

Evaluation indicators	Definition	Explanation
IE^[124]	${{IE} } = - \displaystyle\sum\limits_{i = 0}^{L - 1} { {p_i} } {\log _2}{p_i}$	Amount of information contained in an image increases as IE improves
SD^[125]	${{SD} } = \sqrt {\frac{1}{ {MN} }\displaystyle\mathop \sum \limits_{i = 1}^M \displaystyle\mathop \sum \limits_{j = 1}^N { {\left( {F\left( {i,j} \right) - \mu } \right)}^2} }$	Deviation between pixels and pixel mean is evaluated by SD, which improves with the increase of SD, resulting in improvement in contrast of images
AG^[126]	${{AG} } = \frac{1}{ {\left( {M - 1} \right)\left( {N - 1} \right)} }\displaystyle\sum\limits_{i = 1}^{M - 1} {\displaystyle\sum\limits_{j = 1}^{N - 1} {\sqrt {\frac{ {\left( {\vartriangle Z_i^2 + \vartriangle Z_j^2} \right)} }{2} } } }$	A wealth of detailed information is exhibited by a high value of AG which is used to reflect the gray variation of the image
Q^AB/F^[127]	${ {{Q} }^{{ {AB/F} } } } = \frac{ {\displaystyle\sum\limits_{i = 0}^{M - 1} {\displaystyle\sum\limits_{j = 0}^{N - 1} {\left( {Q_{\left( {i,j} \right)}^{AF}w_{\left( {i,j} \right)}^A + Q_{\left( {i,j} \right)}^{BF}w_{\left( {i,j} \right)}^B} \right)} } } }{ {\displaystyle\sum\limits_{i = 0}^{M - 1} {\displaystyle\sum\limits_{j = 0}^{N - 1} {\left( {w_{\left( {i,j} \right)}^A + w_{\left( {i,j} \right)}^B} \right)} } } }$	Fusion effect of image exhibits better as the value of Q^AB/F which is used to evaluate the transfer of edge information, approaches 1
MI^[2]	$\begin{array}{l}{I_{ { {FA} } } }(i,j) = \displaystyle\sum\limits_{i = 1}^{M - 1} {\displaystyle\sum\limits_{j = 1}^{N - 1} { {P_{ { {FA} } } }\left( {i,j} \right)} } {\log _2}\dfrac{ { {P_{FA} }\left( {i,j} \right)} }{ { {P_F}\left( i \right){P_B}\left( j \right)} }\\MI_{AB}^F = {I_{ { {FA} } } } + {I_{ { {FB} } } }\end{array}$	Amount of information preserved in an image increases with the improvement of MI which is utilized to characterize inheritance of image information
CC^[128]	${{CC} } = \frac{ {\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ {\left( {F\left( {i,j} \right) - {\mu _F} } \right) \times \left( {S\left( {i,j} \right) - {\mu _S} } \right)} \right]} } } }{ {\sqrt {\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ { { {\left( {F\left( {i,j} \right) - {\mu _F} } \right)}^2} } \right]\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ { { {\left( {S\left( {i,j} \right) - {\mu _S} } \right)}^2} } \right]} } } } } } }$	Similarity between images improves as CC increases, thereby preserving more image information

下载: 导出CSV

表 3 基于参考图像的评价指标

Table 3. Evaluation index based on reference image

Evaluation indicators	Definition	Explanation
SSIM^[129]	$SSI{M_{RF}} = \displaystyle\prod\limits_{i = 1}^3 {\dfrac{{2{\mu _R}{\mu _F} + {c_i}}}{{\mu _R^2 + \mu _F^2 + {c_i}}}} $	Similarity between source image and fusion image enhances with the increase of SSIM which is used to measure image luminance, contrast and structural distortion level
RMSE^[2]	$RMSE = \sqrt {\dfrac{1}{{M \times N}}\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {{{\left[ {R\left( {i,j} \right) - F\left( {i,j} \right)} \right]}^2}} } } $	Performance indicators of images promote with the reduction of RMSE
PSNR^[2]	$PSNR = 10 \cdot \lg \dfrac{{{{\left( {255^2 \times M \times N} \right)}}}}{{\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {{{\left[ {R\left( {i,j} \right) - F\left( {i,j} \right)} \right]}^2}} } }}$	The distortion of images decreases as the improvement of PSNR using to evaluate whether the image noise is suppressed

下载: 导出CSV

[1]	Li S T, Kang X D, Fang L Y, et al. Pixel-level image fusion: A survey of the state of the art [J]. Information Fusion, 2017, 33: 100-112. doi: 10.1016/j.inffus.2016.05.004
[2]	Ma J Y, Ma Y, Li C. Infrared and visible image fusion methods and applications: A survey [J]. Information Fusion, 2019, 45: 153-178. doi: 10.1016/j.inffus.2018.02.004
[3]	Short N J, Yuffa A J, Videen G, et al. Effects of surface materials on polarimetric-thermal measurements: Applications to face recognition [J]. Applied Optics, 2016, 55(19): 5226-5233. doi: 10.1364/AO.55.005226
[4]	Heo J, Kong S G, Abidi B R. Fusion of visual and thermal signatures with eyeglass removal for robust face recognition[C]//Computer Vision and Pattern Recognition Workshop, 2004,19: 122-127.
[5]	Kumar K S, Kavitha G, Subramanian R, et al. MATLAB-A Ubiquitous Tool for the Practical Engineer[M]. Croatia: In Tech, 2011: 307-326.
[6]	Castillo J C, Fernandez-Caballero A, Serrano-Cuerda J, et al. Smart environment architecture for robust people detection by infrared and visible video fusion [J]. Journal of Ambient Intelligence and Humanized Computing, 2017, 8(2): 223-237. doi: 10.1007/s12652-016-0429-5
[7]	Fendri E, Boukhriss R R, Hammami M. Fusion of thermal infrared and visible spectra for robust moving object detection [J]. Pattern Analysis and Applications, 2017, 20(4): 907-926. doi: 10.1007/s10044-017-0621-z
[8]	Apatean A, Rogozan A, Bensrhair A. Visible-infrared fusion schemes for road obstacle classification [J]. Transportation Research Part C-Emerging Technologies, 2013, 35: 180-192. doi: 10.1016/j.trc.2013.07.003
[9]	Bulanon D M, Burks T F, Alchanatis V. Image fusion of visible and thermal images for fruit detection [J]. Biosystems Engineering, 2009, 103(1): 12-22. doi: 10.1016/j.biosystemseng.2009.02.009
[10]	Raza S E A, Sanchez V, Prince G, et al. Registration of thermal and visible light images of diseased plants using silhouette extraction in the wavelet domain [J]. Pattern Recognition, 2015, 48(7): 2119-2128. doi: 10.1016/j.patcog.2015.01.027
[11]	Burt P, Adelson E. The laplacian pyramid as a compact image code [J]. IEEE Transations on Communications, 1983, 31(4): 532-540. doi: 10.1109/TCOM.1983.1095851
[12]	Toet A. Image fusion by a ration of low-pass pyramid [J]. Pattern Recognition Letters, 1989, 9(4): 245-253. doi: 10.1016/0167-8655(89)90003-2
[13]	Toet A, Vanruyven L J, Valeton J M. Merging thermal and visual images by a contrast pyramid [J]. Optical Engineering, 1989, 28(7): 789-792.
[14]	Toet A. A morphological pyramidal image decomposition [J]. Pattern Recognition Letters, 1989, 9(4): 255-261. doi: 10.1016/0167-8655(89)90004-4
[15]	Freeman W T, Adelson E H, Intell M. The design and use of steerable filters [J]. IEEE Transpattern Anal, 1991, 13(9): 891-906. doi: 10.1109/34.93808
[16]	Yu X L, Ren J L, Chen Q, et al. A false color image fusion method based on multi-resolution color transfer in normalization YCBCR space [J]. Optik, 2014, 125(20): 6010-6016. doi: 10.1016/j.ijleo.2014.07.059
[17]	Jin H Y, Jiao L C, Liu F, et al. Fusion of infrared and visual images based on contrast pyramid directional filter banks using clonal selection optimizing [J]. Optical Engineering, 2008, 47(2): 27002-27008. doi: 10.1117/1.2857417
[18]	He D X, Meng Y, Wang C Y. Contrast pyramid based image fusion scheme for infrared image and visible image[C]//2011 IEEE International Geoscience and Remote Sensing Symposium, 2011: 597-600.
[19]	Grossmann A, Morlet J. Decomposition of hardy functions into square integrable wavelets of constant shape [J]. Siam Journal on Mathematical Analysis, 1984, 15(4): 723-736. doi: 10.1137/0515056
[20]	Mallat S G. A theory for multiresolution signal decomposition-the wavelet representation [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1989, 11(7): 674-693. doi: 10.1109/34.192463
[21]	Niu Y, Xu S, Wu L, et al. Airborne infrared and visible image fusion for target perception based on target region segmentation and discrete wavelet transform [J]. Mathematical Problems in Engineering, 2012, 2012: 732-748.
[22]	Madheswari K, Venkateswaran N. Swarm intelligence based optimisation in thermal image fusion using dual tree discrete wavelet transform [J]. Quantitative Infrared Thermography Journal, 2017, 14(1): 24-43. doi: 10.1080/17686733.2016.1229328
[23]	Zou Y, Liang X, Wang T. Visible and infrared image fusion using the lifting wavelet [J]. Telkomnika Indonesian Journal of Electrical Engineering, 2013, 11(11): 6290-6295.
[24]	Chai P F, Luo X Q, Zhang Z C. Image fusion using quaternion wavelet transform and multiple features [J]. IEEE Access, 2017, 5: 6724-6734. doi: 10.1109/ACCESS.2017.2685178
[25]	Yan X, Qin H L, Li J, et al. Infrared and visible image fusion with spectral graph wavelet transform [J]. Journal of the Optical Society of America a-Optics Image Science and Vision, 2015, 32(9): 1643-1652. doi: 10.1364/JOSAA.32.001643
[26]	Tao G Q, Li D P, Lu G H. On image fusion based on different fusion rules of wavelet transform [J]. Acta Photonica Sinica, 2004, 33(2): 221-224.
[27]	Selesnick I W, Baraniuk R G, Kingsbury N G. The dual-tree complex wavelet transform [J]. IEEE Signal Processing Magazine, 2005, 22(6): 123-151. doi: 10.1109/MSP.2005.1550194
[28]	Da Cunha A L, Zhou J P, Do M N. The nonsubsampled contourlet transform: theory, design, and applications [J]. IEEE Transactions on Image Processing, 2006, 15(10): 3089-3101. doi: 10.1109/TIP.2006.877507
[29]	Yin S, Cao L, Tan Q, et al. Infrared and visible image fusion based on NSCT and fuzzy logic[C]//Proceedings of the 2010 IEEE International Conference on Mechatronics and Automation, 2010,5: 671-675.
[30]	Liu H X, Zhu T H, Zhao J J. Infrared and visible image fusion based on region of interest detection and nonsubsampled contourlet transform [J]. Journal of Shanghai Jiaotong University (Science), 2013, 18(5): 526-534. doi: 10.1007/s12204-013-1437-7
[31]	Guo K, Labate D. Optimally sparse multidimensional representation using shearlets [J]. Siam Journal on Mathematical Analysis, 2007, 39(1): 298-318. doi: 10.1137/060649781
[32]	Easley G, Labate D, Lim W Q. Sparse directional image representations using the discrete shearlet transform [J]. Applied and Computational Harmonic Analysis, 2008, 25(1): 25-46. doi: 10.1016/j.acha.2007.09.003
[33]	Kong W W, Wang B H, Lei Y. Technique for infrared and visible image fusion based on non-subsampled shearlet transform and spiking cortical model [J]. Infrared Physics & Technology, 2015, 71: 87-98.
[34]	Kong W, Lei Y, Zhao H. Adaptive fusion method of visible light and infrared images based on non-subsampled shearlet transform and fast non-negative matrix factorization [J]. Infrared Physics & Technology, 2014, 67: 161-172.
[35]	Hu H M, Wu J W, Li B, et al. An adaptive fusion algorithm for visible and infrared videos based on entropy and the cumulative distribution of gray levels [J]. IEEE Transactions on Multimedia, 2017, 19(12): 2706-2719. doi: 10.1109/TMM.2017.2711422
[36]	Zhang X Y, Ma Y, Fan F, et al. Infrared and visible image fusion via saliency analysis and local edge-preserving multi-scale decomposition [J]. Journal of the Optical Society of America a-Optics Image Science and Vision, 2017, 34(8): 1400-1410. doi: 10.1364/JOSAA.34.001400
[37]	Yang B, Li S T. Multifocus image fusion and restoration with sparse representation [J]. IEEE Transactions on Instrumentation and Measurement, 2010, 59(4): 884-892. doi: 10.1109/TIM.2009.2026612
[38]	Liu Y, Liu S P, Wang Z F. A general framework for image fusion based on multi-scale transform and sparse representation [J]. Information Fusion, 2015, 24: 147-164. doi: 10.1016/j.inffus.2014.09.004
[39]	Yin H T. Sparse representation with learned multiscale dictionary for image fusion [J]. Neurocomputing, 2015, 148: 600-610. doi: 10.1016/j.neucom.2014.07.003
[40]	Yang B, Li S T. Pixel-level image fusion with simultaneous orthogonal matching pursuit [J]. Information Fusion, 2012, 13(1): 10-19. doi: 10.1016/j.inffus.2010.04.001
[41]	Liu Y, Wang Z F. Simultaneous image fusion and denoising with adaptive sparse representation [J]. Iet Image Processing, 2015, 9(5): 347-357. doi: 10.1049/iet-ipr.2014.0311
[42]	Yin H T, Li S T. Multimodal image fusion with joint sparsity model [J]. Optical Engineering, 2011, 50(6): 067007-067009. doi: 10.1117/1.3584840
[43]	Nejati M, Samavi S, Shirani S. Multi-focus image fusion using dictionary-based sparse representation [J]. Information Fusion, 2015, 25: 72-84. doi: 10.1016/j.inffus.2014.10.004
[44]	Wang J, Peng J Y, Feng X Y, et al. Fusion method for infrared and visible images by using non-negative sparse representation [J]. Infrared Physics & Technology, 2014, 67: 477-489.
[45]	Zhang Q, Levine M D. Robust multi-focus image fusion using multi-task sparse representation and spatial context [J]. IEEE Transactions on Image Processing, 2016, 25(5): 2045-2058. doi: 10.1109/TIP.2016.2524212
[46]	Zhang Q H, Fu Y L, Li H F, et al. Dictionary learning method for joint sparse representation-based image fusion [J]. Optical Engineering, 2013, 52(5): 1-11.
[47]	Yu N N, Qiu T S, Bi F, et al. Image features extraction and fusion based on joint sparse representation [J]. IEEE Journal of Selected Topics in Signal Processing, 2011, 5(5): 1074-1082. doi: 10.1109/JSTSP.2011.2112332
[48]	Engan K, Aase S O, Husoy J H. Method of optimal directions for frame design[C]//1999 IEEE International Conference on Acoustics, Speech, and Signal Processing,1999: 2443-2446.
[49]	Aharon M, Elad M, Bruckstein A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation [J]. IEEE Transactions on Signal Processing, 2006, 54(11): 4311-4322. doi: 10.1109/TSP.2006.881199
[50]	Rubinstein R, Zibulevsky M, Elad M. Double sparsity: Learning sparse dictionaries for sparse signal approximation [J]. IEEE Transactions on Signal Processing, 2010, 58(3): 1553-1564. doi: 10.1109/TSP.2009.2036477
[51]	Kim M, Han D K, Ko H. Joint patch clustering-based dictionary learning for multimodal image fusion [J]. Information Fusion, 2016, 27: 198-214. doi: 10.1016/j.inffus.2015.03.003
[52]	Dong W S, Li X, Zhang L, et al. Sparsity-based image denoising via dictionary learning and structural clustering[C]//2011 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011: 457-464.
[53]	Chatterjee P, Milanfar P. Clustering-based denoising with locally learned dictionaries [J]. IEEE Transactions on Image Processing, 2009, 18(7): 1438-1451. doi: 10.1109/TIP.2009.2018575
[54]	Yao Y, Guo P, Xin X, et al. Image fusion by hierarchical joint sparse representation [J]. Cognitive Computation, 2014, 6(3): 281-292. doi: 10.1007/s12559-013-9235-y
[55]	Ophir B, Lustig M, Elad M. Multi-scale dictionary learning using wavelets [J]. IEEE Journal of Selected Topics in Signal Processing, 2011, 5(5): 1014-1024. doi: 10.1109/JSTSP.2011.2155032
[56]	Lu X Q, Zhang B H, Zhao Y, et al. Theinfrared and visible image fusion algorithm based on target separation and sparse representation [J]. Infrared Physics & Technology, 2014, 67: 397-407.
[57]	Zhu Z Q, Yin H P, Chai Y, et al. A novel multi-modality image fusion method based on image decomposition and sparse representation [J]. Information Sciences, 2018, 432: 516-529. doi: 10.1016/j.ins.2017.09.010
[58]	Wang K P, Qi G Q, Zhu Z Q, et al. A novel geometric dictionary construction approach for sparse representation based image fusion [J]. Entropy, 2017, 19(7): 306. doi: 10.3390/e19070306
[59]	Zhang Q, Liu Y, Blum R S, et al. Sparse representation based multi-sensor image fusion for multi-focus and multi-modality images: A review [J]. Information Fusion, 2018, 40: 57-75. doi: 10.1016/j.inffus.2017.05.006
[60]	Kong W W, Zhang L J, Lei Y. Novel fusion method for visible light and infrared images based on NSST-SF-PCNN [J]. Infrared Physics & Technology, 2014, 65: 103-112.
[61]	Xiang T Z, Yan L, Gao R R. A fusion algorithm for infrared and visible images based on adaptive dual-channel unit-linking PCNN in NSCT domain [J]. Infrared Physics & Technology, 2015, 69: 53-61.
[62]	Ma L J, Zhao C H. An effective image fusion method based on nonsubsampled contourlet transform and pulse coupled neural network[C]//Proceedings of the 2nd International Conference on Computer and Information Applications (ICCIA 2012), 2012: 8-12.
[63]	Li Y, Song G-H, Yang S-C. Multi-sensor image fusion by NSCT-PCNN transform[C]//2011 IEEE International Conference on Computer Science and Automation Engineering, 2011: 638-642.
[64]	Kong W W, Lei Y J, Lei Y, et al. Image fusion technique based on non-subsampled contourlet transform and adaptive unit-fast-linking pulse-coupled neural network [J]. Iet Image Processing, 2011, 5(2): 113-121. doi: 10.1049/iet-ipr.2009.0425
[65]	Qu X B, Yan J W, Xiao H Z, et al. Image fusion algorithm based on spatia frequency-motivated pulse coupled neural networks in nonsubsampled contourlet transform Domain [J]. Acta Automatica Sinica, 2009, 12(34): 1508-1514.
[66]	El-taweel G S, Helmy A K. Image fusion scheme based on modified dual pulse coupled neural network [J]. Iet Image Processing, 2013, 7(5): 407-414. doi: 10.1049/iet-ipr.2013.0045
[67]	Yu Z, Yan L, Han N, et al. Image fusion algorithm based on contourlet transform and PCNN for detecting obstacles in forests [J]. Cybernetics and Information Technologies, 2015, 15(1): 116-125. doi: 10.1515/cait-2015-0010
[68]	Liu S, Piao Y, Tahir M. Research on fusion technology based on low-light visible image and infrared image [J]. Optical Engineering, 2016, 55(12): 123104. doi: 10.1117/1.OE.55.12.123104
[69]	Li H, Wu X J, Kittler J. Infrared and visible image fusion using a deep learning framework[C]//2018 24th International Conference on Pattern Recognition, 2018: 2705-2710.
[70]	Ren X, Meng F, Hu T, et al. Infrared-visible image fusion based on convolutional neural networks (CNN)[C]//International Conference on Intelligent Science and Big Data Engineering, 2018: 301-307.
[71]	Ma J Y, Yu W, Liang P W, et al. FusionGAN: A generative adversarial network for infrared and visible image fusion [J]. Information Fusion, 2019, 48: 11-26. doi: 10.1016/j.inffus.2018.09.004
[72]	Li H, Liu L, Huang W, et al. An improved fusion algorithm for infrared and visible images based on multi-scale transform [J]. Infrared Physics & Technology, 2016, 74: 28-37.
[73]	Fu Z Z, Wang X, Xu J, et al. Infrared and visible images fusion based on RPCA and NSCT [J]. Infrared Physics & Technology, 2016, 77: 114-123.
[74]	Cvejic N, Bull D, Canagarajah N. Region-based multimodal image fusion using ICA bases [J]. IEEE Sensors Journal, 2007, 7(5): 743-751. doi: 10.1109/JSEN.2007.894926
[75]	Mou J, Gao W, Song Z. Image fusion based on non-negative matrix factorization and infrared feature extraction[C]// 2013 6th International Congress on Image and Signal Processing (CISP), 2013: 1046-1050.
[76]	Liu Z W, Feng Y, Chen H, et al. A fusion algorithm for infrared and visible based on guided filtering and phase congruency in NSST domain [J]. Optics and Lasers in Engineering, 2017, 97: 71-77. doi: 10.1016/j.optlaseng.2017.05.007
[77]	Bavirisetti D P, Dhuli R. Two-scale image fusion of visible and infrared images using saliency detection [J]. Infrared Physics & Technology, 2016, 76: 52-64.
[78]	Gan W, Wu X H, Wu W, et al. Infrared and visible image fusion with the use of multi-scale edge-preserving decomposition and guided image filter [J]. Infrared Physics & Technology, 2015, 72: 37-51.
[79]	Cui G M, Feng H J, Xu Z H, et al. Detail preserved fusion of visible and infrared images using regional saliency extraction and multi-scale image decomposition [J]. Optics Communications, 2015, 341: 199-209. doi: 10.1016/j.optcom.2014.12.032
[80]	Zhao J F, Zhou Q, Chen Y T, et al. Fusion of visible and infrared images using saliency analysis and detail preserving based image decomposition [J]. Infrared Physics & Technology, 2013, 56: 93-99.
[81]	Zhang B H, Lu X Q, Pei H Q, et al. A fusion algorithm for infrared and visible images based on saliency analysis and non-subsampled shearlet transform [J]. Infrared Physics & Technology, 2015, 73: 286-297.
[82]	Meng F, Song M, Guo B, et al. Image fusion based on object region detection and non-subsampled contourlet transform [J]. Computers & Electrical Engineering, 2017, 62: 375-383.
[83]	Cai J J, Cheng Q M, Peng M J, et al. Fusion of infrared and visible images based on nonsubsampled contourlet transform and sparse K-SVD dictionary learning [J]. Infrared Physics & Technology, 2017, 82: 85-95.
[84]	Yin M, Duan P H, Liu W, et al. A novel infrared and visible image fusion algorithm based on shift-invariant dual-tree complex shearlet transform and sparse representation [J]. Neurocomputing, 2017, 226: 182-191. doi: 10.1016/j.neucom.2016.11.051
[85]	Chai Y, Li H F, Qu J F. Image fusion scheme using a novel dual-channel PCNN in lifting stationary wavelet domain [J]. Optics Communications, 2010, 283(19): 3591-3602. doi: 10.1016/j.optcom.2010.04.100
[86]	Yang B, Li S T. Visual attention guided image fusion with sparse representation [J]. Optik, 2014, 125(17): 4881-4888. doi: 10.1016/j.ijleo.2014.04.036
[87]	Liu C H, Qi Y, Ding W R. Infrared and visible image fusion method based on saliency detection in sparse domain [J]. Infrared Physics & Technology, 2017, 83: 94-102.
[88]	Kong W W. Technique for gray-scale visual light and infrared image fusion based on non-subsampled shearlet transform [J]. Infrared Physics & Technology, 2014, 63: 110-118.
[89]	Adu J H, Gan J H, Wang Y, et al. Image fusion based on nonsubsampled contourlet transform for infrared and visible light image [J]. Infrared Physics & Technology, 2013, 61: 94-100.
[90]	Liu Z, Tsukada K, Hanasaki K, et al. Image fusion by using steerable pyramid [J]. Pattern Recognition Letters, 2001, 22(9): 929-939. doi: 10.1016/S0167-8655(01)00047-2
[91]	G Liu, Z L Jing, S Y Sun, et al. Image fusion based on expectation maximization algorithm and steerable pyramid [J]. Chinese Optics Letters, 2004, 2(7): 18-21.
[92]	Deng H, Ma Y. Image Fusion based on steerable pyramid and PCNN[C]//2009 Second International Conference on the Applications of Digital Information and Web Technologies, 2009: 569-573.
[93]	Zhan L, Zhuang Y, Huang L. Infrared and visible images fusion method based on discrete wavelet transform [J]. Journal of Computers, 2017, 28(2): 057-071.
[94]	Saeedi J, Faez K. Infrared and visible image fusion using fuzzy logic and population-based optimization [J]. Applied Soft Computing, 2012, 12(3): 1041-1054. doi: 10.1016/j.asoc.2011.11.020
[95]	Chang L H, Feng X C, Zhang R, et al. Image decomposition fusion method based on sparse representation and neural network [J]. Applied Optics, 2017, 56(28): 7969-7977. doi: 10.1364/AO.56.007969
[96]	Omri F, Foufou S, Abidi M. NIR and visible image fusion for improving face recognition at long distance[C]//International Conference on Image and Signal Processing, 2014: 549-557.
[97]	Singh S, Gyaourova A, Bebis G, et al. Infrared and visible image fusion for face recognition[C]//Biometric Technology for Human Identification, International Society for Optics and Photonics, 2004: 585-596.
[98]	Heo J, Kong S G, Abidi B R, et al. Fusion of visual and thermal signatures with eyeglass removal for robust face recognition[C]//2004 Conference on Computer Vision and Pattern Recognition Workshop, 2004: 122-122.
[99]	Abaza A, Bourlai T. On ear-based human identification in the mid-wave infrared spectrum [J]. Image Vision Computing, 2013, 31(9): 640-648. doi: 10.1016/j.imavis.2013.06.001
[100]	Uzair M, Mahmood A, Mian A, et al. Periocular region-based person identification in the visible, infrared and hyperspectral Imagery [J]. Neurocomputing, 2015, 149: 854-867. doi: 10.1016/j.neucom.2014.07.049
[101]	Han J G, Pauwels E J, de Zeeuw P. Fast saliency-aware multi-modality image fusion [J]. Neurocomputing, 2013, 111: 70-80. doi: 10.1016/j.neucom.2012.12.015
[102]	Schnelle S R, Chan A L. Enhanced target tracking through infrared-visible image fusion[C]//14th International Conference on Information Fusion, 2011: 1-8.
[103]	Jin X, Jiang Q, Yao S W, et al. A survey of infrared and visual image fusion methods [J]. Infrared Physics & Technology, 2017, 85: 478-501.
[104]	Toet A. Natural colour mapping for multiband nightvision imagery [J]. Information Fusion, 2003, 4(3): 155-166. doi: 10.1016/S1566-2535(03)00038-1
[105]	Toet A, Hogervorst M A. Progress in color night vision [J]. Optical Engineering, 2012, 51(1): 010901. doi: 10.1117/1.OE.51.1.010901
[106]	Davis J W, Sharma V. Background-subtraction using contour-based fusion of thermal and visible imagery [J]. Computer Vision and Image Understanding, 2007, 107(2-3): 162-182.
[107]	Niu Y F, Xu S T, Wu L Z, et al. Airborne infrared and visible image fusion for target perception based on target region segmentation and discrete wavelet transform [J]. Mathematical Problems in Engineering, 2012, 10: 732-748.
[108]	Bhatnagar G, Liu Z. A novel image fusion framework for night-vision navigation and surveillance [J]. Signal Image and Video Processing, 2015, 9: 165-175. doi: 10.1007/s11760-014-0740-6
[109]	Paramanandham N, Rajendiran K. Multi sensor image fusion for surveillance applications using hybrid image fusion algorithm [J]. Multimedia Tools and Applications, 2018, 77(10): 12405-12436. doi: 10.1007/s11042-017-4895-3
[110]	Tsagaris V, Anastassopoulos V. Fusion of visible and infrared imagery for night color vision [J]. Displays, 2005, 26(4): 191-196.
[111]	Hogervorst M A, Toet A. Fast natural color mapping for night-time imagery [J]. Information Fusion, 2010, 11(2): 69-77. doi: 10.1016/j.inffus.2009.06.005
[112]	Mendoza F, Lu R F, Cen H Y. Comparison and fusion of four nondestructive sensors for predicting apple fruit firmness and soluble solids content [J]. Postharvest Biology and Technology, 2012, 73: 89-98. doi: 10.1016/j.postharvbio.2012.05.012
[113]	Hanna B V, Gorbach A M, Gage F A, et al. Intraoperative assessment of critical biliary structures with visible range/infrared image fusion [J]. Journal of the American College of Surgeons, 2008, 206(6): 1227-1231. doi: 10.1016/j.jamcollsurg.2007.10.012
[114]	Eslami M, Mohammadzadeh A. Developing a spectral-based strategy for urban object detection from airborne hyperspectral TIR and visible data [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2016, 9(5): 1808-1816. doi: 10.1109/JSTARS.2015.2489838
[115]	Han L, Wulie B, Yang Y L, et al. Direct fusion of geostationary meteorological satellite visible and infrared images based on thermal physical properties [J]. Sensors, 2015, 15(1): 703-714. doi: 10.3390/s150100703
[116]	Li H G, Ding W R, Cao X B, et al. Image registration and fusion of visible and infrared integrated camera for medium-altitude unmanned aerial vehicle remote sensing [J]. Remote Sensing, 2017, 9(5): 441-469. doi: 10.3390/rs9050441
[117]	Chang X, Jiao L C, Liu F, et al. Multicontourlet-based adaptive fusion of infrared and visible remote sensing images [J]. IEEE Geoscience and Remote Sensing Letters, 2010, 7(3): 549-553. doi: 10.1109/LGRS.2010.2041323
[118]	Lu X C, Zhang J P, Li T, et al. Synergetic classification of long-wave infrared hyperspectral and visible images [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2015, 8(7): 3546-3557. doi: 10.1109/JSTARS.2015.2442594
[119]	Gargano M, Bertani D, Greco M, et al. A perceptual approach to the fusion of visible and NIR images in the examination of ancient documents [J]. Journal of Cultural Heritage, 2015, 16(4): 518-525. doi: 10.1016/j.culher.2014.09.006
[120]	Kim S J, Deng F, Brown M S. Visual enhancement of old documents with hyperspectral imaging [J]. Pattern Recognition, 2011, 44(7): 1461-1469. doi: 10.1016/j.patcog.2010.12.019
[121]	Feng Z J, Zhang X L, Yuan L Y, et al. Infrared target detection and location for visual surveillance using fusion scheme of visible and infrared images [J]. Mathematical Problems in Engineering, 2013, 2013(3): 831-842.
[122]	Zhao C, Guo Y, Wang Y. A fast fusion scheme for infrared and visible light images in NSCT domain [J]. Infrared Physics & Technology, 2015, 72: 266-275.
[123]	Zhang X L, Li X F, Li J. Validation and correlation analysis of metrics for evaluation performance of image fusion [J]. Acta Automatica Sinica, 2014, 40(2): 306-315. (in Chinese)
[124]	Aardt V, J an. Assessment of image fusion procedures using entropy, image quality, and multispectral classification [J]. Journal of Applied Remote Sensing, 2008, 2(1): 1-28.
[125]	Yun J, R ao. In-fibre Bragg grating sensors [J]. Measurement Science Technology, 1997, 8(4): 355. doi: 10.1088/0957-0233/8/4/002
[126]	Zhu X X, Bamler R. A sparse image fusion algorithm with application to pan-sharpening [J]. IEEE Transactions on Geoscience Remote Sensing, 2012, 51(5): 2827-2836.
[127]	Xydeas C S, V. P V. Objective image fusion performance measure [J]. Military Technical Courier, 2000, 56(4): 181-193.
[128]	Deshmukh M, Bhosale U. Image fusion and image quality assessment of fused images [J]. International Journal of Image Processing, 2010, 4(5): 484-508.
[129]	Wang Z, Bovik A C, Sheikh H R, et al. Image quality assessment: From error visibility to structural similarity [J]. IEEE Trans Image Process, 2004, 13(4): 600-612. doi: 10.1109/TIP.2003.819861
[130]	Li S, Yang B, Hu J. Performance comparison of different multi-resolution transforms for image fusion [J]. Information Fusion, 2011, 12(2): 74-84. doi: 10.1016/j.inffus.2010.03.002
[131]	Qu X B, Yan J W, Xiao H Z, et al. Image fusion algorithm based on spatial frequency-motivated pulse coupled neural networks in nonsubsampled contourlet transform domain [J]. Acta Automatica Sinica, 2008, 34(12): 1508-1514. doi: 10.1016/S1874-1029(08)60174-3

[1]	庞忠祥, 刘勰, 刘桂华, 龚泿军, 周晗, 罗洪伟. 并行多特征提取网络的红外图像增强方法 . 红外与激光工程, 2022, 51(8): 20210957-1-20210957-9. doi: 10.3788/IRLA20210957
[2]	闵莉, 曹思健, 赵怀慈, 刘鹏飞. 改进生成对抗网络实现红外与可见光图像融合 . 红外与激光工程, 2022, 51(4): 20210291-1-20210291-10. doi: 10.3788/IRLA20210291
[3]	宦克为, 李向阳, 曹宇彤, 陈笑. 卷积神经网络结合NSST的红外与可见光图像融合 . 红外与激光工程, 2022, 51(3): 20210139-1-20210139-8. doi: 10.3788/IRLA20210139
[4]	李霖, 王红梅, 李辰凯. 红外与可见光图像深度学习融合方法综述 . 红外与激光工程, 2022, 51(12): 20220125-1-20220125-20. doi: 10.3788/IRLA20220125
[5]	赵璐, 熊森. 多视角红外图像目标识别方法 . 红外与激光工程, 2021, 50(11): 20210206-1-20210206-6. doi: 10.3788/IRLA20210206
[6]	戴进墩, 刘亚东, 毛先胤, 盛戈皞, 江秀臣. 基于FDST和双通道PCNN的红外与可见光图像融合 . 红外与激光工程, 2019, 48(2): 204001-0204001(8). doi: 10.3788/IRLA201948.0204001
[7]	刘永峰, 王年, 王峰, 李从利, 刘晓, 徐国明. 基于谱间相似性的高光谱图像稀疏超分辨率算法 . 红外与激光工程, 2019, 48(S1): 181-192. doi: 10.3788/IRLA201948.S128003
[8]	郭全民, 王言, 李翰山. 改进IHS-Curvelet变换融合可见光与红外图像抗晕光方法 . 红外与激光工程, 2018, 47(11): 1126002-1126002(9). doi: 10.3788/IRLA201847.1126002
[9]	薛俊韬, 倪晨阳, 杨斯雪. 特征聚类的局部敏感稀疏图像修复 . 红外与激光工程, 2018, 47(11): 1126001-1126001(9). doi: 10.3788/IRLA201847.1126001
[10]	郭全民, 董亮, 李代娣. 红外与可见光图像融合的汽车抗晕光系统 . 红外与激光工程, 2017, 46(8): 818005-0818005(6). doi: 10.3788/IRLA201746.0818005
[11]	曾祥通, 张玉珍, 孙佳嵩, 喻士领. 颜色对比度增强的红外与可见光图像融合方法 . 红外与激光工程, 2015, 44(4): 1198-1202.
[12]	孙斌, 常本康, 张俊举, 王贵圆, 李英杰. 基于红外运动目标分割的夜视融合系统设计 . 红外与激光工程, 2015, 44(7): 2064-2069.
[13]	杨桄, 童涛, 孟强强, 孙嘉成. 基于梯度加权的红外与可见光图像融合方法 . 红外与激光工程, 2014, 43(8): 2772-2779.
[14]	杨风暴, 蔺素珍. 基于变换域多合成规则的双色中波红外图像融合 . 红外与激光工程, 2014, 43(11): 3663-3669.
[15]	王金玲, 贺小军, 宋克非. 采用区域互信息的多光谱与全色图像融合算法 . 红外与激光工程, 2014, 43(8): 2757-2764.
[16]	张宝辉, 闵超波, 窦亮, 张俊举, 常本康. 目标增强的红外与微光图像融合算法 . 红外与激光工程, 2014, 43(7): 2349-2353.
[17]	杨扬, 戴明, 周箩鱼. 基于均匀离散曲波变换的多聚焦图像融合 . 红外与激光工程, 2013, 42(9): 2547-2552.
[18]	李新娥, 任建岳, 吕增明, 沙巍, 张立国, 何斌. NSCT域内基于改进PCNN和区域能量的多光谱和全色图像融合方法 . 红外与激光工程, 2013, 42(11): 3096-3102.
[19]	孙韶媛, 李琳娜, 赵海涛. 采用KPCA和BP神经网络的单目车载红外图像深度估计 . 红外与激光工程, 2013, 42(9): 2348-2352.
[20]	赵春晖, 刘振龙. 改进的红外图像神经网络非均匀性校正算法 . 红外与激光工程, 2013, 42(4): 1079-1083.

点击查看大图

图(7) / 表(4)

计量

文章访问数: 1540
HTML全文浏览量: 459
PDF下载量: 388
被引次数: 0

全文HTML

0. 引　言

在图像处理领域中，单一传感器获取的图像通常只具备某一方面的信息，已无法满足市场需求，因此图像融合技术应运而生。红外图像主要依靠物体自身的热辐射进行成像，突出背景中隐藏的热目标，其不受光照条件、天气的影响，但对比度较低，纹理细节不丰富^[1]。可见光图像通过反射可见光进行成像，纹理细节和对比度更适合人类的视觉感知，但可见光图像在烟雾、夜间等条件下的成像效果差^[2]。基于此，两者融合后能够获得一幅既有可见光图像边缘、细节信息又有红外热辐射目标信息的互补融合图像。随着目标检测与识别、军事监视等应用需求的不断提高，红外与可见光图像的融合技术成为该领域研究的热点方向。安防领域，红外与可见光融合图像可准确识别黑暗环境、化妆打扮^[3]、佩戴眼镜^[4]等条件下的人脸，为商业应用、公安执法等提供便利需求；军事领域，红外与可见光融合图像可实现恶劣环境下隐藏目标的识别与跟踪^[5]；智能交通领域，红外与可见光融合图像应用于行人检测^[6]、车辆识别与车距检测^[7]、道路障碍物分类^[8]；农业生产领域，红外与可见光融合图像可应用于水果的成熟度检测^[9]、植物的病态检测^[10]等。

近几十年来，大量红外与可见光图像的融合方法相继被提出，并在实际应用中得到推广。目前的图像融合综述中，大部分文献都选择对整个图像融合领域进行综述，较少文献针对红外与可见光图像融合方法进行详细阐述；在部分阐述红外与可见光图像融合的综述中，只对现阶段的图像融合方法进行简要分析，没有与实际应用相结合，缺少应用实例。文中首先综述了红外与可见光图像常用融合方法的研究现状，其次概述了红外与可见光图像融合的主要应用，以及用于评价融合质量的性能指标，并针对选定的六个应用场景，选择九种该领域典型的融合方法和六个图像质量评价指标进行实验分析，最后对红外与可见光图像融合技术的发展与应用进行总结与展望。

3. 评价指标

图像融合质量的评价方法主要分为主观法和客观法。主观法是将图像划分为五个等级，分别是“特别好”、“好”、“一般”、“差”和“特别差”。主观法属于定性分析，具有较强的主观意识，对于两幅融合效果较为相近的图像无法做出客观的判断，同时，相邻的评价级别没有明确的划分界限，存在着一定的不足。客观评价法是通过特定的公式计算图像的相关指标信息以对融合图像进行定量分析，主要分为无参考图像与有参考图像两类评价指标^[123]。

常用的图像融合评价指标的定义及说明如表2、表3所示。设源图像的尺寸大小为M×N，其中A,B,S表示源图像，F和S表示融合图像和参考图像；µ为图像的灰度均值；p_k表示像素值为k的概率（k=0,1,2,···,255）。设Z=A，B，S，F，R；i=1,2,···,M；j=1,2,···,N。其中，Z(i, j)表示图像Z的灰度值；ΔZ表示图像Z的差分；表示边缘信息量；P_Z和P_ZZ分别表示图像的概率密度函数和图像间的联合概率密度；函数表示图像的边缘强度函数。

表 2 无参考图像的评价指标

Table 2. Evaluation index without reference image

Evaluation indicators	Definition	Explanation
IE^[124]	${{IE} } = - \displaystyle\sum\limits_{i = 0}^{L - 1} { {p_i} } {\log _2}{p_i}$	Amount of information contained in an image increases as IE improves
SD^[125]	${{SD} } = \sqrt {\frac{1}{ {MN} }\displaystyle\mathop \sum \limits_{i = 1}^M \displaystyle\mathop \sum \limits_{j = 1}^N { {\left( {F\left( {i,j} \right) - \mu } \right)}^2} }$	Deviation between pixels and pixel mean is evaluated by SD, which improves with the increase of SD, resulting in improvement in contrast of images
AG^[126]	${{AG} } = \frac{1}{ {\left( {M - 1} \right)\left( {N - 1} \right)} }\displaystyle\sum\limits_{i = 1}^{M - 1} {\displaystyle\sum\limits_{j = 1}^{N - 1} {\sqrt {\frac{ {\left( {\vartriangle Z_i^2 + \vartriangle Z_j^2} \right)} }{2} } } }$	A wealth of detailed information is exhibited by a high value of AG which is used to reflect the gray variation of the image
Q^AB/F^[127]	${ {{Q} }^{{ {AB/F} } } } = \frac{ {\displaystyle\sum\limits_{i = 0}^{M - 1} {\displaystyle\sum\limits_{j = 0}^{N - 1} {\left( {Q_{\left( {i,j} \right)}^{AF}w_{\left( {i,j} \right)}^A + Q_{\left( {i,j} \right)}^{BF}w_{\left( {i,j} \right)}^B} \right)} } } }{ {\displaystyle\sum\limits_{i = 0}^{M - 1} {\displaystyle\sum\limits_{j = 0}^{N - 1} {\left( {w_{\left( {i,j} \right)}^A + w_{\left( {i,j} \right)}^B} \right)} } } }$	Fusion effect of image exhibits better as the value of Q^AB/F which is used to evaluate the transfer of edge information, approaches 1
MI^[2]	$\begin{array}{l}{I_{ { {FA} } } }(i,j) = \displaystyle\sum\limits_{i = 1}^{M - 1} {\displaystyle\sum\limits_{j = 1}^{N - 1} { {P_{ { {FA} } } }\left( {i,j} \right)} } {\log _2}\dfrac{ { {P_{FA} }\left( {i,j} \right)} }{ { {P_F}\left( i \right){P_B}\left( j \right)} }\\MI_{AB}^F = {I_{ { {FA} } } } + {I_{ { {FB} } } }\end{array}$	Amount of information preserved in an image increases with the improvement of MI which is utilized to characterize inheritance of image information
CC^[128]	${{CC} } = \frac{ {\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ {\left( {F\left( {i,j} \right) - {\mu _F} } \right) \times \left( {S\left( {i,j} \right) - {\mu _S} } \right)} \right]} } } }{ {\sqrt {\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ { { {\left( {F\left( {i,j} \right) - {\mu _F} } \right)}^2} } \right]\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {\left[ { { {\left( {S\left( {i,j} \right) - {\mu _S} } \right)}^2} } \right]} } } } } } }$	Similarity between images improves as CC increases, thereby preserving more image information

表 3 基于参考图像的评价指标

Table 3. Evaluation index based on reference image

Evaluation indicators	Definition	Explanation
SSIM^[129]	$SSI{M_{RF}} = \displaystyle\prod\limits_{i = 1}^3 {\dfrac{{2{\mu _R}{\mu _F} + {c_i}}}{{\mu _R^2 + \mu _F^2 + {c_i}}}} $	Similarity between source image and fusion image enhances with the increase of SSIM which is used to measure image luminance, contrast and structural distortion level
RMSE^[2]	$RMSE = \sqrt {\dfrac{1}{{M \times N}}\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {{{\left[ {R\left( {i,j} \right) - F\left( {i,j} \right)} \right]}^2}} } } $	Performance indicators of images promote with the reduction of RMSE
PSNR^[2]	$PSNR = 10 \cdot \lg \dfrac{{{{\left( {255^2 \times M \times N} \right)}}}}{{\displaystyle\sum\limits_{i = 1}^M {\displaystyle\sum\limits_{j = 1}^N {{{\left[ {R\left( {i,j} \right) - F\left( {i,j} \right)} \right]}^2}} } }}$	The distortion of images decreases as the improvement of PSNR using to evaluate whether the image noise is suppressed

无参考图像的评价指标又可以分为基于单一图像的评价指标和基于源图像的评价指标。基于单一图像的图像评价方法是基于最终融合图像所进行的图像性能评价，包括信息熵（Information Entropy, IE）、标准差（Standard Deviation, SD）、平均梯度（Average Gradient, AG）、空间频率等，其通过不同的方式度量融合图像本身的信息量、灰度值分布等。IE和SD分别通过统计图像灰度分布和度量像素灰度值相较于灰度均值的偏离程度来反映融合图像的信息量。AG和空间频率反映图像的灰度变化率和清晰度。基于源图像的评价指标通常只考虑图像的某一统计特征，与主观评价的结果有出入。无参考图像的评价指标还有一类是基于源图像进行衡量，主要从信息论的角度出发，度量融合图像从源图像处所提取的信息。常用的有互信息（Mutual Information, MI）、相关系数（Correlation Coefficient, CC）以及边缘信息传递量的Q^AB/F。此外，还有从信息熵引申出来的交叉熵、联合熵，IE反映的仅仅是融合图像的信息量，无法说明图像的整体融合效果，而交叉熵和联合熵可弥补该不足。

基于参考图像的评价指标是通过比较源图像与标准参考图像间灰度值、噪声等的差异以评价其性能。主要包括结构相似度（Structural Similarity, SSIM）、均方根误差（Root-Mean-Square Error, RMSE）、峰值信噪比（Peak Signal-to-Noise Ratio, PSNR）等。SSIM是通过比较图像间的亮度、对比度、结构失真水平的差异性来评价图像的性能；RMSE、偏差指数和扭曲程度都是通过比较图像间像素的灰度值进行评估；PSNR是通过度量融合图像的噪声是否得到抑制来评价图像的质量。在实际的图像融合过程中，往往没有参考图像作为一个标准，所以该评价方法目前还未大规模应用。

5. 结　论

文中综述了目前常用的红外与可见光图像融合方法的发展进程及应用研究，重点阐述基于多尺度变换、稀疏表示、神经网络等方法的核心思想、发展进程、优势不足，并对常用的融合方法进行总结对比；介绍了红外与可见光图像融合方法的应用领域，包括目标识别、目标检测与跟踪、夜视监控等；总结了九种目前较常使用的图像融合评价指标；最后针对六种典型场景，选择代表性的融合方法和六种评价指标进行实验对比。

针对当前红外与可见光图像融合算法存在的问题，有三方面改进建议：

（1）基于多尺度变换的图像融合方法虽然已经成为图像融合领域的热门研究方向，但还可从固定基函数和分解层数的自适应设计方面进一步改进算法；（2）未来可根据不同场景的特点，对不同融合方法进行组合式创新。但在构建混合模型时，还需综合考虑算法的性能表现；（3）基于深度学习的融合方法是未来的重点研究方向。近年来提出的端到端网络模型解决了卷积神经网络模型中存在的大部分问题，但还需根据红外与可见光图像成像原理来设计有针对性的损失函数，并在不同场景中采集大量的数据集，从而进一步提高该模型的泛化能力。

针对红外与可见光图像融合应用方面的挑战，可从以下方面进一步探索：

（1）提高实际场景中红外与可见光图像的配准精度，将空间变换设为变量因素，实现配准和融合的同步，以减少伪影现象；（2）减少红外图像的噪声，引入显著性检测算法，提取主要红外目标减少噪声干扰，或设计多孔径成像系统并对所得红外图像进行超分，在提高图像分辨率的同时扩大视场范围；（3）提高融合算法的实时性，将并行运算应用于图像融合领域，实现算法时间和空间上的并行，从而提高运行效率。

参考文献 (131)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

红外与可见光图像融合技术的研究进展

doi: 10.3788/IRLA20200467

作者简介:
沈英，女，教授，博士，主要从事光机电一体化方面的研究

通讯作者: 王舒，男，助理研究员，博士，主要从事光学成像方面的研究。

Research progress of infrared and visible image fusion technology

计量

红外与可见光图像融合技术的研究进展

doi: 10.3788/IRLA20200467

福州大学机械工程及自动化学院，福建福州 350116

作者简介:
沈英，女，教授，博士，主要从事光机电一体化方面的研究

通讯作者: 王舒，男，助理研究员，博士，主要从事光学成像方面的研究。

English Abstract

Research progress of infrared and visible image fusion technology

College of Mechanical Engineering and Automation, Fuzhou University, Fuzhou 350116, China

全文HTML

1.1. 基于多尺度变换的融合方法

1.1.1. 金字塔变换

1.1.2. 小波变换

1.1.3. 非下采样多尺度多方向几何变换

1.1.4. 其他方法

1.2. 基于稀疏表示的融合方法

1.3. 基于神经网络的融合方法

1.4. 其他方法

1.5. 图像融合规则

2.1. 目标识别

2.2. 目标检测与跟踪

2.3. 夜视监控

2.4. 其他融合应用

目录

留言板

红外与可见光图像融合技术的研究进展

doi: 10.3788/IRLA20200467

作者简介: 沈英，女，教授，博士，主要从事光机电一体化方面的研究

通讯作者: 王舒，男，助理研究员，博士，主要从事光学成像方面的研究。

Research progress of infrared and visible image fusion technology

计量

出版历程

红外与可见光图像融合技术的研究进展

doi: 10.3788/IRLA20200467

福州大学 机械工程及自动化学院，福建 福州 350116

作者简介: 沈英，女，教授，博士，主要从事光机电一体化方面的研究

通讯作者: 王舒，男，助理研究员，博士，主要从事光学成像方面的研究。

English Abstract

Research progress of infrared and visible image fusion technology

College of Mechanical Engineering and Automation, Fuzhou University, Fuzhou 350116, China

全文HTML

1.1. 基于多尺度变换的融合方法

1.1.1. 金字塔变换

1.1.2. 小波变换

1.1.3. 非下采样多尺度多方向几何变换

1.1.4. 其他方法

1.2. 基于稀疏表示的融合方法

1.3. 基于神经网络的融合方法

1.4. 其他方法

1.5. 图像融合规则

2.1. 目标识别

2.2. 目标检测与跟踪

2.3. 夜视监控

2.4. 其他融合应用

目录

作者简介:
沈英，女，教授，博士，主要从事光机电一体化方面的研究

福州大学机械工程及自动化学院，福建福州 350116

作者简介:
沈英，女，教授，博士，主要从事光机电一体化方面的研究