RGBT dual-modal Siamese tracking network with feature fusion

Shen Yali

doi:10.3788/IRLA20200459

Infrared imaging technology has been widely used for object tracking in military, remote sensing, security and other fields. However, thermal infrared images generally suffer from low contrast and blurry targets. Therefore, it has great importance of fusing infrared images with visible images. Compared with single-modal RGB trackers, dual-modal RGBT(RGB/Thermal infrared) trackers are more robust to illumination variation and fog. In this paper, a RGBT dual-modal siamese tracking network with feature fusion was proposed. Convolutional features extracted from the visible image and infrared image were fused to improve the appearance feature discrimination. The network can use the training data for end-to-end off-line training. Experimental results on the public RGBT234 dataset demonstrate that our tracker achieves robust and persistent tracking in complex scenarios.

HTML

4. 结　论

针对可见光图像和热红外图像在视觉跟踪任务上的互补优势，文中利用特征融合提出了可见光-热红外双模态孪生跟踪网络模型。该网络首先将RGBT双模态图像中提取的深度特征进行堆叠从而实现特征融合，然后对网络模板分支和搜索分支上的融合特征输入相关滤波层实现快速的目标跟踪。文中提出网络对光照变化、云雾遮挡具有较强的鲁棒性，并且可以利用训练数据进行端到端的离线训练。实验表明，和基准算法CFNet+RGBT相比，文中提出双模态视觉跟踪网络在复杂跟踪场景中能够实现鲁棒跟踪，并具有一定的性能提升。

Reference (16)

[1]	Chen X J, Yang Y M. Realization of dual-band fire detector based on infrared video [J]. Journal of Electronic Measurement and Instrumentation, 2016, 33(3): 473-479.
[2]	Li C L, Liang X Y, Lu Y J, et al. Rgb-t object tracking: benchmark and baseline [J]. Pattern Recognition, 2019, 96: 106977.
[3]	Guan H, Xue X Y, An Z Y. Online single object video tracking: A survey [J]. Mini-Micro Systems, 2017, 38(1): 147-153.
[4]	Yilmaz A, Javed O, Shah M. Object tracking: A survey [J]. ACM Computing Surveys, 2006, 38(4): 1-45.
[5]	Wu Y, Blasch E, Chen G S, et al. Multiple source data fusion via sparse representation for robust visual tracking[C]//International Conference on Information Fusion, 2011.
[6]	Sun F, Liu H. Fusion tracking in color and infrared images using joint sparse representation [J]. Science China Information Sciences, 2012, 55(3): 590-599.
[7]	Li C, Cheng H, Hu S, et al. Learning collaborative sparse representation for grayscale-thermal tracking [J]. IEEE Transactions on Image Processing, 2016, 25(12): 5743-5756.
[8]	Li C, Nan Z. Lu Y, et al. Weighted sparse representation regularized graph learning for rgb-t object tracking[C]//ACM on Multimedia Conference, 2017.
[9]	Li C, Wu X, Zhao N, et al. Fusing two-stream convolutional neural networks for rgb-t object tracking [J]. Neurocomputing, 2018, 28(1): 78-85.
[10]	Tao R, Gavves E, Smeulders A W M. Siamese instance search for tracking[C]//IEEE Conference on Computer Vision and Pattern Recognition, 2016: 1420-1429.
[11]	Held D, Thrun S, Savarese S. Learning to track at 100 FPS with deep regression networks[C]//European Conference on Computer Vision, 2015, 15(12): 625-637.
[12]	Bertinetto L, Valmadre J, Henriques J F, et al. Fully-convolutional siamese networks for object tracking[C]//IEEE Conference on Computer Vision, 2015: 3119-3127.
[13]	Valmadre J, Bertinetto L, Henriques J, et al. End-to-end representation learning for correlation filter based tracking[C]//IEEE Conference on Computer Vision and Pattern Recognition, 2017: 4057-4068.
[14]	Wang Q, Gao J, Xing J, et al. DCFNet: Discriminant correlation filters network for visual tracking[C]//IEEE Conference on Computer Vision and Pattern Recognition, 2017: 3027-3038.
[15]	Xiong Y J, Zhang H T, Deng X. RGBT dual-modal tracking with weighted discriminative correlation filters [J]. Journal of Signal Processing, 2020, 36(9): 1590-1597. (in Chinese)
[16]	Vedaldi A, Lenc K. Matconvnet: convolutional neural networks for matlab[C]//Association for Computing Machinery, 2015: 689-692.

Layer	Kernel size	Channel×Map	Stride	Size	Channel
Input	11×11			255×255	3
Conv1	3×3	16×3	2	123×123	16
Pool1	5×5		2	61×61	16
Conv2	3×3	32×16	1	57×57	32
Pool2	3×3		1	55×55	32
Conv3	3×3	64×32	1	53×53	64
Conv4	3×3	128×64	1	51×51	128
Conv5	3×3	32×128	1	49×49	32

RGBT dual-modal Siamese tracking network with feature fusion

doi: 10.3788/IRLA20200459

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views