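A visual tracking method via object detection based on deep learning

Tang Cong^1,2, Ling Yongshun^1,2, Yang Hua^1,2, Yang Xing^1,2, Zheng Chao^1,2

doi: 10.3788/IRLA201847.0526001

1. National University of Defense Technology, Hefei 230037, China;
2. State Key Laboratory of Pulsed Power Laser Technology, Hefei 230037, China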

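Funding: National Natural Science Foundation of China (61405248, 61503394); Natural Science Foundation of Anhui Province (1708085MF137)

Author biography: Tang Cong (1989-), male, PhD candidate, working mainly on computer vision, deep learning, and pattern recognition. Email: tangcong_eei@163.com

CLC number: TP391.4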


Abstract: A visual tracking method based on deep-learning object detection is proposed. To exploit the strength of deep learning in feature representation, the regression-based deep detection model SSD (Single Shot Multibox Detector) is used to extract candidate objects, and the color histogram feature and the HOG (Histogram of Oriented Gradient) feature are then combined to select the target among the candidates, which accomplishes the tracking. To improve the detection performance of the deep model, a multi-scale object search map is constructed so that objects at different scales can be detected in a single image. Eight representative video sequences were selected from a standard tracking benchmark, and the method was compared against six representative tracking methods. The results show that the proposed method achieves better overall tracking performance than the compared algorithms and is robust to pose variation, scale variation, rotation, illumination variation, and background clutter.
- visual tracking /
- deep learning /
- SSD /
- non-online updating
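
The selection step summarized in the abstract (detector candidates filtered by color-histogram and HOG features) can be illustrated with a minimal sketch. The abstract does not say how the two features are combined, so everything below is an assumption made for illustration: the equal weighting, the Bhattacharyya coefficient as the similarity measure, the simplified single-cell orientation histogram standing in for full HOG, and all function names. The candidate boxes would come from a detector such as SSD; here they are plain `(x0, y0, x1, y1)` tuples.

```python
import math

def color_histogram(patch, bins=8):
    """Normalized per-channel histogram of a patch given as rows of (r, g, b) ints."""
    hist = [0.0] * (3 * bins)
    for row in patch:
        for pixel in row:
            for c, value in enumerate(pixel):
                hist[c * bins + min(int(value) * bins // 256, bins - 1)] += 1.0
    total = sum(hist) or 1.0
    return [h / total for h in hist]

def orientation_histogram(patch, bins=9):
    """Simplified HOG-like descriptor (assumption): one magnitude-weighted
    histogram of unsigned gradient orientations over the whole patch,
    without the cell/block structure of full HOG."""
    gray = [[sum(p) / 3.0 for p in row] for row in patch]
    hist = [0.0] * bins
    for y in range(1, len(gray) - 1):
        for x in range(1, len(gray[0]) - 1):
            gx = gray[y][x + 1] - gray[y][x - 1]
            gy = gray[y + 1][x] - gray[y - 1][x]
            angle = math.atan2(gy, gx) % math.pi  # unsigned orientation
            index = min(int(angle / math.pi * bins), bins - 1)
            hist[index] += math.hypot(gx, gy)
    total = sum(hist) or 1.0
    return [h / total for h in hist]

def bhattacharyya(p, q):
    """Bhattacharyya coefficient: 1.0 for identical normalized histograms."""
    return sum(math.sqrt(a * b) for a, b in zip(p, q))

def crop(image, box):
    """Cut the region (x0, y0, x1, y1) out of an image stored as rows of pixels."""
    x0, y0, x1, y1 = box
    return [row[x0:x1] for row in image[y0:y1]]

def select_target(image, candidate_boxes, template, weight=0.5):
    """Return (box, score) of the candidate most similar to the template.
    `weight` balances color against orientation similarity (hypothetical)."""
    t_color = color_histogram(template)
    t_orient = orientation_histogram(template)
    best_box, best_score = None, -1.0
    for box in candidate_boxes:
        patch = crop(image, box)
        score = (weight * bhattacharyya(color_histogram(patch), t_color)
                 + (1.0 - weight) * bhattacharyya(orientation_histogram(patch), t_orient))
        if score > best_score:
            best_box, best_score = box, score
    return best_box, best_score
```

In this sketch the template is a fixed patch of the target, for example cropped from the first frame; keeping it fixed is one possible reading of the "non-online updating" keyword, not a detail confirmed by the abstract.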


