Three experiments were conducted in this section: a comparison experiment against other depth reconstruction methods, demonstrating the effectiveness and superiority of the proposed method; an ablation experiment, verifying the rationality and necessity of the proposed network structure and loss function; and an upsampling experiment, verifying the importance of the pre-upsampling strategy. All networks were trained on the NYU V2 dataset [15], and the quantitative results are reported with the Root Mean Square Error (RMSE) metric, in meters.
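For reference, the RMSE used for all quantitative results is assumed here to be the standard per-pixel definition over the reconstructed depth map:

$$\mathrm{RMSE}=\sqrt{\frac{1}{N}\sum_{i=1}^{N}\left(d_{i}-\hat{d}_{i}\right)^{2}}$$

where $d_i$ is the ground-truth depth of pixel $i$, $\hat{d}_i$ is the reconstructed depth, and $N$ is the number of pixels; the result is reported in meters.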
The test data were collected with the experimental setup described in Section 1.1; the TCSPC histograms obtained from the SPAD detector were preprocessed and then fed into the trained network. The ground-truth data were obtained by preprocessing the raw output of the SPAD detector, applying a median filter to remove the detector dark counts, and finally thresholding the image reflectivity, with pixels whose reflectivity falls below the threshold set as background.
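As a rough illustration of this ground-truth pipeline, the sketch below applies a median filter followed by reflectivity thresholding. The function name, kernel size, and threshold value are illustrative assumptions; the paper does not specify the exact parameters.

```python
import numpy as np
from scipy.ndimage import median_filter

def make_ground_truth(depth_raw, reflectivity, refl_threshold=0.05, kernel=3):
    """Sketch: median-filter the preprocessed SPAD depth map to suppress
    dark counts, then set low-reflectivity pixels to background."""
    depth = median_filter(depth_raw, size=kernel)   # removes isolated dark-count spikes
    depth[reflectivity < refl_threshold] = 0.0      # reflectivity below threshold -> background
    return depth
```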
The quantitative results of the comparison experiment are listed in Table 1; the proposed method achieves the best results in every scene.
Table 1. Quantitative results of the comparison experiment (RMSE, m)
The comparison experiment compares the proposed method with MLE, the method of He et al. [16], and the method of Lindell et al. [10]. Compared with the traditional methods, i.e., MLE and the method of He et al. [16], the neural-network-based approaches, i.e., the method of Lindell et al. [10] and the proposed method, can learn the complex nonlinear mapping between input and output and flexibly adapt to different imaging scenes. MLE does not adopt a sensor-fusion strategy; it assumes a fixed probability model and cannot remove the detector dark counts or the outliers produced during detection, so its reconstructions still contain substantial noise. The method of He et al. [16] locates object edges with a guidance image: the filter performs mean filtering in smooth regions of the image, while at edges it performs no filtering, or only slight filtering, thereby preserving object edges. This method, however, cannot remove noise at object edges. The data-driven method of Lindell et al. [10] adopts a sensor-fusion strategy and a multi-scale approach, but it fuses the intensity features only with the depth feature map at the largest scale; it does not make full use of the intensity information and causes severe depth-missing artifacts. As shown in Figure 3, the MLE method cannot completely remove the noise, while the method of He et al. [16] over-smooths object edges. The convolutional neural network of Lindell et al. [10] can reconstruct the scene, but it loses depth along some edges, especially for distant objects and regions with few depth values. The proposed method reliably recovers the scene depth and is also robust when reconstructing distant and small objects.
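The per-pixel MLE baseline can be sketched as follows, assuming a Poisson observation model with a known pulse shape and negligible background; the paper's exact MLE formulation may differ. Under this model, the log-likelihood of a time shift is maximized (up to shift-independent terms) by correlating the histogram with the log of the pulse.

```python
import numpy as np

def mle_depth(histogram, pulse, bin_width_s, c=3e8):
    """Sketch of per-pixel MLE depth: cross-correlate the TCSPC
    histogram with log(pulse) and take the best-matching time bin
    (up to a constant offset from the correlation alignment)."""
    log_pulse = np.log(np.maximum(pulse, 1e-12))     # avoid log(0)
    score = np.correlate(histogram, log_pulse, mode="same")
    t_bin = int(np.argmax(score))                    # most likely time-of-flight bin
    return 0.5 * c * t_bin * bin_width_s             # round trip -> one-way depth in meters
```

Because this estimator treats every pixel independently and cannot distinguish dark counts or outliers from signal, the residual noise noted above is expected.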
The ablation study on the network structure compares a network without the attention module and a network without intensity guidance; the results are shown in Figure 4(c) and (d). The network without the attention module pays equal attention to every part of the feature map, while the network without intensity guidance cannot extract finer detail features such as edges; both reconstruct poorly. The proposed network uses intensity guidance and introduces an attention mechanism, so it can learn detail features from the intensity image and focus on the feature-rich regions of the fused data; it removes the vast majority of the noise and produces sharp object edges.
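The paper does not give layer-level details at this point, but the role of the attention module in the fusion can be illustrated with a minimal PyTorch sketch; the module name and all layer shapes are assumptions.

```python
import torch
import torch.nn as nn

class FusionAttention(nn.Module):
    """Sketch: fuse depth and intensity features, then re-weight the
    fused map with a learned spatial attention mask so that
    feature-rich regions (e.g. edges) receive more attention."""
    def __init__(self, depth_ch, intensity_ch):
        super().__init__()
        self.fuse = nn.Conv2d(depth_ch + intensity_ch, depth_ch, 3, padding=1)
        self.attn = nn.Sequential(nn.Conv2d(depth_ch, depth_ch, 1), nn.Sigmoid())

    def forward(self, depth_feat, intensity_feat):
        fused = self.fuse(torch.cat([depth_feat, intensity_feat], dim=1))
        return fused * self.attn(fused)   # attention mask in [0, 1]
```

Without the mask, every spatial location would contribute equally, which matches the behaviour of the attention-free ablation above.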
Figure 4. (a) Input intensity image of the network; (b) Result of the proposed method; (c) Result of the network without the attention module; (d) Result of the network without intensity guidance; (e) Result of the network trained with the loss function without the ordinal regression loss; (f) Result of the network trained with the loss function without the KL divergence
The ablation study on the loss function trains the network with a loss function that omits the ordinal regression loss and with one that omits the KL divergence; the results are shown in Figure 4(e) and (f). The network trained without the ordinal regression loss cannot reconstruct complete object edges, because the KL divergence attends to the overall photon distribution over the TCSPC histogram: it only filters out background photons that differ markedly from the signal photons, and cannot remove background pixels at object edges that are weakly affected by echo photons. Conversely, the network trained without the KL divergence produces reconstructions with missing depth inside objects and jagged edges, because the ordinal regression loss considers only the local ordinal relationship between time bins and ignores the photon-count distribution over the whole temporal dimension. The loss function designed in this paper combines the KL divergence and the ordinal regression loss with different weights, attending both to the overall photon distribution in the temporal dimension and to the ordinal relationship between time bins. The network trained with this loss function yields reconstructions that preserve the object contours and have spatially continuous pixels.
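A rough sketch of such a combined loss is given below; the weights and the exact form of the ordinal regression term are illustrative assumptions, not the paper's values. The KL term matches the overall photon distribution along the time axis, while the ordinal term compares cumulative distributions over the time bins, which is sensitive to the bin ordering rather than only to the total mass.

```python
import torch
import torch.nn.functional as F

def histogram_loss(pred_hist, target_hist, w_kl=1.0, w_or=0.5):
    """Sketch of a KL + ordinal-regression loss for (B, T, H, W)
    TCSPC histogram volumes, with T the time-bin dimension."""
    log_p = F.log_softmax(pred_hist, dim=1)        # predicted photon distribution (log)
    q = F.softmax(target_hist, dim=1)              # target photon distribution

    # Overall distribution along the temporal dimension.
    kl = F.kl_div(log_p, q, reduction="batchmean")

    # Ordinal term: cumulative distributions over time bins should agree.
    cdf_p = torch.cumsum(log_p.exp(), dim=1).clamp(1e-6, 1 - 1e-6)
    cdf_q = torch.cumsum(q, dim=1).clamp(0.0, 1.0)
    ordinal = F.binary_cross_entropy(cdf_p, cdf_q)

    return w_kl * kl + w_or * ordinal
```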
Table 2 lists the quantitative results of the ablation experiments. The network structure with the attention module and intensity guidance, trained under the joint constraint of the KL divergence and the ordinal regression loss, i.e., the proposed method, achieves the best quantitative results.
Table 2. Ablation experimental quantitative results (RMSE, m)

| Scene | Without attention | Without intensity | KL + TV | OR + TV | Proposed |
| --- | --- | --- | --- | --- | --- |
| "N" and "J" | 0.7204 | 0.4510 | 0.6129 | 0.2432 | 0.1958 |
This paper uses a pre-upsampling strategy, i.e., the spatial resolution of the raw SPAD-array data is raised from 32×32 pixels to 128×128 pixels before the data enter the network. One visible effect of the upsampling is that the sparse point cloud becomes denser. Figure 5 compares the low-resolution point cloud, the point cloud produced by post-upsampling (the data are first processed by the network and then upsampled), and the point cloud produced by pre-upsampling: pre-upsampling increases the amount of information carried by the depth data and lets the network process more pixels, so the reconstructed pixels are spatially correlated and the edges are smooth.
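A minimal sketch of the pre-upsampling step is shown below; bilinear interpolation is an assumption, as this section does not name the exact interpolation scheme.

```python
import torch.nn.functional as F

def pre_upsample(spad_data, scale=4):
    """Sketch: raise the spatial resolution of the raw SPAD data
    (B, C, 32, 32) to (B, C, 128, 128) *before* the network, as in
    the pre-upsampling strategy described above."""
    return F.interpolate(spad_data, scale_factor=scale,
                         mode="bilinear", align_corners=False)
```

The post-upsampling alternative would instead call this after the network, leaving the network only 32×32 pixels to work with.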
Single-photon LiDAR imaging method based on sensor fusion network
doi: 10.3788/IRLA20210871
- Received Date: 2021-11-23
- Rev Recd Date: 2021-12-28
- Available Online: 2022-03-04
- Publish Date: 2022-02-28
Key words:
- LiDAR /
- single-photon imaging method /
- sensor fusion /
- SPAD array /
- convolutional neural network
Abstract: LiDAR systems with active illumination obtain depth information of the scene by using Single-Photon Avalanche Diode (SPAD) detectors to record the arrival times of photons reflected from the laser pulse. However, ambient light interferes with the measurements during the detection period. Sensor fusion is one of the effective methods for single-photon imaging. Recently, many data-driven methods based on intensity-LiDAR fusion have achieved gratifying results, but most of them use scanning LiDAR, whose depth acquisition is slow. The advent of the SPAD array overcomes this frame-rate limitation: the SPAD array allows multiple returned photons to be collected at the same time, which accelerates the information collection process. However, the spatial resolution of SPAD array detectors is typically low, and the detection process is also disturbed by ambient light. It is therefore necessary to break the inherent limitations of the SPAD array through an algorithm that separates the depth information from the noise. In this paper, for a SPAD array detector with an array size of 32×32 pixels, a convolutional neural network was proposed that can reconstruct a high-resolution, clean TCSPC histogram under the guidance of the intensity image. A multi-scale approach was adopted to extract input features, and the fusion of depth data and intensity data was further processed in the network based on the attention mechanism. In addition, a loss function combination suitable for TCSPC histogram processing networks was designed, in which the overall distribution of photons in the temporal dimension and the ordinal relationship between time bins are considered simultaneously. The proposed method successfully increases the spatial resolution of the depth data by a factor of 4; its efficacy is verified on real data, and it is superior to state-of-the-art methods both qualitatively and quantitatively.