Defocus projection three-dimensional measurement based on deep learning accurate phase acquisition

Zhao Yang; Fu Jia'an; Yu Haotian; Han Jing; Zheng Dongliang

doi:10.3788/IRLA20200012

The digital fringe projection three-dimensional (3D) measurement technology can generate a sinusoidal fringe pattern for 3D measurement by defocusing a binary fringe pattern. It can achieve extremely high projection speed and has great potential in the field of high-speed 3D measurement. However, the binary fringe pattern inevitably contains higher-order harmonics, resulting in a phase error introduced into the calculated phase, thereby reducing the accuracy of high-speed 3D measurement. A 3D measurement method for defocused projection based on deep learning accurate phase acquisition was proposed. The image feature processing capability based on deep learning algorithm can remove the phase errors introduced by higher-order harmonics. An end-to-end deep convolutional neural network from noise phase to precise phase was constructed by this method and the phase error introduced by higher-order harmonics was reduced. Finally, high-speed and accurate 3D measurement could be achieved by this method. Firstly, the theoretical analysis proved the feasibility of the proposed method. Then, simulation and experiments were performed to further verify the effectiveness and accuracy of the proposed method. Compared with the existing high-speed 3D measurement methods, the proposed method can ensure measurement speed while ensuring measurement accuracy.

HTML

0. 引　言

快速三维测量在先进制造、虚拟现实、安全防护等领域变得日益重要，数字光栅投影三维测量技术因其非接触、速度快、成本相对较低等优势，在快速三维测量领域表现了极大潜力^[1-4]。传统光栅投影法投影符合正弦分布光栅条纹，通常由频域法或者相移法计算测量所需相位信息。频域法可依据单帧光栅条纹计算相位，但是较难保持待测物体边界、复杂面型等细节信息^[5]；相移法依据至少三帧以上相移光栅条纹，时域内计算得到相位信息，能较好地保持待测物体细节^[6]。

数字投影设备利用数字微镜(Digital Micro-mirror Device，DMD)来投影光栅条纹，数字微镜中可以开和闭的微镜片表征对应像素，单个像素亮度强弱通过调整对应微镜片单位时间内开启时间长短来实现。投影“8”位正弦光栅条纹需要根据条纹灰度变化按照积分时间多次切换数字微镜开闭状态；而投影“1”位二值光栅条纹，数字微镜仅需保持开或闭两种状态，无需进行多次开闭状态切换，能够实现kHz（帧/秒）以上的投影速度。但投影二值光栅条纹不可避免会包含高次谐波，投影设备选择恰当离焦程度，离焦可由高斯低通滤波近似表征，即可滤除投影光栅包含的高次谐波，以生成测量所需正弦光栅条纹图像。基于二值光栅投影快速性，即可实现快速三维测量^[7]。

然而，投影设备通常需要较大离焦程度来滤除二值光栅所包含高次谐波，以计算得到精确相位信息。但是，较大离焦程度会减少生成条纹对比度，降低三维测量动态范围；轻微离焦程度能够保证较高的动态测量范围，但是相应高次谐波会引入相位误差，降低三维测量精度。通常有两种思路解决上述问题：（1）优化设计二值光栅，降低所需离焦程度，主要包括：脉宽调制（PWM）和抖动（dithering）、方波等设计方法。其中，PWM ^[8]和dithering ^[9]各自适用于较窄和较宽周期二值光栅设计，投影高频方波二值光栅，操作便捷且在不同离焦程度保持良好的一致性。但是，实际测量过程中通常难以精确调整到恰当离焦程度，尤其当所选相移算法步数较低情况下，不可避免会包含高次谐波，降低三维测量精度。（2）通过附加多帧光栅或者进行频域分析进行相位误差补偿。附加多帧光栅条纹，需要投影多帧相移二值光栅，制约了三维测量速度^[6]；引入频域变换方法，相对难以保持待测物体边界^[5]。基于上述，采用较低步数相移算法（例如：快速测量常用三步相移算法），投影设备处在不同离焦程度，均能实现精确相位获取，对快速三维测量具有重要意义。

文中对传统快速三维测量方法做了改进，提出一种基于深度学习精确相位获取的离焦投影三维测量方法。具体而言，在快速测量过程中引入基于深度学习的PDNet(Phase Denoising Network)算法，使用含噪声相位到理想相位的端到端方式来训练PDNet模型，训练完成后使用模型就能够在测量过程中得到精确相位图像。文中将三帧相移二值光栅离焦投影，采用三步相移算法计算得到的包裹相位图像作为输入，对应的理想相位图像通过多步相移算法获得，所提方法能够实现不同离焦下的精确相位获取，进而实现快速精确三维测量。

4. 结　论

在传统快速三维测量基础上，提出一种降低高次谐波引起相位噪声的快速三维测量方法。所提方法的明显优势在于，能够在不增加额外投影光栅帧数前提下，实现轻微离焦程度下精确相位信息获取，结合已标定系统参数，即可进行精确快速三维重建。与现有快速三维测量方法相比，该方法能够在不牺牲测量速度的前提下保证测量精度。采用该方法对模拟数据和真实墙面进行验证分析。结果表明，该方法对于不同周期相移条纹计算得到的相位图像均能得到高质量相位图像，且去噪效果稳定，证明该方法的有效性。此外，文中还对真实玩具模型进行验证，对比处理前后的重建效果，证明该方法的精确性。另外，在实际测量中，仅需使用数据集训练后得到的模型，无需修改任何参数信息，即可实现不同离焦程度下的精确三维重建。

Reference (14)

[1]	Zhang S. Recent progresses on real-time 3D shape measurement using digital fringe projection techniques [J]. Optics and Lasers in Engineering, 2010, 48(2): 149−158.
[2]	Hong Ziming, Ai Qingsong, Chen Kun. High precise 3D visual measurement based on fiber laser [J]. Infrared and Laser Engineering, 2018, 47(8): 0803011. (in Chinese)
[3]	Zheng D, Da F, Kemao Q, et al. Phase error analysis and compensation for phase shifting profilometry with projector defocusing [J]. Applied Optics, 2016, 55(21): 5721.
[4]	Dai Meiling, Yang Fujun, He Xiaoyuan. Three-dimensional shape measurement of objects with discontinuities by dual-frequency color fringe projection [J]. Optics and Precision Engineering, 2013, 21(1): 11−16. (in Chinese)
[5]	Su X, Chen W. Fourier transform profilometry: a review [J]. Optics and Lasers in Engineering, 2001, 35(5): 263−284.
[6]	Zuo C, Feng S, Huang L, et al. Phase shifting algorithms for fringe projection profilometry: A review [J]. Optics and Lasers in Engineering, 2018, 109: 23−59.
[7]	Song Z. Flexible 3D shape measurement using projector defocusing: extended measurement range [J]. Optics Letters, 2010, 35(7): 934−6.
[8]	Zuo C, Chen Q, Feng S, et al. Optimized pulse width modulation pattern strategy for three-dimensional profilometry with projector defocusing [J]. Applied Optics, 2012, 51(19): 4477−4490.
[9]	Zhao Liwei, Da Feipeng, Zheng Dongliang. Method for binary grating generation using defocused projection for three-dimensional measurement [J]. Acta Optica Sinica, 2016, 36(8): 0812005. (in Chinese)
[10]	Zheng D, Da F. Self-correction phase unwrapping method based on gray-code light [J]. Optics and Lasers in Engineering, 2012, 50(8): 1130−1139.
[11]	Zhang Lei, Jiao Xiaoxue, Zhou Liqiu, et al. Three-dimensional shape acquisition method by integral imaging based on corresponding points [J]. Chinese Optics, 2015, 8(1): 45−50. (in Chinese)
[12]	Sun Riming, Lin Tingting, Ji Lin, et al. A general linear imaging modeling method for space unstability targets and parameter optimization of linear laser radar [J]. Optics and Precision Engineering, 2018, 26(6): 1524−1532. (in Chinese)
[13]	An Dong, Chen Li, Ding Yifei, et al. Optical system model and calibration of grating projection phase method [J]. Chinese Optics, 2015, 8(2): 248−254. (in Chinese)
[14]	Romera E, Alvarez J M, Bergasa L M, et al. ERFNet: Efficient residual factorized ConvNet for real-time semantic segmentation [J]. IEEE Transactions on Intelligent Transportation Systems, 2018, 19(1): 263−272.

		Type	Out-F	Out-Res
ENCODER	1	Downsampler block	16	512×256
	2	Downsampler block	64	256×128
	3−7	5 × Non-bt-1D	64	256×128
	8	Downsampler block	128	128×64
	9	Non-bt-1D(dilated 2)	128	128×64
	10	Non-bt-1D(dilated 4)	128	128×64
	11	Non-bt-1D(dilated 8)	128	128×64
	12	Non-bt-1D(dilated 16)	128	128×64
	13	Non-bt-1D(dilated 2)	128	128×64
	14	Non-bt-1D(dilated 4)	128	128×64
	15	Non-bt-1D(dilated 8)	128	128×64
	16	Non-bt-1D(dilated 16)	128	128×64
DECODER	17	Deconvolution (unsampling)	64	256×128
	18−19	2 × Non-bt-1D	64	256×128
	20	Deconvolution (unsampling)	16	512×256
	21−22	2 × Non-bt-1D	16	512×256
	23	Deconvolution (unsampling)	C	1 024×512

Defocus projection three-dimensional measurement based on deep learning accurate phase acquisition

doi: 10.3788/IRLA20200012

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views