基于改进U-Net网络的相位解包裹技术研究

徐瑞书; 罗笑南; 沈瑶琼; 郭创为; 张文涛; 管钰晴; 傅云霞; 雷李华

doi:10.3788/IRLA20230564

基于改进U-Net网络的相位解包裹技术研究

doi: 10.3788/IRLA20230564

徐瑞书^{1, 2, 3,},
罗笑南³,
沈瑶琼^{1, 2},
郭创为^{1, 2},
张文涛³,
管钰晴^{1, 2},
傅云霞^{1, 2},
雷李华^{1, 2,}

1.
上海市计量测试技术研究院，上海 201203
2.
上海在线检测与控制技术重点实验室，上海 201203
3.
桂林电子科技大学计算机与信息安全学院，广西桂林 541004

基金项目: 国家重点研发计划项目(2021YFF0603300)；上海市市场监督管理局项目(D00RJ2310)；上海市在线检测与控制技术重点实验室基金项目(A04202223003)

详细信息

作者简介:
徐瑞书，女，硕士生，主要从事纳米测量方面的研究

罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

通讯作者: 雷李华，男，高级工程师，博士，主要从事纳米测量方面的研究。

中图分类号: TH741

Research on phase unwrapping technology based on improved U-Net network

Xu Ruishu^{1, 2, 3
,},
Luo Xiaonan³,
Shen Yaoqiong^{1, 2},
Guo Chuangwei^{1, 2},
Zhang Wentao³,
Guan Yuqing^{1, 2},
Fu Yunxia^{1, 2},
Lei Lihua^{1, 2
,}

1.
Shanghai Institute of Measurement and Testing Technology, Shanghai 201203, China
2.
Shanghai Key Laboratory of Online Testing and Control Technology, Shanghai 201203, China
3.
School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China

Funds: National Key Research and Development Program of China (2021YFF0603300); National Natural Science Foundation of Shanghai (21ZR1483100); Supported by Shanghai Key Laboratory of Online Test and Control Technology (A04202223003)

摘要: 提出了一种结合深度学习的空间相位解包裹方法，采用基于改进U-Net网络的编码器-解码器架构，同时加入包含双向长短期记忆网络（BILSTM）的CBiLSTM模块，并且结合注意力机制，避免了典型卷积神经网络学习全局空间依赖关系的固有缺陷的同时增强了深度学习模型对相位解包裹任务中的关键信息的关注能力。通过大量的模拟数据，验证了文中方法在严重噪声(SNR=0)、不连续条件和混叠条件下的鲁棒性，在以上三种情况下，同其他深度学习网络模型进行对比，文中所提出的网络模型的归一化均方根误差（NRMSE）分别为0.75%、1.81%和1.68%；结构相似性指数（SSIM）分别为0.98、0.92和0.94；峰值信噪比(PSNR)分别为40.87、32.56、37.38；同时计算时间显著减少，适合应用到需要快速准确的空间相位解包裹任务中去。通过实际测量数据，验证了文中提出网络模型的可行性。该研究将双向长短期记忆网络（BILSTM）和注意力机制同时引入光学相位解包裹问题中，为解决复杂相位场的解包裹提供了新的思路和方案。
- 相位解包裹 /
- 深度学习 /
- 注意力机制 /
- 长短期记忆网络 /
- 卷积神经网络
Abstract: Objective Objective Phase Measurement Deflectometry (PMD) is widely employed in free-form surface transmission wavefront detection due to its simplicity, high accuracy, and broad detection range. Achieving high-precision phase acquisition is a critical step in the measurement and detection process. The phase unwrapping task, crucial in optics, plays a pivotal role in optical interferometry, magnetic resonance imaging, fringe projection profilometry (FPP), and other fields ^[1-4]. The challenge lies in recovering a continuously varying true phase signal from the observed wrapped phase signal within the range of [−π, π). While the ideal phase unrolling involves adding or subtracting 2π at each pixel based on the phase difference between adjacent pixels, practical applications face challenges such as noise and phase discontinuity, leading to poles in the wrapped phase ^[5]. These poles result in accumulated computational errors during the unwrapping process, causing phase unwrapping failures. Various methods are employed to unwrap and obtain the real phase distribution. To address these challenges, this paper proposes a phase unwrapping algorithm based on an improved U-Net network. Methods During the model training process, a composite loss function is defined to train the network based on the specific problem of spatial phase unwrapping. To address these challenges, this paper proposes a phase unwrapping algorithm based on an improved U-Net network. This algorithm utilizes U-Net as the basic network, integrates the CBiLSTM module for modeling time series, introduces an attention mechanism for enhanced generalization, and explores optimized loss functions. The proposed network model is validated through simulated and real datasets, showcasing its outstanding performance under noise, discontinuity, and aliasing conditions.The introduction of the attention mechanism enables better capture of global spatial relationships, while CBiLSTM effectively captures and stores long-term dependencies through memory unit structures. Memory units selectively remember and forget parts of the input signal information, enhancing their ability to handle long sequence data modeling tasks. The paper defines a composite loss function tailored to the spatial phase unwrapping problem during the model training process.Comparative experiments between the proposed network and classic models, such as U-Net ^[20], Res-UNet ^[21], and methods by Wang ^[13] and Perera et al. ^[19], demonstrate the robustness of the proposed network under severe noise and discontinuities. Additionally, it showcases computational efficiency in performing spatial phase unwrapping tasks. Results and Discussions Fig.10 shows the comparison between the predicted absolute phase and the real phase output by the wrapped phase after training the network model proposed in this article. Through the construction of the encoder-decoder model, the introduction of the CBiLSTM module and the attention mechanism module, and the composite The definition of the loss function, after comparing with other models, verifies the improvement in accuracy and reduction in training of the network model proposed in this article in the three situations mentioned above. Through simulation experiments and verification, by enhancing the deep learning model's ability to pay attention to key phase information, the network model proposed in this article can improve the accuracy and robustness of phase unwrapping, and promote further development in fields such as optical measurement and phase imaging. Conclusions This paper addresses the challenge of wrapped phase unwrapping by introducing a novel convolutional architecture framed as a regression problem. The proposed network incorporates several enhancements within the encoder-decoder framework, notably featuring a CBiLSTM module and a soft attention mechanism. Comparative analyses with existing phase unwrapping methods demonstrate the network's remarkable performance in achieving precise phase unwrapping, even in severe noise, discontinuities, and aliasing. Notably, the network showcases exceptional unwrapping capabilities without necessitating extensive training on large datasets. Moreover, it exhibits significantly reduced computational time, rendering it well-suited for tasks requiring accuracy and expeditious phase unwrapping.Validation experiments conducted on real laboratory datasets further affirm the outstanding performance of the proposed network. The introduced model empowers phase unwrapping tasks under challenging conditions, such as severe noise, discontinuities, and aliasing, surpassing the limitations of traditional methods. Comparative assessments with other deep learning models reveal a normalized root mean square error (NRMSE) as low as 0.75%. The advancement in unwrapped phase technology holds substantial significance for optical free-form surface detection, contributing to enhanced measurement accuracy, precise control of optical parameters, optimization of optical design, and quality assurance in optical manufacturing and detection processes.
- phase unwrapping /
- deep learning /
- attention mechanism /
- long short-term memory network /
- convolutional neural network

图 1 （a）绝对相位(真实数据集)；（b）包裹相位(真实数据集)；（c）绝对相位(模拟数据集1,2)；（d）包裹相位(模拟数据集1)；（e）包裹相位(模拟数据集2)；（f）包裹相位(模拟数据集3)

Figure 1. （a） Absolute phase (real data set); （b） Wrapped phase (real data set); （c） Absolute phase (simulated data set 1, 2); （d） Wrapped phase (simulated data set 1); （e） Wrapped phase (simulated data set 2); （f） Wrapped phase (simulated data set 3)

下载: 全尺寸图片幻灯片

图 2 真实数据与模拟数据香浓熵的比较

Figure 2. Comparison of Shannon entropy between real and simulated dataset

下载: 全尺寸图片幻灯片

图 3 网络模型图

Figure 3. Network model diagram

下载: 全尺寸图片幻灯片

图 4 CBiLSTM模块示意图

Figure 4. Schematic diagram of CBiLSTM module

下载: 全尺寸图片幻灯片

图 5 注意力门模块示意图

Figure 5. Attention gate schematic

下载: 全尺寸图片幻灯片

图 6 各个网络模型在不同噪声情况下的性能

Figure 6. The performance of each network model under different noise conditions

下载: 全尺寸图片幻灯片

图 7 各个网络模型在不连续情况下的性能

Figure 7. The performance of each network model in the case of discontinuity

下载: 全尺寸图片幻灯片

图 8 各个网络模型在混叠情况下的性能

Figure 8. Performance of individual network models in case of aliasing

下载: 全尺寸图片幻灯片

图 9 各个网络模型在真实数据集中的泛化能力

Figure 9. The generalization ability of each network model in the real data set

下载: 全尺寸图片幻灯片

图 10 各个网络模型不同性能比较

Figure 10. Different performance comparison of each network model

下载: 全尺寸图片幻灯片

图 11 文中提出的网络模型预测相位与真实绝对相位在上述三种情况下的对比

Figure 11. Comparison between the predicted phase and the true absolute phase of the network model proposed in this article in the above three situations

下载: 全尺寸图片幻灯片

图 12 实验测试系统

Figure 12. Experimental test system

下载: 全尺寸图片幻灯片

图 13 数据采集图

Figure 13. Data acquisition diagram

下载: 全尺寸图片幻灯片

图 14 真实数据集上的水平竖直方向的相位解包裹

Figure 14. Horizontal and vertical phase unwrapping on real data sets

下载: 全尺寸图片幻灯片

图 15 透镜存在灰尘和划痕的实验情况图

Figure 15. Picture of the experimental situation with dust and scratches on the lens

下载: 全尺寸图片幻灯片

图 16 复杂真实数据集上的相位解包裹

Figure 16. Phase unwrapping on complex real datasets

下载: 全尺寸图片幻灯片

表 1 各个网络模型在误差方面的性能比较

Table 1. Performance comparison of various network models in terms of errors

Network model	Noise NRMSE	Discontinuous NRMSE	Aliased NRMSE
Our Net (Loss=L_meaw)	2.05%	2.93%	2.64%
Our Net (Loss=L_error)	1.49%	2.99%	2.35%
U-Net^[20]	14.03%	12.59%	13.48%
Wang et al^[13]	13.27%	11.6%	12.24%
Res-UNet^[21]	13.26%	14.69%	12.98%
Our Net	1.12%	1.81%	1.68%
Perera et al^[19]	1.46%	2.09%	1.87%

下载: 导出CSV

表 2 各个网络模型在时间消耗方面的性能比较

Table 2. Performance comparison of various network models in terms of time consumption

Network model	Noise		Discontinuous		Aliased
Network model	Time/s	Total time/s	Time/s	Total time/s	Time/s	Total time/s
U-Net^[20]	7	2625	6	2406	6	2124
Wang et al^[13]	11	3256	12	3996	11	3080
Res-UNet^[21]	16	6384	21	6153	17	6443
Our Net	15	1275	18	2250	13	2860
Perera et al^[19]	18	1746	18	2574	17	4709

下载: 导出CSV

表 3 噪声数据集消融实验的定量比较

Table 3. Quantitative comparison of ablation experiments on noisy datasets

Serial number	Based on improved U-Net	Model CBiLSTM	Soft attention	NRMSE	PSNR	SSIM
1	√			10.06%	13.8	0.46
2	√	√		0.92%	35.78	0.96
3	√		√	1.16%	34.26	0.94
4	√	√	√	0.75%	40.87	0.98

下载: 导出CSV

[1]	Aiello L, Riccio D, Ferraro P, et al. Green's formulation for robust phase unwrapping in digital holography [J]. Optics and Lasers in Engineering, 2007, 45(6): 750-755. doi: 10.1016/j.optlaseng.2006.10.002
[2]	Jenkinson M. Fast, automated, N dimensional phase unwrapping algorithm [J]. Magnetic Resonance in Medicine, 2003, 49(1): 193-197. doi: 10.1002/mrm.10354
[3]	He Z, Cui J, Tan J, et al. Discrete fringe phase unwrapping algorithm based on Kalman motion estimation for high-speed I/Q-interferometry [J]. Optics Express, 2018, 26(7): 8699-8708. doi: 10.1364/OE.26.008699
[4]	Zuo C, Qian J, Feng S, et al. Deep learning in optical metrology: a review [J]. Light: Science & Applications, 2022, 11(1): 39.
[5]	Li Bo, Ma Suodong. Path-independent phase unwrapping method using zonal reconstruction technique [J]. Infrared and Laser Engineering, 2016, 45(2): 0229006. (in Chinese)
[6]	Chen J M, Wang Y H, Dong Z, et al. A phase unwrapping method based on attention-deficitresidual network [J]. Laser Journal, 2022, 43(9): 60-65. (in Chinese)
[7]	Abdul-Rahman H, Gdeisat M, Burton D, et al. Fast three-dimensional phase-unwrapping algorithm based on sorting by reliability following a non-continuous path [C]//Optical Measurement Systems for Industrial Inspection IV. SPIE, 2005, 5856: 32-40.
[8]	Lu Y, Wang X, Zhang X. Weighted least-squares phase unwrapping algorithm based on derivative variance correlation map [J]. Optik-International Journal for Light and Electron Optics, 2007, 118(2): 62-66. doi: 10.1016/j.ijleo.2006.01.006
[9]	Daniel Holden, Saito J, Komura T. A deep learning framework for character motion synthesis and editing [J]. ACM Journals, 2016, 35(4): 1-11.
[10]	Jin L, Dai Q, Zhang C, et al. Deep residual network based optical phase unwrapping [J]. Scientific Reports, 2017, 7(1): 10581. doi: 10.1038/s41598-017-11421-8
[11]	Spoorthi G E, Gorthi R K S, Gorthi S. PhaseNet 2.0: Phase unwrapping of noisy data based on deep learning approach [J]. IEEE Transactions on Image Processing, 2020, 29: 4862-4872. doi: 10.1109/TIP.2020.2977213
[12]	Zhang T, Jiang S, Zhao Z, et al. Rapid and robust two-dimensional phase unwrapping via deep learning [J]. Optics Express, 2019, 27(16): 23173-23185. doi: 10.1364/OE.27.023173
[13]	Wang K, Li Y, Qian K, et al. One-step robust deep learning phase unwrapping [J]. Optics Express, 2019, 27(10): 15100-15115.
[14]	Zhou L. PU-GAN: A one-step 2D InSAR phase unwrapping based on conditional generative adversarial network [C]//IEEE Trans Geosci Remote Sens, 2022, 60: 1-10.
[15]	Xu M. PU-M-Net for phase unwrapping with speckle reduction and structure protection in ESPI [J]. Opt Lasers Eng, 2022, 151: 106824. doi: 10.1016/j.optlaseng.2021.106824
[16]	Li Z, Liu F, Yang W, et al. A survey of convolutional neural networks: analysis, applications, and prospects [J]. IEEE Transactions on Neural Networks and Learning Systems, 2021, 33(12): 6999-7019.
[17]	Cao J, Wang J. Global asymptotic stability of a general class of recurrent neural networks with time-varying delays [J]. IEEE Transactions on Circuits & Systems I Fundamental Theory & Applications, 2003, 50(1): 34-44. doi: 10.1109/TCSI.2002.807494
[18]	Ryu K, Gho S M, Nam Y, et al. Development of a deep learning method for phase unwrap** MR images [C]//Proc Int Soc Magn Reson Med, 2019, 27: 4707.
[19]	Perera M V, De Silva A. A joint convolutional and spatial quad-directional LSTM network for phase unwrapping [C]//ICASSP 2021IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2021: 4055-4059.
[20]	Zhang J. Phase unwrapping in optical metrology via denoised and convolutional segmentation networks [J]. Opt Express, 2019, 27(10): 14903. doi: 10.1364/OE.27.014903
[21]	Wang K. One-step robust deep learning phase unwrapping [J]. Opt Express, 2019, 27(10): 15100. doi: 10.1364/OE.27.015100
[22]	Wu Jie. Determination of weights for ultimate cross efficiency using Shannon entropy [J]. Expert Systems with Applications, 2011, 38(5): 5162-5165. doi: 10.1016/j.eswa.2010.10.046

[1]	郝建新, 王力. 基于红外温度序列的电路板故障诊断研究 . 红外与激光工程, 2023, 52(4): 20220492-1-20220492-12. doi: 10.3788/IRLA20220492
[2]	夏信, 何传亮, 吕英杰, 王守志, 张博, 陈晨, 陈海鹏, 李美萱. 深度学习驱动的智能电网运行图像数据压缩技术 . 红外与激光工程, 2022, 51(12): 20220097-1-20220097-6. doi: 10.3788/IRLA20220097
[3]	齐悦, 董云云, 王溢琴. 基于汇聚级联卷积神经网络的旋转人脸检测方法 . 红外与激光工程, 2022, 51(12): 20220176-1-20220176-8. doi: 10.3788/IRLA20220176
[4]	宦克为, 李向阳, 曹宇彤, 陈笑. 卷积神经网络结合NSST的红外与可见光图像融合 . 红外与激光工程, 2022, 51(3): 20210139-1-20210139-8. doi: 10.3788/IRLA20210139
[5]	李保华, 王海星. 基于增强卷积神经网络的尺度不变人脸检测方法 . 红外与激光工程, 2022, 51(7): 20210586-1-20210586-8. doi: 10.3788/IRLA20210586
[6]	刘瀚霖, 辛璟焘, 庄炜, 夏嘉斌, 祝连庆. 基于卷积神经网络的混叠光谱解调方法 . 红外与激光工程, 2022, 51(5): 20210419-1-20210419-9. doi: 10.3788/IRLA20210419
[7]	庄子波, 邱岳恒, 林家泉, 宋德龙. 基于卷积神经网络的激光雷达湍流预警 . 红外与激光工程, 2022, 51(4): 20210320-1-20210320-10. doi: 10.3788/IRLA20210320
[8]	庞忠祥, 刘勰, 刘桂华, 龚泿军, 周晗, 罗洪伟. 并行多特征提取网络的红外图像增强方法 . 红外与激光工程, 2022, 51(8): 20210957-1-20210957-9. doi: 10.3788/IRLA20210957
[9]	王向军, 欧阳文森. 多尺度循环注意力网络运动模糊图像复原方法 . 红外与激光工程, 2022, 51(6): 20210605-1-20210605-9. doi: 10.3788/IRLA20210605
[10]	刘云朋, 霍晓丽, 刘智超. 基于深度学习的光纤网络异常数据检测算法 . 红外与激光工程, 2021, 50(6): 20210029-1-20210029-6. doi: 10.3788/IRLA20210029
[11]	张旭, 于明鑫, 祝连庆, 何彦霖, 孙广开. 基于全光衍射深度神经网络的矿物拉曼光谱识别方法 . 红外与激光工程, 2020, 49(10): 20200221-1-20200221-8. doi: 10.3788/IRLA20200221
[12]	刘鹏飞, 赵怀慈, 李培玄. 对抗网络实现单幅RGB重建高光谱图像 . 红外与激光工程, 2020, 49(S1): 20200093-20200093. doi: 10.3788/IRLA20200093
[13]	薛珊, 张振, 吕琼莹, 曹国华, 毛逸维. 基于卷积神经网络的反无人机系统图像识别方法 . 红外与激光工程, 2020, 49(7): 20200154-1-20200154-8. doi: 10.3788/IRLA20200154
[14]	高泽宇, 李新阳, 叶红卫. 流场测速中基于深度卷积神经网络的光学畸变校正技术 . 红外与激光工程, 2020, 49(10): 20200267-1-20200267-10. doi: 10.3788/IRLA20200267
[15]	韩旭, 王霖, 伏燕军. 双频外差结合相位编码的相位解包裹方法 . 红外与激光工程, 2019, 48(9): 913003-0913003(8). doi: 10.3788/IRLA201948.0913003
[16]	张秀, 周巍, 段哲民, 魏恒璐. 基于卷积稀疏自编码的图像超分辨率重建 . 红外与激光工程, 2019, 48(1): 126005-0126005(7). doi: 10.3788/IRLA201948.0126005
[17]	刘天赐, 史泽林, 刘云鹏, 张英迪. 基于Grassmann流形几何深度网络的图像集识别方法 . 红外与激光工程, 2018, 47(7): 703002-0703002(7). doi: 10.3788/IRLA201847.0703002
[18]	姚旺, 刘云鹏, 朱昌波. 基于人眼视觉特性的深度学习全参考图像质量评价方法 . 红外与激光工程, 2018, 47(7): 703004-0703004(8). doi: 10.3788/IRLA201847.0703004
[19]	张腊梅, 陈泽茜, 邹斌. 基于3D卷积神经网络的PolSAR图像精细分类 . 红外与激光工程, 2018, 47(7): 703001-0703001(8). doi: 10.3788/IRLA201847.0703001
[20]	郭强, 芦晓红, 谢英红, 孙鹏. 基于深度谱卷积神经网络的高效视觉目标跟踪算法 . 红外与激光工程, 2018, 47(6): 626005-0626005(6). doi: 10.3788/IRLA201847.0626005

点击查看大图

图(16) / 表(3)

计量

文章访问数: 117
HTML全文浏览量: 30
PDF下载量: 44
被引次数: 0

全文HTML

0. 引　言

光学偏折术(Phase measurement deflectometry, PM-D)因其结构简单、检测精度高、范围大等优点被用于自由曲面透射式波前检测，高精度的相位获取是测量检测过程的关键步骤之一。相位解包裹是光学领域中一项重要而具有挑战性的任务，它在光学干涉测量、磁共振成像、条纹投影轮廓测量等领域中扮演着关键的角色^[1−4]。其挑战在于从观测到的[−$ \pi $,$ \pi $)范围内的包裹相位信号中恢复成连续变化的真实相位信号。

理想情况下，相位展开可以根据相邻像素之间的相位差，通过在每个像素处加减2$ \pi $来完成。然而，在实践中由于包裹相位存在噪声、相位不连续等问题，造成了包裹相位中极点的存在^[5]。极点会导致解包裹路径中累积计算误差，从而导致相位展开失败。为了得到真实的相位分布，就需要采用各种方法对包裹相位进行解包裹处理。

现有的相位解包裹方法包括路径跟踪算法和路径无关算法^[6]。路径跟踪算法，如质量导向图相位展开（Quality guided phase unwrap, QGPU）^[6]算法和枝切算法^[7]（Goldstein’s branch cut algorithm），通过按特定路径展开相位，根据质量判断出最佳路径。尽管路径跟踪算法的计算效率相对较高，但是它对噪声缺少鲁棒性，限制了其应用场景；路径无关算法，如傅里叶变换法^[5]和最小二乘法^[8]，虽然对于噪声的鲁棒性较好，但是该算法计算复杂度高，迭代收敛慢、运算效率低，对于大规模相位数据和高分辨率图像，需要较长的计算时间和较大的计算资源。实际上，传统空间相位解包裹的目的都是尽可能地避免无效点的负面影响，主要适用于非严重噪声和一些离散的不连续点。但是在一些极端情况下，例如存在严重噪声或局部孤立的不连续区域，传统方法将变得无效。

近年来，深度学习算法得到了普及，已经被广泛地应用于目标检测和图像分类任务中^[9]。遵循这一发展趋势，研究试图应用深度学习来解决相位解包裹问题。

其中，Jin等人首先提出使用深度学习来解决光学成像中的相位展开问题^[10]。这种使用深度神经网络学习从输入空间到输出空间的映射关系的想法，使得解决空间相位解包裹问题成为可能； Spoorthi G E^[11]和Zhang T^[12]等人将相位展开问题重新表述为语义分割任务，其中训练全卷积网络(Fully convolutional network, FCN)来预测每个展开相位的包裹计数。然而，全卷积网络在下采样时存在损失细节信息问题，且网络收敛速度较慢，往往需要大量的数据集，从而限制了其在实际中的应用；Wang等人提出基于U-Net的语义分割模型（Deep learning phase unwrapping, DLPU）^[13], 通过引入跳跃连接和增加特征通道的方式，通过在模拟数据集上的测试，在相位解包裹任务中相比全卷积网络取得了更好的性能；2022年，Zhou^[14]等设计一种基于对抗神经网络的解包裹技术，以提高传统方法的鲁棒性和有效性；同年，Xu^[15]等人提出了MNet网络，通过丰富的跳跃连接结构，促进浅层信息和深层特征的融合，同时使用结构损失函数。然而，由于卷积神经网络(CNN)中的卷积操作和池化操作仅关注局部窗口内的像素^[16]，忽略了图像不同区域之间的全局空间关系和整个图像的全局结构。由于大多数真实世界的相位图像都包含一定的空间结构，因此在学习从包裹相位到绝对相位的映射时，对这种全局空间关系进行建模是至关重要的。

循环神经网络(Recurrent neural network, RNN)^[17]是一种可以对时间序列内的上下文关系进行建模的神经网络。Ryu等人首先尝试使用卷积和传统循环神经网络的组合在MRI图像中执行相位展开^[18]。然而，这项研究并没有做定量分析；Perera等人^[19] 虽然考虑到噪声情况的解包裹相位，但是缺乏对极端条件(像不连续与混叠)下的研究。

针对上述问题，文中提出了一种基于改进U-Net网络的相位解包裹算法。该算法以U-Net网络作为基础网络，添加可以对时间序列建模的CBiLSTM模块；同时引入注意力机制，增加模型的泛化能力和可解释性；通过对损失函数的对比与改进，找出最适合应用于本研究的损失函数。最后，将提出的网络模型同时经过模拟数据集和真实数据集验证，证明其在噪声、不连续、混叠三种情况下的优秀性能。

注意力机制的引入，可以更好地捕捉图像的全局空间关系；CBiLSTM通过记忆单元结构，能够有效地捕捉和存储长期依赖关系。相比于传统神经网络，记忆单元可以选择性地记住和忘记输入信号的部分信息，从而能够更好地处理长序列数据的建模任务。

在模型训练过程中，根据空间相位解包裹这一特定问题，定义了一个复合损失函数来训练网络。

将文中提出的网络同U-Net^[20]、Res-UNet^[21]等经典网络模型还有Wang^[13]、Perera等人^[19]提出的方法做比较实验，验证文中所提出的网络对严重噪声和不连续的条件具有很强的鲁棒性，并且在执行空间相位解包裹任务时具有很高的计算效率。

4. 结论

文中针对包裹相位展开问题，通过将其表述为回归问题，提出了一种新的卷积架构，该架构包含基于编码器-解码器架构的一系列改造，包括添加CBiLSTM模块，注意力机制模块等。与现有的几种相位展开方法进行比较，发现该网络在不需要大规模数据集训练的情况下，即使在严重噪声条件下、不连续、混叠等情况下也能获得较优秀的相位展开性能。此外，该网络执行此任务平均花费的计算时间显着减少，使其成为需要准确和快速相位展开任务的理想选择。同时在实验室真实数据集上面进行验证实验，发现该网络依旧有优秀的性能。文中提出的网络模型使传统方法无法解决的严重噪声、不连续与混叠情况下的相位解包裹任务变成可能，同时通过与其他深度学习模型的精度对比，归一化均方根误差低至0.75%。解包裹相位技术对光学自由曲面的检测意义非常重要，不仅体现在提高测量准确性、精确控制光学参数以及优化光学设计等方面，而且对于光学制造和检测的质量保证和性能提升具有重要作用。

参考文献 (22)

姓名
邮箱
手机号码
标题
留言内容
验证码

留言板

基于改进U-Net网络的相位解包裹技术研究

doi: 10.3788/IRLA20230564

作者简介:
徐瑞书，女，硕士生，主要从事纳米测量方面的研究

罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

通讯作者: 雷李华，男，高级工程师，博士，主要从事纳米测量方面的研究。

Research on phase unwrapping technology based on improved U-Net network

计量

基于改进U-Net网络的相位解包裹技术研究

doi: 10.3788/IRLA20230564

1. 上海市计量测试技术研究院，上海 201203

2. 上海在线检测与控制技术重点实验室，上海 201203

3. 桂林电子科技大学计算机与信息安全学院，广西桂林 541004

作者简介:
徐瑞书，女，硕士生，主要从事纳米测量方面的研究

罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

通讯作者: 雷李华，男，高级工程师，博士，主要从事纳米测量方面的研究。

English Abstract

Research on phase unwrapping technology based on improved U-Net network

1. Shanghai Institute of Measurement and Testing Technology, Shanghai 201203, China

2. Shanghai Key Laboratory of Online Testing and Control Technology, Shanghai 201203, China

3. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China

全文HTML

1.1. 数据集的生成

1.2. 数据集的评价

1.3. 网络模型的效果评价指标

2.1. CBiLSTM模块

2.2. 注意力机制模块

2.3. 损失函数

3.1. 模拟实验

3.2. 消融实验

3.3. 测量实验

目录

留言板

基于改进U-Net网络的相位解包裹技术研究

doi: 10.3788/IRLA20230564

作者简介: 徐瑞书，女，硕士生，主要从事纳米测量方面的研究 罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

通讯作者: 雷李华，男，高级工程师，博士，主要从事纳米测量方面的研究。

Research on phase unwrapping technology based on improved U-Net network

计量

出版历程

基于改进U-Net网络的相位解包裹技术研究

doi: 10.3788/IRLA20230564

1. 上海市计量测试技术研究院，上海 201203 2. 上海在线检测与控制技术重点实验室，上海 201203 3. 桂林电子科技大学 计算机与信息安全学院，广西 桂林 541004

作者简介: 徐瑞书，女，硕士生，主要从事纳米测量方面的研究 罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

通讯作者: 雷李华，男，高级工程师，博士，主要从事纳米测量方面的研究。

English Abstract

Research on phase unwrapping technology based on improved U-Net network

1. Shanghai Institute of Measurement and Testing Technology, Shanghai 201203, China 2. Shanghai Key Laboratory of Online Testing and Control Technology, Shanghai 201203, China 3. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China

全文HTML

1.1. 数据集的生成

1.2. 数据集的评价

1.3. 网络模型的效果评价指标

2.1. CBiLSTM模块

2.2. 注意力机制模块

2.3. 损失函数

3.1. 模拟实验

3.2. 消融实验

3.3. 测量实验

目录

作者简介:
徐瑞书，女，硕士生，主要从事纳米测量方面的研究

罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

1. 上海市计量测试技术研究院，上海 201203

2. 上海在线检测与控制技术重点实验室，上海 201203

3. 桂林电子科技大学计算机与信息安全学院，广西桂林 541004

作者简介:
徐瑞书，女，硕士生，主要从事纳米测量方面的研究

罗笑南，男，教授，博士，主要从事计算机辅助几何学方面的研究

1. Shanghai Institute of Measurement and Testing Technology, Shanghai 201203, China

2. Shanghai Key Laboratory of Online Testing and Control Technology, Shanghai 201203, China

3. School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin 541004, China