Conditional random field classification method based on hyperspectral-LiDAR fusion

Wang Leiguang; Geng Ruozheng; Dai Qinling; Wang Jun; Zheng Chen; Fu Zhitao

doi:10.3788/IRLA20210112

Interpretation based on a single remotely sensed data source may suffer from inaccurate boundaries and low classification accuracy. Integrating hyperspectral and LiDAR data offers a way to improve classification performance, but appropriately handling the considerable heterogeneity between the two data types remains a challenge. In this paper, a conditional random field classification method is proposed to address this problem by jointly taking into account the heterogeneity of the fused spectral-spatial-height features and the co-occurrence of class labels. First, morphological features are extracted from each of the two data types, and a graph model and training samples are jointly used to fuse the morphological and spectral features. The fused features are fed into a support vector machine classifier to obtain initial classification results with probabilistic outputs. Then, based on the fused features, a local heterogeneity value is calculated to measure the essential difference between the classes of neighboring pixels. Meanwhile, a class co-occurrence matrix, whose elements quantify the spatial relationship between classes, is also obtained. Finally, a conditional random field framework integrates the initial classification results, the local heterogeneity information, and the class co-occurrence matrix, and the final classification results are obtained by inference on two objective functions. In this process, the weight between two neighboring pixels is defined as a monotonically decreasing function of the normalized Euclidean distance between their fused features, so that object boundaries are regularized by giving a smaller weight to pairs with different labels and distinct features. Similarly, giving a small weight to class pairs with a strong spatial relationship maintains class pairs with stable spatial relations.
The method was validated on the Houston and Gaofeng forest farm data sets. The overall accuracy of the proposed method reached 94.00% and 92.84%, respectively, and the "pepper and salt" effect in the initial classification results was significantly reduced. These results indicate the effectiveness of the proposed method.
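
The neighbor weighting described in the abstract can be illustrated with a minimal sketch. The abstract only states that the weight is a monotonically decreasing function of the normalized Euclidean distance between fused features; the exponential form, the normalization by the largest neighbor distance, and the names `pairwise_weights` and `scale` are assumptions made here for illustration and are not taken from the paper.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two feature vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def pairwise_weights(features, scale=1.0):
    """features: H x W grid (nested lists) of fused feature vectors.

    Returns {((r, c), (r2, c2)): weight} for every 4-connected neighbor pair
    (each pair is listed once, from the upper/left pixel).
    """
    h, w = len(features), len(features[0])
    dist = {}
    for r in range(h):
        for c in range(w):
            for r2, c2 in ((r, c + 1), (r + 1, c)):  # right and down neighbors
                if r2 < h and c2 < w:
                    dist[((r, c), (r2, c2))] = euclidean(
                        features[r][c], features[r2][c2])
    d_max = max(dist.values(), default=0.0)
    # Assumption: normalize distances to [0, 1], then apply exponential decay,
    # which is one possible monotonically decreasing function.
    return {pair: math.exp(-scale * (d / d_max if d_max > 0 else 0.0))
            for pair, d in dist.items()}

# Hypothetical 1 x 3 strip: two similar pixels followed by a distinct one.
grid = [[[0.0, 0.0], [0.0, 0.0], [1.0, 1.0]]]
weights = pairwise_weights(grid)
```

With such a weight, neighbors with similar fused features are strongly encouraged to share a label, while the penalty for a label change across a feature discontinuity is small, which is what allows boundaries to survive the smoothing.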


0. Introduction

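Remote sensing image classification is both a research hotspot and a persistent difficulty in remote sensing image processing^[1]. With the continuous development of Earth observation technology, the volume of remote sensing data acquired by different types of sensors has grown sharply. On the one hand, this growth challenges the efficiency and accuracy of existing classification methods based on small samples; on the other hand, it makes it possible to further improve classification accuracy by exploiting observations of different types that are strongly complementary^[2].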

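Hyperspectral imagery has high spectral resolution and large data volume, and can finely characterize the spectral reflectance of ground-object surfaces; LiDAR data capture the three-dimensional structure of ground objects. Many researchers^[3] have tried to exploit this complementarity, using height information to help identify spectrally similar objects and spectral information to help identify objects of the same height, and have adopted data fusion to address the challenges of interpreting ground objects from a single data source.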

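As heterogeneous data with different imaging mechanisms, hyperspectral and LiDAR data are better suited to feature-level and decision-level fusion for high-accuracy scene interpretation^[4]. In the feature extraction stage, because hyperspectral imagery contains redundant data, it is usually reduced in dimensionality first^[5-6], and features are then extracted from the reduced optical data and the LiDAR data. However, simply stacking features may introduce new information redundancy, readily triggers the "curse of dimensionality", and hardly reflects the contribution of different features to classification, so fused features do not always outperform a single feature source. Therefore, reducing the dimensionality of the feature set again through feature selection or transformation, and using classifiers that are insensitive to feature dimensionality, have both become important means of improving classification accuracy. Reference [7] fused the spectral, spatial, and height features extracted from hyperspectral imagery and LiDAR data through a Laplacian mapping algorithm. Reference [8] used composite kernel techniques to construct separate kernel functions for spatial and spectral features and performed classification with an extreme learning machine. Decision-level fusion combines the classification results obtained under different classification settings. Typical decision fusion methods include simple majority voting^[9].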

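In addition, the idea of combining initial classification results with feature learning or region segmentation algorithms, in order to regularize object boundaries and optimize the classification, has also attracted some attention^[7]. For example, reference [9] used region objects obtained by a segmentation algorithm to post-process hyperspectral and LiDAR classification results, eliminating the "pepper and salt" noise of pixel-wise classification and regularizing object boundaries.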

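A conditional random field (CRF) is an undirected probabilistic graphical model. Based on the assumption that the class labels of image pixels are locally smooth in space, a CRF can effectively smooth the noise in an initial classification and is widely used in classification post-processing^[9]. The key technical issue is to preserve object boundaries effectively and prevent over-smoothing of the classification result^[9]. This is generally handled by introducing measures such as edges and local spectral differences into the spatial energy term^[10-12]. However, such work still mostly targets a single multispectral or hyperspectral data source and relies on low-level statistical features; how to design a local heterogeneity index for multi-source data that effectively measures the true differences between classes still requires further study.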
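
As a rough illustration of how a CRF-style post-processing step smooths an initial classification while respecting boundaries, the sketch below minimizes a unary term (negative log class probability) plus a weighted penalty for label disagreement between 4-connected neighbors. Iterated conditional modes (ICM) is used here only because it is simple; the inference procedure, the energy form, and the names `icm_smooth`, `weight_fn`, and `lam` are assumptions for illustration and are not taken from the paper.

```python
import math

def icm_smooth(prob, weight_fn, lam=1.0, n_iter=5):
    """prob[r][c][k]: class probabilities from an initial classifier.
    weight_fn(p, q): pairwise weight between neighboring pixels p and q.
    Returns a label grid after iterated conditional modes (ICM)."""
    h, w, k = len(prob), len(prob[0]), len(prob[0][0])
    # Start from the per-pixel most probable class.
    labels = [[max(range(k), key=lambda cls: prob[r][c][cls])
               for c in range(w)] for r in range(h)]
    for _ in range(n_iter):
        changed = False
        for r in range(h):
            for c in range(w):
                nbrs = [(r + dr, c + dc)
                        for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1))
                        if 0 <= r + dr < h and 0 <= c + dc < w]

                def energy(lab):
                    unary = -math.log(max(prob[r][c][lab], 1e-12))
                    # Pay the pairwise weight for each disagreeing neighbor.
                    pair = sum(weight_fn((r, c), q) for q in nbrs
                               if labels[q[0]][q[1]] != lab)
                    return unary + lam * pair

                best = min(range(k), key=energy)
                if best != labels[r][c]:
                    labels[r][c] = best
                    changed = True
        if not changed:
            break
    return labels

# Hypothetical 3 x 3 example: one weakly misclassified pixel in the center.
prob = [[[0.9, 0.1] for _ in range(3)] for _ in range(3)]
prob[1][1] = [0.4, 0.6]
smoothed = icm_smooth(prob, lambda p, q: 1.0)    # uniform weights
preserved = icm_smooth(prob, lambda p, q: 0.0)   # zero weights, no smoothing
```

With uniform weights the isolated center label is absorbed by its surroundings; with zero weights (as would be approached across a strong feature discontinuity) the label is kept, which is the mechanism that prevents over-smoothing.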

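The studies above show that improving multi-source classification accuracy and reducing "pepper and salt" noise depend on combining multiple stages and multiple processing strategies. Inspired by this multi-stage fusion idea, and in order to characterize local heterogeneity in the scene and preserve object boundaries in the optimized classification, this paper proposes a CRF-based hyperspectral-LiDAR fusion classification method that accounts for local feature differences and global class co-occurrence. Its main characteristics are that the fused spectral-spatial-height features extracted from the multi-source data are used both for the initial classification and for describing the local spatial heterogeneity of ground objects, and that a global class co-occurrence parameter is introduced into the objective function. By combining feature fusion with optimization-based post-processing, the method improves classification accuracy and suppresses "pepper and salt" noise.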
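
One simple way to obtain a global class co-occurrence statistic is to count how often each pair of classes appears on neighboring pixels of a label map and normalize by row. The sketch below follows that idea; the exact definition used in the paper is not given in this section, so the counting scheme, the row normalization, and the function name `class_cooccurrence` are assumptions for illustration only.

```python
def class_cooccurrence(labels, n_classes):
    """labels: H x W grid (nested lists) of integer class labels.

    Returns an n_classes x n_classes matrix whose entry [a][b] is the
    fraction of 4-connected neighbors of class-a pixels that have class b.
    """
    h, w = len(labels), len(labels[0])
    counts = [[0] * n_classes for _ in range(n_classes)]
    for r in range(h):
        for c in range(w):
            for r2, c2 in ((r, c + 1), (r + 1, c)):  # right and down neighbors
                if r2 < h and c2 < w:
                    a, b = labels[r][c], labels[r2][c2]
                    counts[a][b] += 1  # neighbor b seen from a
                    counts[b][a] += 1  # neighbor a seen from b
    matrix = []
    for row in counts:
        total = sum(row)
        matrix.append([x / total if total else 0.0 for x in row])
    return matrix

# Hypothetical label map with two classes present out of three.
label_map = [[0, 0, 1],
             [0, 0, 1]]
cooc = class_cooccurrence(label_map, n_classes=3)
```

Class pairs that rarely or never adjoin in the scene receive low values, so a pairwise term built on such a matrix can treat a transition between them differently from a transition between classes that commonly occur side by side.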

4. Conclusions

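To make full use of the complementary information in hyperspectral imagery and LiDAR data, and to improve classification accuracy while preserving land-cover boundaries, this paper proposes a classification method that couples feature fusion with a conditional random field model. The method achieved good classification results on the Houston hyperspectral-LiDAR experimental data set.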

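The main conclusions are as follows: (1) Extended morphological filtering features describe the spatial information in high-resolution imagery well, and combining them with a manifold-learning dimensionality reduction method yields high classification accuracy. However, because this is still a pixel-level classification method, it cannot fundamentally avoid fine-grained noise in the classification result. (2) Using the fused features to describe the heterogeneity of neighboring pixels in the second-order cliques of the conditional random field effectively avoids the label over-smoothing problem of traditional random field models; introducing the class co-occurrence cost function further improves classification accuracy. (3) Initializing the feature-field energy of the conditional random field model with the fused features and the probabilistic classification results, while also introducing the fused features into the modeling of the label field, achieves high-accuracy classification of ground objects and reduces classification noise through the cooperative fusion of features and decisions.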

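In future work, we plan to improve the application of this method to tree species classification in forest scenes, and to combine deep features with random field models.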

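Acknowledgments: We thank the National Key Research and Development Program project "Research on Key Technologies for Plantation Resources Monitoring (2017YFD0600900)" for providing the Gaofeng forest farm data set used in this paper, and the Hyperspectral Image Analysis group at the University of Houston and the National Center for Airborne Laser Mapping (NCALM) for providing the Houston data set used in this paper.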

References (15)

[1]	Ghamisi P, Rasti B, Yokoya N, et al. Multisource and multitemporal data fusion in remote sensing: A comprehensive review of the state of the art [J]. IEEE Geoscience and Remote Sensing Magazine, 2019, 7(1): 6-39.
[2]	Dalla Mura M, Prasad S, Pacifici F, et al. Challenges and opportunities of multimodality and data fusion in remote sensing [J]. Proceedings of the IEEE, 2015, 103(9): 1585-1601.
[3]	Rasti B, Ghamisi P, Gloaguen R. Hyperspectral and LiDAR fusion using extinction profiles and total variation component analysis [J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(7): 3997-4007.
[4]	Cao Qiong, Ma Ailong, Zhong Yanfei, et al. Hyperspectral-LiDAR multi-level fusion urban land cover classification [J]. National Remote Sensing Bulletin, 2019, 23(5): 892-903. (in Chinese)
[5]	Shi Guojun. Infrared image target recognition method based on joint characterization of depth feature [J]. Infrared and Laser Engineering, 2021, 50(3): 20200399. (in Chinese)
[6]	Hou Banghuan, Yao Minli, Jia Weimin, et al. Hyperspectral image classification based on spatial structure preserving [J]. Infrared and Laser Engineering, 2017, 46(12): 1228001. (in Chinese)
[7]	Liao W, Pižurica A, Bellens R, et al. Generalized graph-based fusion of hyperspectral and lidar data using morphological features [J]. IEEE Geoscience and Remote Sensing Letters, 2015, 12(3): 552-556.
[8]	Ghamisi P, Rasti B, Benediktsson J A. Multisensor composite kernels based on extreme learning machines [J]. IEEE Geoscience and Remote Sensing Letters, 2019, 16(2): 196-200.
[9]	Huang X, Lu Q, Zhang L, et al. New postprocessing methods for remote sensing image classification: A systematic study [J]. IEEE Transactions on Geoscience and Remote Sensing, 2014, 52(11): 7140-7159.
[10]	Debes C, Merentitis A, Heremans R, et al. Hyperspectral and LiDAR data fusion: Outcome of the 2013 GRSS data fusion contest [J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2014, 7(6): 2405-2418.
[11]	Ni L, Gao L, Li S, et al. Edge-constrained Markov random field classification by integrating hyperspectral image with LiDAR data over urban areas [J]. Journal of Applied Remote Sensing, 2014, 8(1): 085089.
[12]	Wang L, Huang X, Zheng C, et al. A Markov random field integrating spectral dissimilarity and class co-occurrence dependency for remote sensing image classification optimization [J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2017, 128: 223-239.
[13]	Chang C C, Lin C J. LIBSVM: A library for support vector machines [J]. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): 27.
[14]	Feng B, Zhang C, Zhang W, et al. Analyzing the role of spatial features when cooperating hyperspectral and LiDAR data for the tree species classification in a subtropical plantation forest area [J]. Journal of Applied Remote Sensing, 2020, 14(2): 022213.
[15]	Chen Y, Li C, Ghamisi P, et al. Deep fusion of remote sensing data for accurate classification [J]. IEEE Geoscience and Remote Sensing Letters, 2017, 14(8): 1253-1257.

(a) Houston		(b) Gaofeng forest farm
Class name	Number of training/testing samples (pixels)	Class name	Number of training/testing samples (pixels)
Healthy grass	198/1053	Eucalyptus	193/315
Stressed grass	190/1064	Road	74/106
Synthetic grass	192/505	Tilia tuan	40/52
Trees	188/1056	Cultivated land	95/127
Soil	186/1056	Acacia crassicarpa benth	208/308
Water	182/143	Wasteland	16/20
Residential	196/1072	Michelia macclurei dandy	69/95
Commercial	191/1053	Building	165/251
Road	193/1059	Other broad leaved forests	184/275
Highway	191/1036	Pinus massoniana lamb	214/300
Railway	181/1054	Cunninghamia lanceolata	34/47
Parking Lot 1	192/1041	Water	390/562
Parking Lot 2	184/285	Mixed shrub forest	53/84
Tennis court	181/247	Bamboo	21/34
Running track	187/473	Grassland	23/20

Accuracy metric	β = 0.5	β = 1	β = 1.5	β = 2	β = 2.5	β = 3	β = 3.5	β = 4	β = 4.5
OA	93.99%	94.00%	93.93%	93.88%	93.89%	93.86%	93.85%	93.84%	93.83%
Kappa	0.935	0.935	0.934	0.933	0.934	0.933	0.933	0.933	0.933
AA	93.47%	93.42%	93.19%	93.07%	91.30%	93.06%	93.04%	93.04%	93.03%

(a) Houston data set
Category	Pixel level classification method				CRF classification optimization method
Category	F^Spe	F^DSM	F^Spe+F^Spa	F^Spe+F^DSM	GGF	GGF_CRF1	GGF-CRF
Healthy grass	82.34	24.88	55.69	55.89	81.67	82.43	83.1
Stressed grass	83.36	55.92	84.40	84.49	99.34	99.62	99.81
Synthetic grass	100	91.88	100	100	100	100	100
Trees	93.37	67.23	91.57	98.11	99.24	99.24	99.62
Soil	98.30	76.80	100	99.15	100	100	100
Water	91.61	80.42	99.30	96.50	95.10	95.10	94.41
Residential	76.59	71.74	82.84	91.32	92.35	92.26	93.47
Commercial	56.51	61.92	53.09	52.42	94.59	94.78	95.73
Road	66.57	51.37	79.04	83.95	86.02	85.93	85.74
Highway	72.39	53.86	68.15	79.92	93.24	93.63	94.98
Railway	92.88	83.97	97.34	87.76	90.70	90.80	90.61
Parking Lot 1	78.58	60.71	97.70	79.63	94.24	94.43	97.41
Parking Lot 2	72.98	57.19	81.05	74.04	72.28	71.93	66.67
Tennis Court	98.79	97.17	100	98.79	100	100	100
Running Track	98.31	28.96	98.52	97.67	99.37	99.37	99.79
OA	81.98%	60.48%	85.12%	85.14%	93.34%	93.47%	94.00%
AA	84.17%	64.27%	85.91%	85.31%	93.21%	93.30%	93.42%
Kappa	0.805	0.597	0.839	0.839	0.928	0.929	0.935
(b) Gaofeng forest farm data set
Category	Pixel level classification method				CRF classification optimization method
Category	F^Spe	F^DSM	F^Spe+F^Spa	F^Spe+F^DSM	GGF	GGF_CRF1	GGF-CRF
Eucalyptus	73.65	60.63	90.79	77.46	86.67	96.82	97.14
Road	48.11	52.83	90.57	66.98	74.53	73.50	73.58
Tilia tuan	5.77	46.15	59.62	53.85	25	32.69	32.69
Cultivated land	83.46	98.43	100	96.85	100	100	100
Acacia crassicarpa benth	71.75	88.31	97.08	87.66	90.91	97.73	97.73
Wasteland	80	55	95	90	95	100	100
Michelia macclurei dandy	31.58	55.79	75.79	70.53	67.37	83.16	84.24
Building	83.27	84.06	96.41	92.83	98.01	97.21	97.21
Other broad leaved forests	70.91	66.55	96.36	65.82	83.27	85.45	85.82
Pinus massoniana lamb	73.67	92.67	92.00	85.67	89.00	96.67	97.00
Cunninghamia lanceolata	12.77	68.09	95.74	65.96	78.72	95.74	95.74
Water	99.82	98.22	100	99.64	100	100	100
Mixed shrub forest	2.38	51.19	73.81	67.86	73.81	88.1	88.1
Bamboo	0	2.94	17.65	17.65	0	0	0
Grassland	10.00	40.00	55.00	85.00	95.00	100	100
OA	71.46%	78.58%	92.41%	83.32%	87.71%	92.37%	92.84%
AA	49.81%	64.06%	82.39%	74.92%	77.15%	83.14%	83.28%
Kappa	0.674	0.756	0.914	0.811	0.860	0.913	0.919
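
The relation between the per-class accuracies and the summary metrics in table (a) can be checked with a short calculation. The sketch below takes the GGF-CRF column for the Houston data set and the Houston testing-sample counts from the sample table, and assumes that each per-class value is the percentage of that class's test pixels labeled correctly; under that assumption AA is the unweighted mean and OA is the mean weighted by test-sample count. The function names are illustrative.

```python
# Per-class accuracies (%) of GGF-CRF on Houston, copied from table (a).
class_acc = [83.1, 99.81, 100, 99.62, 100, 94.41, 93.47, 95.73,
             85.74, 94.98, 90.61, 97.41, 66.67, 100, 99.79]
# Testing-sample counts per class, copied from the Houston sample table.
n_test = [1053, 1064, 505, 1056, 1056, 143, 1072, 1053,
          1059, 1036, 1054, 1041, 285, 247, 473]

def average_accuracy(acc):
    """AA: unweighted mean of the per-class accuracies."""
    return sum(acc) / len(acc)

def overall_accuracy(acc, counts):
    """OA: correctly classified test pixels over all test pixels, in percent."""
    correct = sum(a / 100 * n for a, n in zip(acc, counts))
    return 100 * correct / sum(counts)

aa = average_accuracy(class_acc)
oa = overall_accuracy(class_acc, n_test)
print(f"AA = {aa:.2f}%, OA = {oa:.2f}%")
```

Under the stated assumption, both values agree with the AA and OA rows reported for GGF-CRF in table (a) to two decimal places. Kappa cannot be recovered this way because it requires the full confusion matrix.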

Accuracy metric	Deep fusion^[15]	HyMCKs^[8]	Multi-level fusion method^[4]	EC-CRF^[11]	GGF-CRF
OA	91.32%	90.33%	93.22%	91.70%	94.00%
Kappa	0.9057	0.8949	0.930	0.907	0.935


