Improved data-driven compressing method for hyperspectral mineral identification models

Deng Kewang; Zhao Huijie; Li Na; Cai Hui

doi:10.3788/IRLA20210252

Volume 51 Issue 3

Apr. 2022

Turn off MathJax

Article Contents

Article Navigation > Infrared and Laser Engineering > 2022 > 51(3): 20210252

Deng Kewang, Zhao Huijie, Li Na, Cai Hui. Improved data-driven compressing method for hyperspectral mineral identification models[J]. Infrared and Laser Engineering, 2022, 51(3): 20210252. doi: 10.3788/IRLA20210252

Citation:

Deng Kewang, Zhao Huijie, Li Na, Cai Hui. Improved data-driven compressing method for hyperspectral mineral identification models[J]. Infrared and Laser Engineering, 2022, 51(3): 20210252. doi: 10.3788/IRLA20210252

Improved data-driven compressing method for hyperspectral mineral identification models

doi: 10.3788/IRLA20210252

Deng Kewang^1
,,
Zhao Huijie^{1, 2
,},
Li Na^{1
,
,},
Cai Hui³

1.
School of Instrumentation Science and Opto-Electronic Engineering, Beihang University, Beijing 100191, China
2.
Beihang University Qingdao Research Institute, Beihang University, Qingdao 266101, China
3.
Unit 96901 of the People's Liberation Army of China, Beijing 300140, China

Funds: National Key Research and Development Program of China（2016YFB0500505，2017YFC0602104）； National Natural Science Foundation of China （61975004）；Demonstration System for Remote Sensing Application of High Resolution Land Resources (Phase II)（04-Y30B01-9001-18/20）；Qingdao Entrepreneurial Innovation Leading Talent Program (18-1-2-22-zhc)

Received Date: 2021-12-10
Rev Recd Date: 2022-01-25
Publish Date: 2022-04-07

Abstract

It was difficult to extract mineral features efficiently and quickly from large quantities of hyperspectral data obtained by airborne imaging hyperspectral spectrometers. An improved data-driven compressing method for mineral identification models was proposed in this paper, which pruned redundant neurons in neural networks to obtain efficient mineral identification models. Firstly, the average percentage of zeros driven by correctly identified samples in the validation set (C-APoZ) of each neuron was calculated as a criterion of importance for the neuron, so as to explore the contribution of the neuron to the network for identifying samples correctly. Then, the redundant neurons were pruned by setting the importance threshold, and the pruned network was retrained to improve the identification accuracy while preserving the correct identification abilities of the original network. Finally, an efficient compressed model for mineral identification was obtained through multiple iterative pruning. In this paper, the improved data-driven compressing method was conducted on the mineral identification models based on multilayer perceptron (MLP) to promote their efficiency. The hyperspectral data of the Nevada mining area collected by Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) were applied to evaluate the proposed method. The results show that the proposed method obtained an efficient model for mineral identification with the compression rate of 3.33 and the identification accuracy of 94.35%.
- neural network,
- network pruning,
- hyperspectral,
- mineral identification

References

[1]	Balas C, Epitropou G, Tsapras A, et al. Hyperspectral imaging and spectral classification for pigment identification and mapping in paintings by El Greco and his workshop [J]. Multimedia Tools and Applications, 2018, 77: 9737-9751. doi: 10.1007/s11042-017-5564-2
[2]	Amer R, Mezayen A A, Hasanein M. ASTER spectral analysis for alteration minerals associated with gold mineralization [J]. Ore Geology Reviews, 2016, 75: 239-251. doi: 10.1016/j.oregeorev.2015.12.008
[3]	Hecker C, Ruitenbeek F, Werff H, et al. Spectral absorption feature analysis for finding ore: A tutorial on using the method in geological remote sensing [J]. IEEE Geoscience and Remote Sensing Magazine, 2019, 7(2): 51-71. doi: 10.1109/MGRS.2019.2899193
[4]	Okada N, Maekawa Y, Owada N, et al. Automated identification of mineral types and grain size using hyperspectral imaging and deep learning for mineral processing [J]. Minerals, 2020, 10(9): 809. doi: 10.3390/min10090809
[5]	Deng Kewang, Zhao Huijie, Li Na, et al. Identification of minerals in hyperspectral imagery based on the attenuation spectral absorption index vector using a multilayer perceptron [J]. Remote Sensing Letters, 2021, 12(5): 449-458. doi: 10.1080/2150704X.2021.1903612
[6]	Zhang Minghua, Zou Yaqing, Song Wei, et al. GGCN: GPU-based hyperspectral image classification algorithm [J]. Laser & Optoelectronics Progress, 2020, 57(20): 231-237. (in Chinese)
[7]	Denil M, Shakibi B, Dinh L, et al. Predicting parameters in deep learning [J]. Neural Information Processing Systems (NIPS), 2012: 1097-1105.
[8]	Han S, Mao H, Dally W. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding [J]. Fiber, 2015, 56(4): 3-7.
[9]	Cheng Yu, Wang Duo, Zhou Pan, et al. A survey of model compression and acceleration for deep neural networks [J]. IEEE Signal Processing Magazine, 2017, 35(1): 126-136.
[10]	Wang Yulong, Zhang Xiao, Xie Lingxi, et al. Pruning from scratch[C]// Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 12273-12280.
[11]	Hassibi B, Stork D, Wolff G. Optimal brain surgeon and general network pruning[C]//IEEE International Conference on Neural Networks, 1993: 293-299.
[12]	Cun Y, Denker J, Solla S. Optimal brain damage[J]. *Advances in Neural Information Processing Systems*, 1990, 2: 598-605.
[13]	Hassibi B. Second order derivatives for network pruning: Optimal brain surgeon [J]. Advances in Neural Information Processing Systems, 1992, 5: 164-171.
[14]	Roy S, Panda P, Srinivasan G, et al. Pruning filters while training for efficiently optimizing deep learning networks[C]//2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 2020: 1-7.
[15]	Guo Yiwen, Yao Anbang, Chen Yurong. Dynamic network surgery for efficient DNNs[C]//Neural Information Processing Systems, 2016: 1387-1395.
[16]	Luo Jianhao, Wu jianxin, Lin Weiyao. ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression[C]//IEEE International Conference on Computer Vision (ICCV), 2017: 5068-5076.
[17]	Hu Hengyuan, Peng Rui, Tai Yuwing, et al. Network trimming: A data-driven neuron pruning approach towards efficient deep architectures [J]. arXiv preprint arXiv, 2016: 1607.03250.
[18]	Clark R N, Swayze G A, Livo K E, et al. Imaging spectroscopy: Earth and planetary remote sensing with the USGS Tetracorder and expert systems [J]. Journal of Geophysical Research, 2003, 108(E12): 5131.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(7) / Tables(4)

Get Citation

PDF

XML

Article Metrics

Article views(298) PDF downloads(33) Cited by()

Proportional views

HTML

0. 引　言

随着高光谱技术的不断发展，光谱分辨率与空间分辨率的不断提高，为高光谱矿物识别提供了有效技术手段。传统的矿物识别方法主要基于矿物光谱的相似性或者光谱的诊断特征，较为依赖矿物光谱且容易受成像条件的影响^[1-3]。深度学习技术在图像分类和识别领域取得了有效应用，极大地提升了模型的鲁棒性，越来越多的学者将神经网络应用于高光谱矿物识别^[4-5]。但是，该类神经网络模型由于网络结构复杂，网络参数庞大，很难满足许多高光谱应用中对矿物的快速信息提取和高效识别的要求。尽管Zhang等^[6]将GPU技术应用于矿物识别应用中，提高了计算速度，但是，针对矿物识别模型效率提升的模型结构研究还比较缺乏。Denil^[7]等提出神经网络存在着大量的冗余参数，仅需要部分网络参数就能获得与原网络相同的效果。除此之外，Han^[8]等提出过多的网络参数会带来过拟合现象，导致识别精度的下降。为了获取高效的矿物识别模型，必须去除神经网络中的冗余单元，因此，对矿物识别模型进行模型压缩成为必要的手段。目前，神经网络模型压缩主要分为网络剪枝、参数量化、设计结构化矩阵、知识蒸馏这四类方法^[9]。其中，网络剪枝方法直接作用于神经网络中的冗余单元，在降低模型规模的同时，还能减少过拟合程度，显著提升模型效能，得到了广泛应用^[10]。

网络剪枝的核心思想是建立网络元素的重要性判别依据，对重要性低的冗余元素进行剪除，并通过再训练获得新的压缩模型，其主要包括非结构化网络剪枝和结构化网络剪枝^[11]。非结构化网络剪枝主要针对神经网络中不同网络层之间的冗余连接进行剪枝，以实现模型压缩^[12-14]。但是，该类网络剪枝方法可能会对部分重要连接进行误剪枝，并缺少相应的连接恢复过程，从而导致网络不收敛或者精度严重下降。Guo等^[15]提出动态网络外科手术方法，对误剪枝的重要连接进行恢复，在获得高压缩比的同时，几乎没有精度损失。然而，非结构化网络剪枝使得权值参数矩阵呈现稀疏化，不利于硬件的加速，并且需要专门的软件进行支持。结构化剪枝主要针对神经网络中的冗余神经元、卷积核或者通道进行剪枝，以实现网络压缩^[16-17]。结构化网络剪枝将网络单元进行整体剪除，不需要特殊的硬件或者软件支持，得到了更为广泛的应用。然而，现有的结构化剪枝方法通常将所有的验证样本作为神经网络单元重要性的数据驱动，并未考虑误识别样本可能造成的影响。

因此，针对上述问题，文中提出了一种基于改进样本驱动的高光谱矿物识别模型压缩方法，对神经网络中冗余神经元进行剪枝，以得到高效的矿物识别模型。该方法以验证数据集中正确识别样本为基础，首先计算各神经元经激活后输出零值频率，即正确识别样本驱动的激活输出零值率（Average Percen tage of Zeros driven by Correctly identified samples, C-APoZ），并将C-APoZ作为神经元重要性依据，然后根据C-APoZ值对神经网络进行迭代剪枝，实现对原网络的高效压缩。为验证方法的有效性，将提出的模型压缩方法应用于基于多层感知机的矿物识别模型，并以美国内华达州Cuprite矿区的AVIRIS高光谱数据作为测试数据。

4. 结　论

文中提出基于改进样本驱动的网络剪枝方法，针对高光谱矿物识别模型中存在的冗余神经元进行剪枝，在降低神经网络冗余性和过拟合的同时，提升了矿物识别精度，获得了高效的压缩矿物识别模型。该方法相相较于传统的基于样本驱动的网络剪枝方法，摒弃了误识别样本对神经元重要性的影响，只对原模型正确识别样本的能力进行保留。实验结果表明，利用文中提出的剪枝方法对已有高光谱矿物识别模型进行网络剪枝，获得了压缩比为3.33的高效压缩识别模型，对Cuprite矿区高光谱数据的识别精度也由93.62%提升到了94.35%，充分体现了该剪枝方法的高效性。后续需要对误识别样本对神经元重要性带来的负面影响进行分析，以获得更加全面的神经元重要性判据。

Reference (18)

[1]	Balas C, Epitropou G, Tsapras A, et al. Hyperspectral imaging and spectral classification for pigment identification and mapping in paintings by El Greco and his workshop [J]. Multimedia Tools and Applications, 2018, 77: 9737-9751.
[2]	Amer R, Mezayen A A, Hasanein M. ASTER spectral analysis for alteration minerals associated with gold mineralization [J]. Ore Geology Reviews, 2016, 75: 239-251.
[3]	Hecker C, Ruitenbeek F, Werff H, et al. Spectral absorption feature analysis for finding ore: A tutorial on using the method in geological remote sensing [J]. IEEE Geoscience and Remote Sensing Magazine, 2019, 7(2): 51-71.
[4]	Okada N, Maekawa Y, Owada N, et al. Automated identification of mineral types and grain size using hyperspectral imaging and deep learning for mineral processing [J]. Minerals, 2020, 10(9): 809.
[5]	Deng Kewang, Zhao Huijie, Li Na, et al. Identification of minerals in hyperspectral imagery based on the attenuation spectral absorption index vector using a multilayer perceptron [J]. Remote Sensing Letters, 2021, 12(5): 449-458.
[6]	Zhang Minghua, Zou Yaqing, Song Wei, et al. GGCN: GPU-based hyperspectral image classification algorithm [J]. Laser & Optoelectronics Progress, 2020, 57(20): 231-237. (in Chinese)
[7]	Denil M, Shakibi B, Dinh L, et al. Predicting parameters in deep learning [J]. Neural Information Processing Systems (NIPS), 2012: 1097-1105.
[8]	Han S, Mao H, Dally W. Deep compression: Compressing deep neural networks with pruning, trained quantization and huffman coding [J]. Fiber, 2015, 56(4): 3-7.
[9]	Cheng Yu, Wang Duo, Zhou Pan, et al. A survey of model compression and acceleration for deep neural networks [J]. IEEE Signal Processing Magazine, 2017, 35(1): 126-136.
[10]	Wang Yulong, Zhang Xiao, Xie Lingxi, et al. Pruning from scratch[C]// Proceedings of the AAAI Conference on Artificial Intelligence, 2020, 34(7): 12273-12280.
[11]	Hassibi B, Stork D, Wolff G. Optimal brain surgeon and general network pruning[C]//IEEE International Conference on Neural Networks, 1993: 293-299.
[12]	Cun Y, Denker J, Solla S. Optimal brain damage[J]. *Advances in Neural Information Processing Systems*, 1990, 2: 598-605.
[13]	Hassibi B. Second order derivatives for network pruning: Optimal brain surgeon [J]. Advances in Neural Information Processing Systems, 1992, 5: 164-171.
[14]	Roy S, Panda P, Srinivasan G, et al. Pruning filters while training for efficiently optimizing deep learning networks[C]//2020 International Joint Conference on Neural Networks (IJCNN). IEEE, 2020: 1-7.
[15]	Guo Yiwen, Yao Anbang, Chen Yurong. Dynamic network surgery for efficient DNNs[C]//Neural Information Processing Systems, 2016: 1387-1395.
[16]	Luo Jianhao, Wu jianxin, Lin Weiyao. ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression[C]//IEEE International Conference on Computer Vision (ICCV), 2017: 5068-5076.
[17]	Hu Hengyuan, Peng Rui, Tai Yuwing, et al. Network trimming: A data-driven neuron pruning approach towards efficient deep architectures [J]. arXiv preprint arXiv, 2016: 1607.03250.
[18]	Clark R N, Swayze G A, Livo K E, et al. Imaging spectroscopy: Earth and planetary remote sensing with the USGS Tetracorder and expert systems [J]. Journal of Geophysical Research, 2003, 108(E12): 5131.

Class name	Training samples	Testing samples	Diagnostic bands/nm
Muscovite	100	400	2 200, 2 350
Halloysite	100	240	2 170, 2 210
Calcite	100	240	2 160, 2 340
Kaolinite	100	400	2 170, 2 210
Montmorillonite	100	400	2 230
Alunite	100	400	2 170, 2 320
Chalcedony	100	240	2 250
Total	700	2320

Importance criteria	Sequence number of the pruned neuron	Number of pruned units	Compression rate	Identification accuracy after retraining
Proposed C-APoZ	1, 4, 6, 12, 17, 20, 23, 27	8	1.36	94.61%
APoZ	1, 4, 6, 12, 17, 20, 23	7	1.30	94.57%

Iteration	C-APoZ (Proposed method)			APoZ
Iteration	Compression rate	Threshold	Identification accuracy	Compression rate	Threshold	Identification accuracy
0	0	0.817	93.62%	0	0.814	93.62%
1	1.36	0.651	94.61%	1.30	0.643	94.57%
2	1.76	0.502	94.66%	1.76	0.597	94.40%
3	2.31	0.464	95.04%	2.14	0.469	94.00%
4	2.73	0.445	94.66%	2.50	0.426	94.00%
5	3.33	0.448	94.35%	2.73	0.444	94.22%
6	4.29	0.379	93.41%	3.00	0.435	93.84%
7	6.00	0.316	90.13%	3.33	0.412	93.32%

Improved data-driven compressing method for hyperspectral mineral identification models

doi: 10.3788/IRLA20210252

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Related

Proportional views