False label detection in hyperspectral image based on low rank sparse and improved SAM

LIU Xuan¹, QU Shenming^1,2

1. School of Software, Henan University, Kaifeng 475001, China
2. Institute of Intelligence Networks System, Henan University, Kaifeng 475001, China

doi:10.7510/jgjs.issn.1001-3806.2022.06.016

About the author: LIU Xuan (1994-), male, master's student; main research interest: hyperspectral image processing.

Corresponding author: QU Shenming, qsm@vip.henu.edu.cn

Funding:
Science and Technology Development Plan of Henan Province, project 212102210538
Graduate Education Innovation and Quality Improvement Program of Henan University, project SYL20040121

CLC number: TP751

Abstract: Noisy labels in the training samples of supervised hyperspectral image classification algorithms reduce subsequent classification accuracy. To address this problem, a false label detection algorithm based on low rank sparse representation and improved spectral angle mapping (SAM) was adopted. First, the signal subspace of the hyperspectral image was estimated, and the original hyperspectral image was reconstructed and denoised according to the estimated subspace. Next, a normalized spectral angle mapping algorithm was used to obtain the distances between the samples of each class, giving the spectral similarity between the samples of each class, and the density peak clustering algorithm was used to obtain the local density of each training sample. Finally, a decision function based on local density was used to detect the noisy labels, and a support vector machine was used to verify the results on two real datasets. The results show that the algorithm improves the overall accuracy by 1.91% compared with the advanced hierarchical structure-based false label detection algorithm for hyperspectral images. This result is helpful for hyperspectral image classification.

Keywords: image processing; low rank sparse representation; normalized spectral angle mapping; density peak clustering algorithm; noise label detection

Figure 1. Flow chart of the LRS-NSAMDP algorithm

Figure 2. KSC dataset

(a) false color image; (b) ground truth map; (c) class names

Figure 3. PaviaU dataset

(a) false color image; (b) ground truth map; (c) class names

Figure 4. Effect of the parameter θ and the local density coefficient λ on OA for the KSC dataset

Figure 5. Effect of the parameter θ and the local density coefficient λ on OA for the PaviaU dataset

Figure 6. Classification maps (25T+5U) obtained by different algorithms on the KSC dataset

(a) SVM, OA: 85.20%; (b) DP, OA: 87.04%; (c) K-SDP, OA: 86.41%; (d) KECA, OA: 86.72%; (e) HCEM, OA: 87.90%; (f) LRS-NSAMDP, OA: 88.51%

Figure 7. Classification maps (50T+10U) obtained by different algorithms on the PaviaU dataset

(a) SVM, OA: 75.63%; (b) DP, OA: 79.01%; (c) K-SDP, OA: 80.44%; (d) KECA, OA: 81.43%; (e) HCEM, OA: 82.72%; (f) LRS-NSAMDP, OA: 83.28%

Figure 8. OA obtained with different false label detection algorithms on different training sets

(a) KSC; (b) PaviaU

Table 1. Number of false labels detected in each class by different detection algorithms under different numbers of uncertain samples

	number of true labels (T) and uncertain labels (U) in each training sample
false label detection algorithm	25T+3U	25T+5U	25T+7U	25T+9U	25T+11U
	number of false labels detected in each class
DP^[14]	1.3	0.6	2	2	1
K-SDP^[15]	1	1	1	1	1
LRS-NSAMDP	0	0.6	0.3	1.3	0.6

Table 2. Classification performance on the KSC dataset with false labels detected using different distance measures

classification accuracy	ED^[23]	SID^[24]	CC^[25]	SAM^[22]	LRS-NSAMDP
OA/%	88.70	87.63	89.17	89.46	89.74
AA/%	83.95	83.05	85.21	84.65	85.38
kappa	0.8739	0.8620	0.8790	0.8824	0.8854

Table 3. Classification accuracy on the KSC dataset with different false label detection algorithms

class	number of true samples and uncertain samples
	25T+5U						25T+15U
	SVM	DP	K-SDP	KECA	HCEM	LRS-NSAMDP	SVM	DP	K-SDP	KECA	HCEM	LRS-NSAMDP
scrub	93.53	95.30	95.04	93.29	92.87	96.44	96.11	95.73	95.30	93.38	87.79	96.83
will-S	79.86	83.77	81.30	85.60	89.90	88.49	77.02	74.53	76.62	79.93	86.39	88.27
cabb-H	91.84	84.55	83.43	82.76	83.67	78.18	86.19	84.34	85.04	81.06	84.16	70.89
cabb-O	58.72	64.64	64.31	62.60	61.99	65.89	56.02	57.00	56.64	60.02	54.48	57.89
slash-P	51.88	61.92	62.86	62.45	63.28	58.04	57.14	57.82	57.34	56.72	75.00	51.57
broad	51.72	57.08	59.27	55.13	54.59	75.09	36.10	48.74	53.65	57.81	51.60	67.20
hardwood	56.83	70.00	64.99	66.96	65.14	84.38	53.19	65.48	66.03	60.36	64.65	76.47
graminoid	82.58	78.09	77.49	80.34	78.24	89.57	52.58	69.17	73.83	80.82	89.49	77.63
spartina	89.27	84.96	86.89	89.07	92.67	92.52	88.55	82.63	87.10	86.75	87.52	88.94
cattail	80.85	92.55	88.93	94.48	100.00	90.96	83.21	87.27	88.86	94.13	91.72	89.93
salt	86.72	87.33	91.52	92.60	94.29	94.89	97.78	90.71	90.95	90.83	90.59	95.49
mud	83.62	90.24	94.40	93.48	84.92	91.45	82.76	87.10	85.11	92.81	84.88	93.05
water	99.18	96.62	99.50	99.60	100.00	98.79	100.00	98.65	99.00	98.07	96.33	98.87
OA/%	83.57	85.47	86.19	86.67	87.52	89.43	79.90	82.60	84.97	85.19	85.23	86.49
AA/%	77.43	80.54	80.76	81.42	81.66	84.98	74.36	76.86	79.35	79.44	80.35	81.00
kappa	0.8171	0.8381	0.8460	0.8514	0.8607	0.8821	0.7766	0.8062	0.8325	0.8349	0.8345	0.8493

Table 4. Classification accuracy on the PaviaU dataset with different false label detection algorithms

class	number of true samples and uncertain samples
	50T+10U						50T+20U
	SVM	DP	K-SDP	KECA	HCEM	LRS-NSAMDP	SVM	DP	K-SDP	KECA	HCEM	LRS-NSAMDP
asphalt	89.43	95.72	97.09	93.83	80.57	96.25	88.69	89.63	92.52	95.94	95.82	96.51
meadows	93.05	93.80	94.03	94.66	95.72	97.43	92.46	92.64	93.69	92.78	93.71	92.91
gravel	57.31	58.11	60.86	60.75	57.14	61.38	56.47	56.90	57.10	65.44	58.16	67.37
trees	71.22	75.79	80.51	73.04	88.67	86.84	67.07	70.52	72.58	81.67	78.59	89.60
M-sheets	85.51	94.23	95.41	86.01	99.10	98.35	84.50	88.67	89.40	81.22	86.05	81.48
B-soil	57.34	56.00	59.96	66.78	44.17	47.28	48.12	49.53	56.25	50.86	60.03	60.64
bitumen	48.44	43.99	52.73	51.71	56.16	58.94	46.09	40.77	48.89	67.86	70.43	68.52
self-Bricks	77.10	74.95	80.00	75.89	81.17	83.28	71.47	73.33	76.47	59.66	56.37	66.26
shadows	83.26	74.66	95.89	88.34	83.01	93.82	87.30	77.30	92.22	70.53	69.26	70.19
OA/%	76.73	78.91	79.55	81.11	82.25	83.49	72.46	75.27	76.32	76.53	78.12	81.42
AA/%	73.84	73.93	79.50	76.77	76.19	80.40	71.96	73.70	75.93	74.00	74.27	77.05
kappa	0.7135	0.7187	0.7542	0.7577	0.7601	0.7783	0.6649	0.6772	0.6973	0.6982	0.7158	0.7551

Table 5. Detection performance of false labels for the proposed method on two datasets

dataset		KSC			PaviaU
total uncertain samples		6×13	9×13	12×13	10×9	15×9	20×9
undetected false samples	DP	7.4	6.8	5.7	21.7	21.1	20.7
	K-SDP	6.1	5.4	4.8	20.4	18.2	17.8
	LRS-NSAMDP	5.6	4.4	3.8	17.6	15.8	14.1

[1]	ZHANG L, WEI W, ZHANG Y N, et al. Cluster sparsity field: An internal hyperspectral imagery prior for reconstruction[J]. International Journal of Computer Vision, 2018, 126(8): 797-821. doi: 10.1007/s11263-018-1080-8
[2]	LI Sh T, HAO Q N, GAO G H, et al. The effect of ground truth on performance evaluation of hyperspectral image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(12): 7195-7206.
[3]	PARK B, LU R F. Hyperspectral imaging technology in food and agriculture[M]. New York, USA: Springer, 2015: 305-331.
[4]	RUITENBEEK F, DEBBA P, MEER F D, et al. Mapping white micas and their absorption wavelengths using hyperspectral band ratios[J]. Remote Sensing of Environment, 2006, 102(3/4): 211-222.
[5]	HU Y F, ZHANG Q L, ZHANG Y Z, et al. A deep convolution neural network method for land cover mapping: A case study of Qinhuangdao, China[J]. Remote Sensing, 2018, 10(12): 2053. doi: 10.3390/rs10122053
[6]	GUAN Sh H, YANG G, LI H, et al. Hyperspectral image classification based on 3-D convolutional recurrent neural network[J]. Laser Technology, 2020, 44(4): 485-491(in Chinese).
[7]	ZHANG X G, GAO Z Y, JIAO L C, et al. Multifeature hyperspectral image classification with local and nonlocal spatial information via Markov random field in semantic space[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(3): 1409-1424.
[8]	FANG L Y, HE N J, LI Sh T, et al. A new spatial-spectral feature extraction method for hyperspectral images using local covariance matrix representation[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(6): 3534-3546.
[9]	LU Zh W, FU Zh Y, XIANG T, et al. Learning from weak and noisy labels for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2016, 39(3): 486-500.
[10]	FOODY G M. The effect of mis-labeled training data on the accuracy of supervised image classification by SVM[C]// 2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). New York, USA: IEEE, 2015: 4987-4990.
[11]	KANG X D, DUAN P H, XIANG X L, et al. Detection and correction of mislabeled training samples for hyperspectral image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2018, 56(10): 5673-5686.
[12]	TU B, ZHOU Ch L, KUANG W L, et al. Hyperspectral imagery noisy label detection by spectral angle local outlier factor[J]. IEEE Geoscience and Remote Sensing Letters, 2018, 15(9): 1417-1421. doi: 10.1109/LGRS.2018.2842792
[13]	RODRIGUEZ A, LAIO A. Clustering by fast search and find of density peaks[J]. Science, 2014, 344(6191): 1492-1496. doi: 10.1126/science.1242072
[14]	TU B, ZHANG X F, KANG X D, et al. Density peak-based noisy label detection for hyperspectral image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(3): 1573-1584.
[15]	TU B, ZHANG X F, KANG X D, et al. Spatial density peak clustering for hyperspectral image classification with noisy labels[J]. IEEE Transactions on Geoscience and Remote Sensing, 2019, 57(7): 5085-5097. doi: 10.1109/TGRS.2019.2896471
[16]	TU B, ZHANG Ch L, PENG J, et al. Kernel entropy component analysis-based robust hyperspectral image supervised classification[J]. Remote Sensing, 2019, 11(23): 2823. doi: 10.3390/rs11232823
[17]	ZOU Zh X, SHI Zh W. Quadratic constrained energy minimization for hyperspectral target detection[C]//2015 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). New York, USA: IEEE, 2015: 4979-4982.
[18]	ZHANG Y F, XIE B B, SUN J, et al. A hybrid sparsity and constrained energy minimization detector for hyperspectral images[C]//2017 IEEE International Geoscience and Remote Sensing Symposium (IGARSS). New York, USA: IEEE, 2017: 1137-1140.
[19]	TU B, ZHOU Ch L, LIAO X L, et al. Hierarchical structure-based noisy labels detection for hyperspectral image classification[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2020, 13: 2183-2199. doi: 10.1109/JSTARS.2020.2994162
[20]	KIM S J, KOH K, LUSTIG M, et al. An interior-point method for large-scale ℓ1-regularized least squares[J]. IEEE Journal of Selected Topics in Signal Processing, 2007, 1(4): 606-617. doi: 10.1109/JSTSP.2007.910971
[21]	BIOUCAS-DIAS J M, NASCIMENTO J M. Hyperspectral subspace identification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2008, 46(8): 2435-2445. doi: 10.1109/TGRS.2008.918089
[22]	KRUSE F A, LEFKOFF A B, BOARDMAN J W, et al. The spectral image processing system (SIPS): Software for integrated analysis of AVIRIS data[J]. Remote Sensing of Environment, 1993, 44(2/3): 145-163.
[23]	CUI M S, PRASAD S. Class-dependent sparse representation classifier for robust hyperspectral image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2015, 53(5): 2683-2695.
[24]	SU H J, SHENG Y H. Orthogonal projection divergence-based hyperspectral band selection[J]. Spectroscopy and Spectral Analysis, 2011, 31(5): 1309-1313(in Chinese). doi: 10.3964/j.issn.1000-0593(2011)05-1309-05
[25]	TU B, YANG X C, LI N Y, et al. Hyperspectral image classification via superpixel correlation coefficient representation[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2018, 11(11): 4113-4127. doi: 10.1109/JSTARS.2018.2866901
[26]	MELGANI F, BRUZZONE L. Classification of hyperspectral remote sensing images with support vector machines[J]. IEEE Transactions on Geoscience and Remote Sensing, 2004, 42(8): 1778-1790. doi: 10.1109/TGRS.2004.831865

Introduction

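A hyperspectral image consists of the spectral characteristics of ground objects reflected in tens to hundreds of bands and recorded by an imaging spectrometer. It has two spatial dimensions and one spectral dimension, and each spectral vector along the spectral dimension represents the distinctive spectral signature of the corresponding pixel. Because spectral signatures are well suited to feature recognition, hyperspectral image processing has been widely applied in many scenarios^[1-2], such as precision agriculture^[3], ocean monitoring^[4], and urban and rural planning^[5]. Hyperspectral image classification plays an important role in these applications. In recent years, several joint spatial-spectral classification algorithms have been used to improve classification accuracy^[6-8]. These methods are feasible when the labels of the training samples can be learned reliably, but in practical applications this is often not the case.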

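Supervised hyperspectral image classification algorithms require fully labeled samples, but manual labeling is very difficult, and training samples labeled by visual interpretation alone are unreliable. Specifically, false labels are introduced for the following reasons: (1) the global positioning system may estimate the spatial position of a target inaccurately, which makes it hard to determine the precise location of a hyperspectral pixel; (2) some scenes, such as oceans and wetlands, cannot be reached by people, so training sample labels based on human visual interpretation inevitably contain noise; (3) when a scene contains many irregularly shaped land covers, errors arise during manual labeling.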

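The problem of false labels in training samples has been studied in depth in the field of computer vision. LU et al.^[9] proposed a learning model based on Manhattan distance optimization to detect weak and noisy labels. FOODY^[10] found that noisy labels affect support vector machine-based classification for airborne mapping. Although many studies have addressed noisy labels in computer vision, these methods cannot be extended directly to false labels in hyperspectral images because of the high dimensionality and nonlinear structure of such images. In recent years, hyperspectral image classification with noisy labels has attracted attention. KANG et al.^[11] first proposed a noisy label detection and correction method based on spectral detection and edge-preserving filtering. TU et al.^[12] detected noisy labels in hyperspectral images by fusing the spectral angle with the local outlier factor, and their experimental results showed that the algorithm detects noisy labels effectively. The density peak (DP) clustering algorithm, a robust clustering algorithm, was first proposed in the journal Science^[13]. TU et al.^[14] were the first to use DP clustering to detect false labels in hyperspectral training samples, but the DP-based detection algorithm does not consider the spatial correlation between neighboring spectral pixels. To solve this problem, TU et al.^[15] proposed a new noisy label detection algorithm based on spatial DP clustering (k-spatial density peak, K-SDP), which adds the neighborhood samples of a center sample to further assess how anomalous the center sample is. However, neither reference [14] nor reference [15] considers the sparse noise present in the original hyperspectral image. Reference [16] proposed a noisy label detection method based on kernel entropy component analysis (KECA), but that algorithm does not take the contextual information of the training samples into account during detection.

Several algorithms based on constrained energy minimization (CEM) have been widely used in hyperspectral image processing. ZOU et al.^[17] proposed a quadratic constrained energy minimization detector for hyperspectral target detection. In addition, ZHANG et al.^[18] proposed a detector that combines sparsity with CEM to improve target detection performance. CEM has also been applied effectively to false label detection in hyperspectral images. TU et al.^[19] proposed a hierarchical constrained energy minimization (HCEM) method to detect mislabeled samples in the original training set used for supervised tasks; the method removes noisy labels from the original training set accurately and improves the performance of supervised classification. One drawback of that algorithm, however, is that it uses the original spectral angle mapping (SAM) algorithm to measure the similarity of spectral vectors. The original SAM is a global descriptor: when the attribute values of some bands change, or when the attribute values of all bands change by different amounts, the cosine of the spectral angle is often distorted.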

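To solve the problems in references [14]-[16] and [19], the authors propose a density peak clustering algorithm based on low rank sparse representation and improved spectral angle mapping (low rank sparse-normalized spectral angular mapping density peak clustering, LRS-NSAMDP). Compared with the DP clustering algorithm^[14] and the K-SDP algorithm^[15], the improvement is that the sparse noise in the original hyperspectral image is removed and the low-rank component of the image is extracted. This lowers the weighted average local density of each class of samples, which reduces the number of false labels among the spectral vectors and improves classification accuracy. Compared with the HCEM-based false label detection algorithm for hyperspectral images^[19], the proposed algorithm improves the original SAM algorithm: the attribute values of each spectral vector over the bands are normalized by dividing them by the norm of that vector. Relative to SAM, this reduces the spectral angle between pixels of the same class and brings them closer together, so false labels whose pixels differ markedly from the other training samples are easier to detect. With these two improvements, the overall accuracy (OA), average accuracy (AA), and kappa coefficient are higher than those of other advanced false label detection algorithms for remote sensing images.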

1. Related techniques

1.1. Signal subspace estimation

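Let an original hyperspectral image be Y ≡[y₁, y₂, …, y_Q], where Q is the number of pixels in each band. Because adjacent bands of a hyperspectral image are highly correlated, and following linear regression theory and least squares theory^[20], let z_i be the correlation coefficient vector read by the sensor in band i. Then: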

$ \boldsymbol{z}_i=\boldsymbol{Z}_{\partial_i} \boldsymbol{\beta}_i+\boldsymbol{\xi}_i $

(1)

$ \hat{\boldsymbol{\beta}}_i=\left(\boldsymbol{R}_{\partial_i, \partial_i}{ }^{\prime}-\boldsymbol{R}_{\partial_i, i}{ }^{\prime} \boldsymbol{R}_{i, \partial_i}{ }^{\prime} / \boldsymbol{R}_{i, i}{ }^{\prime}\right) \hat{\boldsymbol{R}}_{\partial_i, i} $

(2)

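where Z_{∂_i} is the correlation coefficient matrix with band i removed, β_i is the regression vector of band i, ξ_i is the sparse noise vector, $\hat{\boldsymbol{R}}=\boldsymbol{Y}\boldsymbol{Y}^{\mathrm{T}}$ is the autocorrelation matrix, R′ is the inverse of the autocorrelation matrix $\hat{\boldsymbol{R}}$, R′_{∂_i, ∂_i} is the inverse autocorrelation matrix R′ with row i and column i deleted, and the hat (^) denotes an estimated value.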
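The regression in Eq. (1) and Eq. (2) can be sketched in a few lines of NumPy. This is only an illustrative sketch under stated assumptions, not the authors' implementation: it assumes that z_i holds the Q pixel values of band i, that Z_{∂_i} holds the remaining bands, and that the autocorrelation matrix is formed from these band columns. The function name `estimate_noise` is hypothetical.

```python
import numpy as np


def estimate_noise(Y):
    """Estimate per-band noise by regressing each band on all other bands.

    Y: array of shape (L, Q), L bands by Q pixels (assumed layout).
    Returns an array of the same shape holding the regression residuals.
    """
    Z = np.asarray(Y, dtype=float).T          # (Q, L): column i is band i
    L = Z.shape[1]
    R = Z.T @ Z                               # autocorrelation matrix (L, L)
    R_inv = np.linalg.inv(R)                  # R' in the text
    noise = np.zeros(Z.shape)
    for i in range(L):
        rest = np.delete(np.arange(L), i)     # indices of the other bands
        # Bracketed term of Eq. (2): inverse of R with row/column i removed,
        # obtained from blocks of R' instead of a fresh matrix inversion.
        A = (R_inv[np.ix_(rest, rest)]
             - np.outer(R_inv[rest, i], R_inv[i, rest]) / R_inv[i, i])
        beta = A @ R[rest, i]                 # regression vector of band i
        noise[:, i] = Z[:, i] - Z[:, rest] @ beta   # residual, Eq. (1)
    return noise.T
```

The block formula avoids inverting a separate (L-1)×(L-1) matrix for every band; the result should match an ordinary least squares fit of each band on the others.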

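Next, the signal subspace is estimated from the estimated sparse noise vectors $\hat{\boldsymbol{\xi}}_i$. The autocorrelation matrix $\hat{\boldsymbol{R}}_Y$ of the original hyperspectral image Y is computed; the autocorrelation matrix $\hat{\boldsymbol{R}}_\xi$ of the sparse noise vectors ξ_i is computed in the same way, as is the autocorrelation matrix $\hat{\boldsymbol{R}}_x$ of the true-signal spectral vectors x, whose eigenvectors E are then computed. The space containing the hyperspectral image is decomposed into a k-dimensional subspace E_k and the subspace E_γ, where E_γ is the orthogonal complement of E_k. Let U_k be the projection matrix onto E_k and U_γ the projection matrix onto E_γ, and let $\hat{\boldsymbol{x}}_k$≡U_ky be the projection of the observed spectral vector y onto E_k. The minimum mean square error between $\hat{\boldsymbol{x}}_k$ and the original spectral vector x is used as the criterion for estimating the subspace^[21], which gives: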

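$ K=(\hat{k}, \hat{\pi})=\underset{k, \pi}{\operatorname{argmin}}\left\{\operatorname{tr}\left(\boldsymbol{U}_{\gamma} \hat{\boldsymbol{R}}_{\boldsymbol{Y}}\right)+2 \operatorname{tr}\left(\boldsymbol{U}_k \hat{\boldsymbol{R}}_{\boldsymbol{\xi}}\right)\right\} $

(3)

where K is the setting term, $\hat{k}$ is the number of negative values taken by $\delta=\operatorname{tr}(\boldsymbol{U}_{\gamma} \hat{\boldsymbol{R}}_{\boldsymbol{Y}})+2 \operatorname{tr}(\boldsymbol{U}_k \hat{\boldsymbol{R}}_{\boldsymbol{\xi}})$, tr denotes the trace of a matrix, $\hat{\pi}$ is the permutation term, and π is a permutation of the bands 1 to i. The subset of the eigenvectors E for which δ is negative is taken as the estimated subspace $\hat{\boldsymbol{S}}$; specifically, $\hat{\boldsymbol{S}}$ can be retrieved from the eigenvectors corresponding to $\hat{k}$ and $\hat{\pi}$^[21].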
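The selection rule around Eq. (3) can be illustrated with a short NumPy sketch. It is a sketch under assumptions rather than the authors' code: following the subspace identification approach of reference [21], it evaluates one term δ_j = -e_j^T R_Y e_j + 2 e_j^T R_ξ e_j per eigenvector e_j of the signal autocorrelation matrix and keeps the eigenvectors whose term is negative. The autocorrelation matrices are scaled by 1/Q here, and the function names are hypothetical.

```python
import numpy as np


def estimate_signal_subspace(Y, noise):
    """Keep the eigenvectors that lower the mean square error criterion.

    Y, noise: arrays of shape (L, Q), bands by pixels (assumed layout).
    Returns an (L, k) orthonormal basis S of the estimated subspace.
    """
    Y = np.asarray(Y, dtype=float)
    noise = np.asarray(noise, dtype=float)
    Q = Y.shape[1]
    X = Y - noise                          # estimate of the noise-free signal
    R_y = Y @ Y.T / Q                      # autocorrelation of the observations
    R_n = noise @ noise.T / Q              # autocorrelation of the noise
    R_x = X @ X.T / Q                      # autocorrelation of the signal
    _, E = np.linalg.eigh(R_x)             # columns are orthonormal eigenvectors
    p = np.sum(E * (R_y @ E), axis=0)      # e_j^T R_y e_j for every column j
    sigma2 = np.sum(E * (R_n @ E), axis=0) # e_j^T R_n e_j for every column j
    delta = -p + 2.0 * sigma2
    order = np.argsort(delta)              # most negative term first
    keep = order[delta[order] < 0]
    return E[:, keep]


def project_onto_subspace(Y, S):
    """Reconstruct (denoise) the image by projecting it onto span(S)."""
    return S @ (S.T @ np.asarray(Y, dtype=float))
```

On synthetic data with a three-dimensional signal and weak noise, `estimate_signal_subspace` is expected to return three basis vectors, and `project_onto_subspace` then gives the reconstructed, denoised image used in the later steps.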

1.2. Spectral angle mapping

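SAM was proposed by KRUSE et al. in 1993^[22]. It treats the spectrum of each pixel in an image as a high-dimensional vector and measures the similarity between spectra by the angle between two vectors: the smaller the angle, the more similar the two spectra and the more likely they belong to the same land-cover class, so the class of unknown data can be identified from the size of the spectral angle. For classification, the spectral angle between the unknown data and the known data is computed, and the unknown data is assigned to the class with the smallest spectral angle, as follows: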

$ \cos \alpha=\frac{A \cdot B}{|A||B|}=\frac{\sum\limits_{i=1}^L A_i B_i}{\sqrt{\sum\limits_{i=1}^L A_i A_i} \sqrt{\sum\limits_{i=1}^L B_i B_i}} $

(4)

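where L is the number of bands, A and B are the attribute values of the two spectral vectors over the L bands, and α is the spectral angle. The smaller the angle, the larger the cosine; conversely, the larger the angle, the smaller the cosine.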
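Eq. (4) and the minimum-angle assignment rule can be written as a minimal Python sketch. The function names and the reference spectra in the usage note are illustrative assumptions, not part of the original method description.

```python
import numpy as np


def spectral_angle(a, b):
    """Return the spectral angle (radians) between two spectra, Eq. (4)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    cos_alpha = a @ b / (np.linalg.norm(a) * np.linalg.norm(b))
    # Clip to guard against rounding slightly outside [-1, 1].
    return np.arccos(np.clip(cos_alpha, -1.0, 1.0))


def classify_by_sam(pixel, references):
    """Assign the pixel to the reference class with the smallest angle.

    references: dict mapping a class label to its reference spectrum.
    """
    return min(references, key=lambda label: spectral_angle(pixel, references[label]))
```

For example, with hypothetical three-band references `{"water": [1.0, 0.0, 0.0], "scrub": [0.0, 1.0, 1.0]}`, the pixel `[0.1, 0.9, 1.0]` is assigned to `"scrub"` because its angle to that reference is the smaller one.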

4. Conclusion

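Traditional supervised false label detection algorithms detect too many false labels, which lowers subsequent classification accuracy. To address this problem, a false label detection algorithm for hyperspectral images based on low rank sparse representation and improved spectral angle mapping is proposed. The low-rank component of the original hyperspectral image is extracted, and a normalized spectral angle mapping algorithm is used to compute spectral similarity. Compared with other false label detection algorithms, the proposed algorithm removes the mixed noise in the original hyperspectral image and lowers the threshold λρ_j, which reduces the number of false labels. Experimental results show that the algorithm improves classification accuracy over other advanced false label detection algorithms.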
