Remote sensing image classification based on dual-channel deep dense feature fusion

ZHANG Yanyue; ZHANG Baohua; ZHAO Yunfei; LÜ Xiaoqi; GU Yu; LI Jianjun

doi:10.7510/jgjs.issn.1001-3806.2021.01.013

1. School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China
2. Inner Mongolia Key Laboratory of Pattern Recognition and Intelligent Image Processing, Inner Mongolia University of Science and Technology, Baotou 014010, China
3. School of Information Engineering, Inner Mongolia University of Technology, Hohhot 010051, China

Author biography: ZHANG Yanyue (1994-), female, master's student, currently working mainly on remote sensing image processing.

Corresponding author: ZHANG Baohua, zbh_wj2004@imust.cn

Funding:
National Natural Science Foundation of China 61841204
Outstanding Youth Cultivation Fund of Inner Mongolia Autonomous Region 2018JQ02
National Natural Science Foundation of China 61962046
Scientific Research Project of Higher Education Institutions of Inner Mongolia Autonomous Region NJZY145
2019 Postgraduate Research Innovation Project of Inner Mongolia Autonomous Region S20191187Z
Natural Science Foundation of Inner Mongolia Autonomous Region 2015MS0604
National Natural Science Foundation of China 61663036

CLC number: TP753

Abstract: To make more effective use of features in remote sensing scene classification and thereby improve classification accuracy, a remote sensing image classification method based on dual-channel deep dense feature fusion was adopted, and theoretical analysis and experimental verification were carried out. First, a composite dense convolutional network model was constructed to extract the convolutional-layer features and the fully connected layer features of the image separately. Then, to mine and exploit the deep information of the image, the extracted deep convolutional-layer features were recombined and encoded with the bag of visual words model to capture the deep local features of the image. Finally, the local and global features were fused by linear weighting and classified. In experiments on the UC Merced Land-Use and NWPU-RESISC45 datasets, the classification accuracy was 93.81% and 92.62%, respectively. The method makes full use of the complementarity of local and global features and can fully exploit and express the deep information of the image.

Key words: image processing; remote sensing image classification; feature fusion; dense convolutional network; bag of visual words

Figure 1. Dense block structure


Figure 2. Feature reorganization


Figure 3. Algorithm structure


Figure 4. UC Merced Land-Use dataset image sample display


Figure 5. NWPU dataset image sample display


Figure 6. The confusion matrix obtained by our algorithm on the UC Merced Land-Use dataset


Figure 7. Scene classification accuracy of the three compared methods on UC Merced Land-Use


Table 1. Extended DenseNet-40 structure

layers	input size/pixel	kernel size	number of convolution kernels	pool	stride	output size/pixel
input
convolution	256×256	3×3	72	—	—	256×256
max pooling	256×256	1×1	—	2×2	2	128×128
dense block 1	128×128	3×3	360	—	—	128×128
transition layer 1	128×128	1×1	360	2×2	2	64×64
dense block 2	64×64	3×3	648	—	—	64×64
transition layer 2	64×64	1×1	648	2×2	2	32×32
dense block 3	32×32	3×3	936	—	—	32×32
transition layer 3	32×32	1×1	936	2×2	2	16×16
dense block 4	16×16	3×3	1224	—	—	16×16
transition layer 4	16×16	1×1	1224	2×2	2	8×8
dense block 5	8×8	3×3	1512	—	—	8×8
classification layer	1×1	—	1	—	—	1×1
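
The channel counts and output sizes in Table 1 follow a regular pattern: each dense block adds 288 kernels to the running total, transition layers keep the count unchanged, and every 2×2 pooling step with stride 2 halves the spatial size. The sketch below reproduces those columns. The split of 288 into 12 layers per block with a growth rate of 24 is an assumption made for illustration (the table gives only the totals), and the function name `densenet_channel_plan` is hypothetical.

```python
def densenet_channel_plan(initial=72, layers_per_block=12, growth_rate=24,
                          num_blocks=5, input_size=256):
    """Return (name, channels, spatial size) for each dense block.

    layers_per_block and growth_rate are assumed values; only their
    product (288 new channels per block) is implied by Table 1.
    """
    rows = []
    channels = initial
    size = input_size // 2  # the max pooling row halves 256 to 128
    for block in range(1, num_blocks + 1):
        # Dense connectivity concatenates every layer's output,
        # so a block adds layers_per_block * growth_rate channels.
        channels += layers_per_block * growth_rate
        rows.append((f"dense block {block}", channels, size))
        if block < num_blocks:
            size //= 2  # transition layer: 2x2 pooling with stride 2
    return rows


for name, channels, size in densenet_channel_plan():
    print(f"{name}: {channels} channels, {size}x{size}")
```

With the default arguments the channel totals and sizes agree with the dense block rows of Table 1; any other factorization of 288 would give the same totals.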


Table 2. Effect of the two input image scales on classification accuracy

input size	accuracy on UC Merced Land-Use/%	accuracy on NWPU-RESISC45/%
32 pixel × 32 pixel	89.05	88.91
256 pixel × 256 pixel	91.29	90.24
fusion network	93.81	92.62


Table 3. Classification accuracy of the five methods on the two datasets

algorithm	accuracy on UC Merced Land-Use/%	accuracy on NWPU-RESISC45/%
SVM-LDA^[19]	80.33	79.99
FK-S^[20]	91.36	91.45
MS-DCNN^[13]	91.34	90.03
PCA-CNN^[14]	92.86	91.78
proposed method	93.81	92.62


Table 4. Error rate of the five methods on the two datasets

algorithm	error on UC Merced Land-Use/%	error on NWPU-RESISC45/%
SVM-LDA^[19]	19.67	20.01
FK-S^[20]	8.64	8.55
MS-DCNN^[13]	8.66	9.97
PCA-CNN^[14]	7.14	8.22
proposed method	6.19	7.38
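
The error rates in Table 4 are the complements of the accuracies in Table 3, that is, error = 100% − accuracy. The short check below recomputes them from the Table 3 values. The dictionary and function names are illustrative only.

```python
# Accuracy values (%) copied from Table 3:
# (UC Merced Land-Use, NWPU-RESISC45).
ACCURACY = {
    "SVM-LDA": (80.33, 79.99),
    "FK-S": (91.36, 91.45),
    "MS-DCNN": (91.34, 90.03),
    "PCA-CNN": (92.86, 91.78),
    "proposed method": (93.81, 92.62),
}

# Error values (%) copied from Table 4, in the same order.
REPORTED_ERROR = {
    "SVM-LDA": (19.67, 20.01),
    "FK-S": (8.64, 8.55),
    "MS-DCNN": (8.66, 9.97),
    "PCA-CNN": (7.14, 8.22),
    "proposed method": (6.19, 7.38),
}


def error_rates(accuracy):
    """Error rate in percent is 100 minus accuracy in percent."""
    return {name: tuple(round(100.0 - a, 2) for a in accs)
            for name, accs in accuracy.items()}


def tables_agree(computed, reported, tol=1e-6):
    """True if every recomputed error matches the reported one."""
    return all(abs(c - r) < tol
               for name in reported
               for c, r in zip(computed[name], reported[name]))


computed = error_rates(ACCURACY)
print("Table 4 consistent with Table 3:", tables_agree(computed, REPORTED_ERROR))
```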


[1]	QI Y F, MA Zh Y. Hyperspectral image classification method based on neighborhood spectra and probability cooperative representation[J]. Laser Technology, 2019, 43(4): 448-452(in Chinese).
[2]	GUAN Sh H, YANG G, LI H, et al. Hyperspectral image classification based on 3-D convolutional recurrent neural network[J]. Laser Technology, 2020, 44(4): 485-491(in Chinese).
[3]	ZHAO Sh. Remote sensing image classification method based on convolutional neural networks[D]. Beijing: China University of Geosciences, 2015: 35-41(in Chinese).
[4]	YI Y, NEWSAM S. Bag-of-visual-words and spatial extensions for land-use classification[EB/OL]. (2010-11-10)[2020-06-22]. https://www.sci-hub.ren/10.1145/1869790.1869829.
[5]	ZHAO L, TANG P, HUO L. Land-use scene classification using a concentric circle-structured multi-scale bag-of-visual-words model[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2014, 7(12): 4620-4631. doi: 10.1109/JSTARS.2014.2339842
[6]	YI Y, NEWSAM S. Spatial pyramid co-occurrence for image classification[C]//IEEE International Conference on Computer Vision. New York, USA: IEEE, 2011: 1465-1472.
[7]	GAO D D, ZHANG X Sh. Image saliency detection based on spatial convolutional neural network model[J]. Computer Engineering, 2018, 44(5): 240-245(in Chinese).
[8]	WANG F, ZHANG Y, ZHANG D P, et al. Application research of convolutional neural network based on shortcut in face recognition[J]. Journal of Electronic Measurement and Instrument, 2018, 32(4): 80-86(in Chinese).
[9]	LI Ch Ch, DU W B, MA X X, et al. SAR image segmentation based on improved MRF model[J]. Remote Sensing Information, 2017, 32(5): 85-89(in Chinese).
[10]	BIAN X Y, FEI X J, M N. Remote sensing image scene classification based on scale-attention network[J]. Journal of Computer Applications, 2020, 40(3): 872-877(in Chinese).
[11]	ZHOU Y, YE Q, QIU Q, et al. Oriented response networks[C]//Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 4961-4970.
[12]	LUAN S, CHEN C, ZHANG B, et al. Gabor convolutional networks[J]. IEEE Transactions on Image Processing, 2018, 27(9): 4357-4366. doi: 10.1109/TIP.2018.2835143
[13]	XU S H, MU X D, ZHAO P, et al. Scene classification of remote sensing image based on multi-scale feature and deep neural network[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(7): 834-840(in Chinese).
[14]	HE X F, ZOU Zh R, TAO Ch, et al. High-resolution image scene classification based on joint significance and multi-layer convolutional neural networks[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(9): 1073-1080(in Chinese).
[15]	CHEN Y Q, QIANG Zh P, CHEN X, et al. Classification of land use scenarios based on fine-tuning convolution neural network[J]. Remote Sensing Information, 2019, 34(3): 70-77(in Chinese).
[16]	LI E, XIA J, DU P, et al. Integrating multilayer features of convolutional neural networks for remote sensing scene classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(10): 5653-5665. doi: 10.1109/TGRS.2017.2711275
[17]	HUANG G, LIU Z, van der MAATEN L, et al. Densely connected convolutional networks[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 2261-2269.
[18]	HE K, ZHANG X, REN S, et al. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification[C]//International Conference on Computer Vision. New York, USA: IEEE, 2015: 1026-1034.
[19]	ZHANG F, DU B, ZHANG L. Saliency-guided unsupervised feature learning for scene classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2015, 53(4): 2175-2184. doi: 10.1109/TGRS.2014.2357078
[20]	ZHAO B, ZHONG Y, ZHANG L, et al. The Fisher kernel coding framework for high spatial resolution scene classification[J]. Remote Sensing, 2016, 8(2): 157.



Introduction

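Remote sensing image classification^[1-2] is an indispensable part of the remote sensing field and is widely used in land resource management, urban planning and design, meteorological observation, and the monitoring of environmental change and natural disasters. To reflect the complex spatial structure of the ground, the rich geographic information in remote sensing images must be fully exploited, but accurately extracting effective information from remote sensing images and expressing it appropriately still faces many challenges^[3].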

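How image features are extracted is crucial in image classification tasks, and its effect on the experimental results cannot be ignored. Low- and mid-level methods extract semantic features of images, for example the bag of visual words (BOVW) model^[4]. Many models based on the bag of visual words followed, such as the improved concentric circle-structured multi-scale BOVW (CCM-BOVW) model^[5], which uses feature combinations to describe the spatial information of visual words, but its feature representation is weak, which limits classification accuracy. The spatial pyramid co-occurrence kernel (SPCK)^[6] characterizes the photometric and geometric properties of an image by capturing the absolute and relative spatial arrangement of visual words, but it relies on low-level local features, so its classification results are poor. Most classical methods are based on hand-crafted or shallow learning algorithms, and the low- and mid-level semantic features they extract are limited in descriptive power, which makes it difficult to improve classification accuracy further.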
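
As a concrete illustration of the bag of visual words idea, the sketch below clusters local descriptors into a small codebook with plain k-means and then encodes one image as a normalized histogram of visual-word counts. It is a minimal, standard-library sketch and not the implementation used in the paper: the function names, the codebook size and the random descriptors are all assumptions. In the method described here, each descriptor would stand for a feature vector taken from a deep convolutional layer.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(vec, centers):
    """Index of the visual word (cluster centre) closest to vec."""
    return min(range(len(centers)), key=lambda j: sq_dist(vec, centers[j]))


def build_codebook(descriptors, k, iters=10, seed=0):
    """Cluster descriptors with k-means; the centres are the visual words."""
    rng = random.Random(seed)
    centers = [list(d) for d in rng.sample(descriptors, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for d in descriptors:
            groups[nearest(d, centers)].append(d)
        for j, members in enumerate(groups):
            if members:  # keep the old centre if a cluster is empty
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return centers


def bovw_histogram(descriptors, centers):
    """Encode one image as the normalized count of each visual word."""
    counts = [0] * len(centers)
    for d in descriptors:
        counts[nearest(d, centers)] += 1
    total = sum(counts)
    return [c / total for c in counts]


# Toy data: 200 random 8-dimensional descriptors (placeholders only).
rng = random.Random(1)
training_descriptors = [[rng.gauss(0, 1) for _ in range(8)] for _ in range(200)]
codebook = build_codebook(training_descriptors, k=4)

# Treat the first 50 descriptors as belonging to one image.
histogram = bovw_histogram(training_descriptors[:50], codebook)
print(histogram)
```

The histogram has one entry per visual word and sums to 1, so images with different numbers of descriptors map to vectors of the same length.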

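In recent years, deep learning has become the main approach in computer vision recognition and has been applied successfully to object recognition, for example saliency methods based on spatial convolution^[7] for object detection, shortcut convolutional neural networks^[8] for face recognition, an improved Markov model^[9] for segmenting synthetic aperture radar (SAR) images, and deep learning for remote sensing image classification^[10]. However, training convolutional neural networks (CNN) for image classification requires large-scale labeled data. Because images themselves carry complex information, image data with a single label are scarce, and manual annotation costs time and effort, so the shortage of labeled data becomes a factor that limits classification accuracy. On this basis, convolutional neural networks and a series of their improvements^[11-12] have been used to address the problem: multi-scale images^[13] are fed to the input to produce rich feature information, the features are encoded with several coding schemes, and the result is passed to a classifier. The method that combines a joint saliency algorithm with a convolutional neural network^[14] to sample remote sensing images recognizes categories with small scene differences poorly. The method that combines fine-tuning with a convolutional neural network model to extract image features^[15] effectively handles intra-class variation and inter-class similarity in remote sensing scene classification, but it weakens the expression of local information. LI et al.^[16] proposed a multi-scale Fisher coding method to build a mid-level representation of deep convolutional features and used principal component analysis to fuse the mid-level features extracted from the convolutional layers with the features of the fully connected layers, but that method focuses on mid-level features and does not use higher-level features.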

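To make full use of the rich scene information contained in an image, this paper proposes a classification method that extracts and fuses features with a dual-channel deep dense network. First, the DenseNet-40 structure of the dense convolutional network (DenseNet) is modified to fit the original scale of the remote sensing images. Second, the modified DenseNet-40 represents the global information of the remote sensing image, while the original DenseNet-40 represents its local information. Then the BOVW model is used to recombine and encode the deep local features. Finally, exploiting the complementarity of local and global features, the features from the layers of the dense networks are fused by weighting, so that the fused features carry deeper semantic information, which helps improve classification accuracy.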
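
The final fusion step can be pictured as a convex combination of the outputs of the two channels. The sketch below is one possible reading, under stated assumptions: it fuses per-class scores (not raw feature vectors), it uses a single weight `alpha`, and both the weight value and the function names are hypothetical. The source text says only that local and global features are fused by linear weighting and does not give the exact formula or weights.

```python
def fuse_scores(global_scores, local_scores, alpha=0.5):
    """Linear weighted fusion: alpha * global + (1 - alpha) * local.

    alpha is a hypothetical weight in [0, 1]; the paper's actual
    weighting scheme is not stated in the text available here.
    """
    if len(global_scores) != len(local_scores):
        raise ValueError("both channels must score the same classes")
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return [alpha * g + (1.0 - alpha) * l
            for g, l in zip(global_scores, local_scores)]


def predict(scores):
    """Index of the class with the highest fused score."""
    return max(range(len(scores)), key=scores.__getitem__)


# Toy three-class example: the channels disagree on the top class.
global_channel = [0.5, 0.3, 0.2]   # favours class 0
local_channel = [0.2, 0.6, 0.2]    # favours class 1
fused = fuse_scores(global_channel, local_channel, alpha=0.5)
print(fused, predict(fused))
```

In this toy case the global channel alone would pick class 0, while the equally weighted fusion picks class 1 because the local channel is more confident. In practice the weight would be chosen on held-out data.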

3. Conclusion

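Because DenseNet classifies well on the ImageNet image recognition dataset, a dual-channel feature fusion network was designed on the basis of DenseNet. It exploits the strong representational power of the features at each layer to extract more image features, and because the network guarantees efficient feature reuse, it raises the utilization of features at every layer. To retain more of the useful information in an image while removing redundant information, the algorithm begins with preprocessing operations such as data augmentation. The experimental results show that the fusion model classifies well on both datasets. Future work could introduce multi-layer feature fusion to improve the recognition and classification of targets.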


