Remote sensing image classification based on dual-channel deep dense feature fusion

ZHANG Yanyue; ZHANG Baohua; ZHAO Yunfei; LÜ Xiaoqi; GU Yu; LI Jianjun

doi:10.7510/jgjs.issn.1001-3806.2021.01.013

Volume 45 Issue 1

Jan. 2021

Article Contents

Turn off MathJax

Article Navigation > LASER TECHNOLOGY > 2021 > 45(1): 73-79

Citation:

Remote sensing image classification based on dual-channel deep dense feature fusion

1.
.School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China
2.
Inner Mongolia Key Laboratory of Pattern Recognition and Intelligent Image Processing, Inner Mongolia University of Science and Technology, Baotou 014010, China
3.
School of Information Engineering, Inner Mongolia University of Technology, Hohhot 010051, China

Corresponding author: ZHANG Baohua, zbh_wj2004@imust.cn ;
Received Date: 2019-12-27
Accepted Date: 2020-03-09

Abstract

In order to improve the effective utilization of features in remote sensing image scene classification and achieve the purpose of improving the accuracy of remote sensing image classification, a remote sensing image classification method based on dual-channel depth-dense feature fusion was used for theoretical analysis and experimental verification. First, the image convolution layer features and fully connected layer features was separated extracted by constructing a composite dense convolutional network model. In order to exploit the deep information of the image, the deep convolutional layer features extracted by the model were recombined and encoded by the bag of visual words to capture the deep local features of the image. Finally the linear and weighted methods were used to fuse local and global features and then classify them. The results show that using the datasets UC Merced Land-Use and NWPU-RESISC45 for experiments, the classification accuracy obtained is 93.81% and 92.62%, respectively. This method makes full use of the complementarity of local features and global features to achieve the full expression of deep image information.
- image processing,
- classification of remote sensing images,
- feature fusion,
- dense convolutional network,
- bag of visual words

References

[1]	QI Y F, MA Zh Y. Hyperspectral image classification method based on neighborhood spectra and probability cooperative representation[J]. Laser Technology, 2019, 43(4): 448-452(in Chinese).
[2]	GUAN Sh H, YANG G, LI H, et al. Hyperspectral image classification based on 3-D convolutional recurrent neural network[J]. Laser Technology, 2020, 44(4): 485-491(in Chinese).
[3]	ZHAO Sh. Remote sensing image classification method based on convolutional neural networks[D]. Beijing: China University of Geosciences, 2015: 35-41(in Chinese).
[4]	YI Y, NEWSAM S. Bag-of-visual-words and spatial extensions for land-use classification[EB/OL]. (2010-11-10)[2020-06-22].https://www.sci-hub.ren/10.1145/1869790.1869829.
[5]	ZHAO L, TANG P, HUO L. Land-use scene classification using a concentric circle-structured multi-scale bag-of-visual-words mode[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2014, 7(12): 4620-4613. doi: 10.1109/JSTARS.2014.2339842
[6]	YI Y, NEWSAM S. Spatial pyramid co-occurrence for image classification[C]//IEEE International Conference on Computer Vision.New York, USA: IEEE, 2011: 1465-1472.
[7]	GAO D D, ZHANG X Sh. Image saliency detection based on spatial convolutional neural network model[J]. Computer Engineering, 2018, 44(5): 240-245(in Chinese).
[8]	WANG F, ZHANG Y, ZHANG D P, et al. Application research of convolutional neural network based on shortcut in face recognition[J]. Journal of Electronic Measurement and Instrument, 2018, 32(4): 80-86(in Chinese).
[9]	LI Ch Ch, DU W B, MA X X, et al. SAR image segmentation based on improved MRF model[J]. Remote Sensing Information, 2017, 32(5): 85-89(in Chinese).
[10]	BIAN X Y, FEI X J, M N. Remote sensing image scene classification based on scale-attention network[J]. Journal of Computer A-pplications, 2020, 40(3): 872-877(in Chinese).
[11]	ZHOU Y, YE Q, QIU Q, et al. Oriented response networks[C]//Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 4961-4970.
[12]	LUAN S, CHEN C, ZHANG B, et al. Gabor convolutional networks[J]. IEEE Transactions on Image Processing, 2018, 27(9): 4357-4366. doi: 10.1109/TIP.2018.2835143
[13]	XU S H, MU X D, ZHAO P, et al. Scene classification of remote sensing image based on multi-scale feature and deep neural network[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(7): 834-840(in Chinese).
[14]	HE X F, ZOU Zh R, TAO Ch, et al. High-resolution image scene classification based on joint significance and multi-layer convolutional neural networks[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(9): 1073-1080(in Chinese).
[15]	CHEN Y Q, QIANG Zh P, CHEN X, et al. Classification of land use scenarios based on fine-tuning convolution neural network[J]. Remote Sensing Information, 2019, 34(3): 70-77(in Chinese).
[16]	LI E, XIA J, DU P, et al. Integrating multilayer features of convolutional neural networks for remote sensing scene classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(10): 5653-5665. doi: 10.1109/TGRS.2017.2711275
[17]	HUANG G, LIU Z, der MAATEN L V, et al. Densely connected convolutional networks[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 2261-2269.
[18]	HE K, ZHANG X, REN S, et al. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification[C]//International Conference on Computer Vision. New York, USA: IEEE, 2015: 1026-1034.
[19]	ZHANG F, DU B, ZHANG L. Saliency-guided unsupervised feature learning for scene classification[J]. IEEE Transactions on Geoscienceand Remote Sensing, 2015, 53(4): 2175-2184. doi: 10.1109/TGRS.2014.2357078
[20]	ZHAO B, ZHONG Y, ZHANG L, et al. The Fisher kernel coding framework for high spatial resolution scene classification[J]. Remote Sensing, 2016, 8(2): 157.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(7) / Tables(4)

Get Citation

PDF

XML

Article views(5487) PDF downloads(18) Cited by()

Proportional views

HTML

引言

遥感影像分类^[1-2]是遥感领域不可或缺的一部分，被广泛应用在土地资源管理、城市设计规划、气象观测、环境及自然灾害的变化监测等领域。为了反映地面复杂的空间结构，需要充分利用遥感图像中的丰富地理信息，但目前从遥感图像中精确提取有效信息并适度表达, 还面临诸多挑战^[3]。

图像分类任务中如何提取图片特征至关重要，而且对于实验结果的影响不容忽略。中低层次上的方法提取图像语义特征，如视觉词袋模型(bag of visual words，BOVW)^[4]，在此基础上出现众多基于视觉词袋的模型，如改进的同心圆多尺度结构视觉词袋(concentric circle-structured multi-scale BOVW，CCM-BOVW)^[5]模型，该模型利用特征组合描述视觉词的空间信息，但特征表达能力较弱，影响分类精度；共线性核以一种空间金字塔形式(spatial pyramid co-occurrence kernel，SPCK)^[6]，通过捕获单词的绝对和相对空间排列表征图像的光度和几何特征，但采用的是底层局部特征导致其分类结果不佳。大多数经典方法是基于人工或浅层学习的算法，而且提取的中低级语义特征在描述能力上受到限制，难以进一步提高分类准确性。

近些年，深度学习方法作为计算机视觉识别领域的主要方法已成功应用于目标识别，取得巨大成功，如空间卷积的显著性方法^[7]检测物体、捷径卷积神经网络^[8]识别人脸、改进马尔可夫模型^[9]分割合成孔径雷达(synthetic aperture radar，SAR)图像，以及深度学习的方式用于遥感图像分类^[10]等。但是，训练卷积神经网络(convolutional neural networks，CNN)处理图像分类任务时，大规模标签数据是前提。由于图像本身的信息较复杂，导致目前单一标签的图像数据较少，而人工标注又耗时间和精力，因此成为图像分类精度的一个影响因素。在此基础上，卷积神经网络及其一系列改进^[11-12]用于解决上述问题，将多尺度图像^[13]送到输入端，产生图像的丰富特征信息，然后用多种编码方式对特征编码，最后输入分类器分类。利用联合显著性算法与卷积神经网络结合的方法^[14]对遥感图像采样，对于图像场景差异小的类别识别效率低。将微调和卷积神经网络模型结合提取图像特征^[15]的方法有效解决了遥感图像场景分类的相同类内差异和不同类间相似性的问题，但同时局部信息的表达被减弱。LI等人^[16]提出一种多尺度费舍尔编码方法来构建卷积深度特征的中层特征表示，通过主成分分析方法融合了从卷积层中提取的中层特征和全连通层的特征，但该方法着重于中层特征的表达，没有利用更高层次的特征。

为充分利用图像中包含的丰富场景信息，本文中提出一种用双通道深度密集网络提取特征并融合的分类方法。首先，改变深层密集卷积神经网络(dense convolutional network，DenseNet)的DenseNet-40网络结构，使其适应原遥感图像尺度大小，其次通过改进的DenseNet-40表征遥感图像的全局信息，原DenseNet-40表征遥感图像的局部信息，然后利用BOVW模型对深层局部特征进行重组编码，最后，利用局部特征和全局特征的互补性，将密集网络的各层特征加权融合，使融合后特征带有更深层次语义信息，利于改善分类准确率。

3. 结论

由于在图像识别ImageNet数据集上运用DenseNet有很好的分类效果，因此使用DenseNet网络设计双通道特征融合网络，利用网络中各层特征较强的表示能力，提取更多图片的特征，并且该网络特点是能保证高效的特征复用，因此提高了各层的特征利用率。为了增加保留图片的有用信息的同时去除冗余信息的能力，算法最初设计数据增强等预处理操作。从实验结果可知，该融合模型在两个数据集上表现出了良好的分类性能。后续研究中，可尝试引入多层特征融合的方法来提高目标的识别分类。

Reference (20)

[1]	QI Y F, MA Zh Y. Hyperspectral image classification method based on neighborhood spectra and probability cooperative representation[J]. Laser Technology, 2019, 43(4): 448-452.
[2]	GUAN Sh H, YANG G, LI H. Hyperspectral image classification based on 3-D convolutional recurrent neural network[J]. Laser Technology, 2020, 44(4): 485-491.
[3]	ZHAO Sh. Remote sensing image classification method based on convolutional neural networks[D]. Beijing: China University of Geosciences, 2015: 35-41(in Chinese).
[4]	YI Y, NEWSAM S. Bag-of-visual-words and spatial extensions for land-use classification[EB/OL]. (2010-11-10)[2020-06-22].https://www.sci-hub.ren/10.1145/1869790.1869829.
[5]	ZHAO L, TANG P, HUO L. Land-use scene classification using a concentric circle-structured multi-scale bag-of-visual-words mode[J]. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2014, 7(12): 4620-4613. doi: 10.1109/JSTARS.2014.2339842
[6]	YI Y, NEWSAM S. Spatial pyramid co-occurrence for image classification[C]//IEEE International Conference on Computer Vision.New York, USA: IEEE, 2011: 1465-1472.
[7]	GAO D D, ZHANG X Sh. Image saliency detection based on spatial convolutional neural network model[J]. Computer Engineering, 2018, 44(5): 240-245.
[8]	WANG F, ZHANG Y, ZHANG D P. Application research of convolutional neural network based on shortcut in face recognition[J]. Journal of Electronic Measurement and Instrument, 2018, 32(4): 80-86.
[9]	LI Ch Ch, DU W B, MA X X. SAR image segmentation based on improved MRF model[J]. Remote Sensing Information, 2017, 32(5): 85-89.
[10]	BIAN X Y, FEI X J, M N. Remote sensing image scene classification based on scale-attention network[J]. Journal of Computer A-pplications, 2020, 40(3): 872-877.
[11]	ZHOU Y, YE Q, QIU Q, et al. Oriented response networks[C]//Proceedings of the 2017 International Conference on Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 4961-4970.
[12]	LUAN S, CHEN C, ZHANG B. Gabor convolutional networks[J]. IEEE Transactions on Image Processing, 2018, 27(9): 4357-4366. doi: 10.1109/TIP.2018.2835143
[13]	XU S H, MU X D, ZHAO P. Scene classification of remote sensing image based on multi-scale feature and deep neural network[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(7): 834-840.
[14]	HE X F, ZOU Zh R, TAO Ch. High-resolution image scene classification based on joint significance and multi-layer convolutional neural networks[J]. Acta Geodaetica et Cartographica Sinica, 2016, 45(9): 1073-1080.
[15]	CHEN Y Q, QIANG Zh P, CHEN X. Classification of land use scenarios based on fine-tuning convolution neural network[J]. Remote Sensing Information, 2019, 34(3): 70-77.
[16]	LI E, XIA J, DU P. Integrating multilayer features of convolutional neural networks for remote sensing scene classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2017, 55(10): 5653-5665. doi: 10.1109/TGRS.2017.2711275
[17]	HUANG G, LIU Z, der MAATEN L V, et al. Densely connected convolutional networks[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 2261-2269.
[18]	HE K, ZHANG X, REN S, et al. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification[C]//International Conference on Computer Vision. New York, USA: IEEE, 2015: 1026-1034.
[19]	ZHANG F, DU B, ZHANG L. Saliency-guided unsupervised feature learning for scene classification[J]. IEEE Transactions on Geoscienceand Remote Sensing, 2015, 53(4): 2175-2184. doi: 10.1109/TGRS.2014.2357078
[20]	ZHAO B, ZHONG Y, ZHANG L. The Fisher kernel coding framework for high spatial resolution scene classification[J]. Remote Sensing, 2016, 8(2): 157-.

layers	input size/pixel	kernel size	number of convolution kernels	pool	stride	output size/pixel
input
convolution	256×256	3×3	72	—	—	256×256
max pooling	256×256	1×1	—	2×2	2	128×128
dense block 1	128×128	3×3	360	—	—	128×128
transition layer 1	128×128	1×1	360	2×2	2	64×64
dense block 2	64×64	3×3	648	—	—	64×64
transition layer 2	64×64	1×1	648	2×2	2	32×32
dense block 3	32×32	3×3	936	—	—	32×32
transition layer 3	32×32	1×1	936	2×2	2	16×16
dense block 4	16×16	3×3	1224			16×16
transition layer 4	16×16	1×1	1224	2×2	2	8×8
dense block 5	8×8	3×3	1512	—	—	8×8
classification layer	1×1		1			1×1

input size	accuracy/%
input size	UC Merced Land-Use	NWPU-RE-SISC45
32pixel×32pixel	89.05	88.91
256pixel×256pixel	91.29	90.24
fusion network	93.81	92.62

algorithm	accuracy/%
algorithm	UC Merced Land-Use	NWPU-RE-SISC45
SVM-LDA^[19]	80.33	79.99
FK-S^[20]	91.36	91.45
MS-DCNN^[13]	91.34	90.03
PCA-CNN^[14]	92.86	91.78
proposed method	93.81	92.62

algorithm	error/%
algorithm	UC Merced Land-Use	NWPU-RE-SISC45
SVM-LDA^[19]	19.67	20.01
FK-S^[20]	8.64	8.55
MS-DCNN^[13]	8.66	9.97
PCA-CNN^[14]	7.14	8.22
proposed method	6.19	7.38

Remote sensing image classification based on dual-channel deep dense feature fusion

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Proportional views

Remote sensing image classification based on dual-channel deep dense feature fusion

Corresponding author: ZHANG Baohua, zbh_wj2004@imust.cn;

HTML

1.1. 密集网络

1.2. 局部特征提取

1.3. 全局特征提取

1.4. 基于双通道深度密集特征融合算法

2.1. 数据集介绍

2.2. 实验参量设置

2.3. 实验结果与分析

Catalog

Remote sensing image classification based on dual-channel deep dense feature fusion

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Proportional views

Remote sensing image classification based on dual-channel deep dense feature fusion

Corresponding author: ZHANG Baohua, zbh_wj2004@imust.cn;

HTML

1.1. 密集网络

1.2. 局部特征提取

1.3. 全局特征提取

1.4. 基于双通道深度密集特征融合算法

2.1. 数据集介绍

2.2. 实验参量设置

2.3. 实验结果与分析

Catalog

Export File

Citation

Format

Content