An airborne point cloud roof plane extraction algorithm based on deep learning

LI Jie; LI Qingqing; LI Li; LIU Zhao; SHEN Yang; TU Jingmin

doi:10.7510/jgjs.issn.1001-3806.2024.05.003

To accurately extract individual planes from the point clouds of various types of building roofs, metric learning was used to learn distinct high-dimensional deep features for the points on each plane, with each plane treated as a separate instance. The learned features were then used for a preliminary clustering of the plane points, and the remaining unclustered points were assigned to planes using a combined metric of Euclidean distance and feature-space distance. The proposed method was trained and tested on a synthetic dataset and on the publicly available airborne point cloud building roof dataset RoofN3D. On the synthetic dataset, the precision, recall, and F₁-score of the extracted building planes are 0.990, 0.998, and 0.994, respectively. On RoofN3D, they are 0.945, 0.971, and 0.957, respectively. The proposed method extracts the different roof planes accurately and effectively, and the edges of the extracted planes are accurate. It also distinguishes planar from non-planar roof content, which supports subsequent 3-D modeling of buildings.


0. Introduction

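Airborne light detection and ranging (LiDAR) point clouds make it possible to acquire dense, high-accuracy 3-D models of real urban scenes quickly and accurately, and are widely used in digital city construction, geological surveying and disaster assessment, 3-D navigation, and related fields^[1-5]. Buildings are among the most important and basic elements of the geographic environment, and reconstructing 3-D building models from airborne LiDAR point clouds is an active research topic. The roof is a key component of a building model, so fast and accurate roof plane extraction is a necessary and critical step in roof modeling^[6-8]; the accuracy of the extracted roof planes directly affects the accuracy of the final model. Because roofs of different building types vary in shape and have complex spatial structure, accurately and effectively extracting roof planes from building roof LiDAR point clouds remains challenging.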

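Traditional plane extraction methods mainly comprise region growing, model fitting, and clustering. Region growing algorithms^[5, 9] extract planes through seed point selection, nearest-neighbor search, and growth criteria. Points in flat regions, i.e. those with small curvature, are usually chosen as seeds. Neighbor relations between points are commonly obtained by building a k-D tree. Growth criteria are usually geometric, such as Euclidean distance and normal vectors^[10]. Region growing is strongly constrained by its growth rules and is prone to over-segmentation or under-segmentation. To improve segmentation accuracy, several methods refine the basic region growing scheme. WANG et al.^[11] added the color information of the 3-D point cloud to the growth criteria to make the segmentation more stable. ZHU et al.^[12] proposed a region growing algorithm that uses triangular facets as primitives to preliminarily group the triangles lying on the same building surface patch, and then combined it with random sample consensus (RANSAC) to complete the segmentation of building roof point clouds.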
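The basic region growing scheme described above can be illustrated with a minimal sketch. It is not the implementation of any cited method: unit normals are assumed to be precomputed, neighbors are found by brute force rather than with a k-D tree, and the names `region_grow`, `radius`, `max_angle_deg`, and `seed_order` are hypothetical. In practice `seed_order` would list points by increasing curvature.

```python
import math

def region_grow(points, normals, radius, max_angle_deg, seed_order=None):
    """Greedy region growing; returns one region label per point.

    points and normals are lists of 3-tuples; normals are assumed to be unit length.
    seed_order is a hypothetical hook for visiting low-curvature seeds first.
    """
    cos_thr = math.cos(math.radians(max_angle_deg))
    labels = [-1] * len(points)
    order = seed_order if seed_order is not None else range(len(points))
    current = 0
    for seed in order:
        if labels[seed] != -1:
            continue
        labels[seed] = current
        stack = [seed]
        while stack:
            i = stack.pop()
            for j in range(len(points)):  # brute force stands in for a k-D tree query
                if labels[j] != -1:
                    continue
                near = math.dist(points[i], points[j]) <= radius
                # abs() treats flipped normals as the same orientation
                dot = sum(a * b for a, b in zip(normals[i], normals[j]))
                if near and abs(dot) >= cos_thr:
                    labels[j] = current
                    stack.append(j)
        current += 1
    return labels

# Toy gable roof: two slopes that meet at x = 0.
s = 1 / math.sqrt(2)
left = [(-2, 0, 0), (-1, 0, 1), (-2, 1, 0), (-1, 1, 1)]   # z = x + 2
right = [(1, 0, 1), (2, 0, 0), (1, 1, 1), (2, 1, 0)]      # z = 2 - x
roof_points = left + right
roof_normals = [(-s, 0, s)] * 4 + [(s, 0, s)] * 4
roof_labels = region_grow(roof_points, roof_normals, radius=2.5, max_angle_deg=10.0)
```

With a radius large enough to bridge the ridge, the two slopes are still separated because the normal criterion rejects the cross-ridge neighbors. Loosening `max_angle_deg` or tightening `radius` shows how sensitive the result is to the growth rules.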

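Model fitting methods are mainly based on the 3-D Hough transform^[13] and the RANSAC algorithm^[14]. The Hough transform is a common feature detection technique in image processing. When applied to 3-D point clouds, the 3-D Hough transform is reasonably robust to non-planar noise points. However, when features are mapped from the original space to the parameter space, the complexity of the algorithm grows exponentially with the number of parameters, consuming large amounts of time and memory. Some methods^[15-17] therefore lower the computational cost by reducing the cost of voting. In addition, because the 3-D Hough transform may produce spurious planes on complex roof shapes, KANG et al.^[18] proposed a method combining the scale invariant feature transform (SIFT) with the 3-D Hough transform: curvature is introduced to improve SIFT's recognition of complex structures and discrete points, and triangular facets are fitted so that their normal vectors can be cast into the 3-D Hough space to judge plane reliability, reducing the occurrence of spurious planes. The RANSAC algorithm iteratively estimates the plane equation with the highest probability and the largest number of supporting points^[19]; it is highly robust to noise points and well suited to large-scale datasets with many noise points. XU et al.^[20] proposed a weighted RANSAC method for point cloud segmentation that replaces the hard-threshold voting function on point-to-plane distance and normal vector consistency with a soft-threshold voting function based on two weight functions, and designed a weight function with outlier suppression for the error distribution between correct and incorrect plane hypotheses; this weighting significantly improves segmentation accuracy. LI et al.^[21] proposed a multi-primitive reconstruction method that first segments a building into planar patches with RANSAC, then uses a set of designed metrics to select the likely types of the initial building segments from predefined primitive building types, and finally applies 3-D Boolean operations to reconstruct a topologically consistent 3-D building model from its constituent primitives. RANSAC-based methods are now widely used for building plane extraction. However, when the planar structure is complex, the computational efficiency of RANSAC drops markedly and spurious planes are easily produced, so RANSAC performs poorly in complex scenes.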
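For reference, the hard-threshold RANSAC plane fit that these methods build on can be sketched as follows. This is a generic single-plane version, not the weighted variant of XU et al. or any other cited implementation; the function names, `threshold`, `iterations`, and `seed` are illustrative assumptions.

```python
import random

def plane_from_points(p, q, r):
    """Return (unit normal n, offset d) with n . x + d = 0, or None if the points are collinear."""
    u = [q[i] - p[i] for i in range(3)]
    v = [r[i] - p[i] for i in range(3)]
    n = [u[1] * v[2] - u[2] * v[1],
         u[2] * v[0] - u[0] * v[2],
         u[0] * v[1] - u[1] * v[0]]
    norm = sum(c * c for c in n) ** 0.5
    if norm < 1e-12:
        return None
    n = [c / norm for c in n]
    d = -sum(n[i] * p[i] for i in range(3))
    return n, d

def ransac_plane(points, threshold=0.05, iterations=200, seed=0):
    """Keep the plane hypothesis supported by the most points within `threshold`."""
    rng = random.Random(seed)
    best_plane, best_inliers = None, []
    for _ in range(iterations):
        model = plane_from_points(*rng.sample(points, 3))
        if model is None:
            continue
        n, d = model
        inliers = [i for i, p in enumerate(points)
                   if abs(n[0] * p[0] + n[1] * p[1] + n[2] * p[2] + d) <= threshold]
        if len(inliers) > len(best_inliers):
            best_plane, best_inliers = model, inliers
    return best_plane, best_inliers

# 25 points on the plane z = 1, followed by 5 off-plane outliers.
flat = [(float(x), float(y), 1.0) for x in range(5) for y in range(5)]
outliers = [(0.5, 0.5, 3.0), (1.5, 2.5, -2.0), (2.5, 1.5, 5.0),
            (3.5, 0.5, -4.0), (0.5, 3.5, 7.0)]
plane, inlier_idx = ransac_plane(flat + outliers)
```

Extracting several roof planes would repeat the fit on the points left after removing each inlier set, which is where the cost and the spurious planes mentioned above tend to appear.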

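Clustering-based methods usually extract planes from point clouds with clustering algorithms such as k-means and mean shift, according to point features (for example distance, normal vectors, and color)^[22]. Such methods depend on fixed thresholds and predefined parameters, and are very sensitive to noise and non-planar content. Density-based clustering^[23] is another widely used family of clustering algorithms. CHEN et al.^[24] proposed a new boundary detection and plane segmentation method for 3-D point clouds based on density-based spatial clustering of applications with noise (DBSCAN), which adds a coplanarity constraint to DBSCAN to select candidate samples in 3-D space and check plane validity. Because building roofs are diverse in type and complex in structure, a single clustering method rarely extracts accurate and complete roof planes. Traditional building plane extraction methods are therefore all limited in some respect.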
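To make the dependence on fixed thresholds concrete, a plain DBSCAN can be sketched in a few lines. This is the textbook algorithm with brute-force neighbor search, without the coplanarity constraint added by CHEN et al.; `eps` and `min_pts` are the two predefined parameters, and the toy data are invented for illustration.

```python
import math

def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or -1 for noise."""
    UNVISITED, NOISE = None, -1
    labels = [UNVISITED] * len(points)

    def neighbors(i):
        # Includes i itself, so min_pts counts the point being tested.
        return [j for j in range(len(points))
                if math.dist(points[i], points[j]) <= eps]

    cluster = 0
    for i in range(len(points)):
        if labels[i] is not UNVISITED:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE
            continue
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster      # former noise becomes a border point
            if labels[j] is not UNVISITED:
                continue
            labels[j] = cluster
            reachable = neighbors(j)
            if len(reachable) >= min_pts:  # j is a core point, keep expanding
                queue.extend(reachable)
        cluster += 1
    return labels

blob_a = [(0.0, 0.0, 0.0), (0.1, 0.0, 0.0), (0.0, 0.1, 0.0), (0.1, 0.1, 0.0)]
blob_b = [(5.0, 5.0, 1.0), (5.1, 5.0, 1.0), (5.0, 5.1, 1.0), (5.1, 5.1, 1.0)]
stray = [(10.0, 0.0, 3.0)]
cluster_labels = dbscan(blob_a + blob_b + stray, eps=0.2, min_pts=3)
```

Raising `eps` far enough merges the two groups into one cluster, and raising `min_pts` above the group size turns every point into noise, which is the parameter sensitivity noted above.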

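In recent years, the wide application of deep learning to 3-D point clouds in semantic segmentation^[25-28], instance segmentation^[29-30], and related tasks, and especially the introduction of several point cloud segmentation methods^[31-32], has made it possible to apply deep learning to roof plane segmentation. At present, deep learning methods^[33] are mostly used to reconstruct 3-D building models from single or multi-view images. ZHANG et al.^[34] proposed a multi-task network that jointly predicts semantics and instances for 3-D roof plane extraction and uses the mean shift algorithm to generate the final instance results. Building on this work and inspired by point cloud instance segmentation methods, this paper designs a simple and effective building plane extraction network: PointNet++ first extracts plane-instance-level deep features for each point; a simple distance constraint is then applied to cluster the deep features into initial planes; finally, unassigned points are assigned using a combined measure of spatial distance and deep feature distance.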
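The last two stages of this pipeline can be sketched as follows, taking the per-point deep features as given. The text states only that a distance constraint clusters the features and that a combined spatial and feature distance assigns the leftover points; everything else here is an assumption made for illustration, including the greedy seeding, the `min_size` cutoff, the use of cluster mean features, and the weighted sum controlled by `alpha`.

```python
import math

def cluster_features(features, r, min_size=3):
    """Greedy radius clustering in feature space; -1 marks points left unassigned."""
    labels = [-1] * len(features)
    next_id = 0
    for i in range(len(features)):
        if labels[i] != -1:
            continue
        group = [j for j in range(len(features))
                 if labels[j] == -1 and math.dist(features[i], features[j]) <= r]
        if len(group) >= min_size:  # hypothetical cutoff for accepting an initial plane
            for j in group:
                labels[j] = next_id
            next_id += 1
    return labels

def mean_vector(vectors):
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def assign_leftovers(xyz, features, labels, alpha=0.5):
    """Give each unassigned point to the plane with the lowest combined distance."""
    ids = sorted(set(labels) - {-1})
    members = {k: [i for i, lab in enumerate(labels) if lab == k] for k in ids}
    centers = {k: mean_vector([features[i] for i in members[k]]) for k in ids}
    out = list(labels)
    for i, lab in enumerate(labels):
        if lab != -1 or not ids:
            continue

        def cost(k):
            d_xyz = min(math.dist(xyz[i], xyz[j]) for j in members[k])
            d_feat = math.dist(features[i], centers[k])
            return alpha * d_xyz + (1 - alpha) * d_feat  # assumed weighted sum

        out[i] = min(ids, key=cost)
    return out

# Two planes with well-separated features, plus one edge point whose feature is ambiguous.
xyz = [(0, 0, 0), (1, 0, 0), (0, 1, 0),
       (5, 0, 1), (6, 0, 1), (5, 1, 1),
       (1.5, 0, 0.1)]
feats = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1),
         (3.0, 3.0), (3.1, 3.0), (3.0, 3.1),
         (1.2, 1.2)]
initial = cluster_features(feats, r=0.6)
final = assign_leftovers(xyz, feats, initial)
```

The edge point falls outside both feature clusters but lies next to the first plane in space, so the combined measure places it there. This is the situation at plane boundaries, where learned features tend to be least distinctive.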

3. Conclusion

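To address the poor performance of traditional methods on building plane segmentation, the authors propose a deep-learning-based point cloud plane extraction method for extracting building roof planes from airborne LiDAR point clouds. Deep learning is first used to extract distinct deep features for the points of each roof plane; the extracted features are then clustered in feature space; finally, a post-processing step assigns the unclustered points to the initial planes to obtain the final plane extraction result. Experimental results show that, for different types of building roofs, the proposed method not only separates out content that does not belong to the roof planes but also extracts the individual planes accurately and effectively, and that after post-processing the plane edges are segmented accurately. Compared with traditional building plane segmentation methods, the proposed method clearly improves the completeness and accuracy of the segmentation. Future work will combine the semantic information and plane edge information of airborne LiDAR roof point clouds to achieve more precise segmentation, in preparation for 3-D building reconstruction.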

References

[1]	YANG B Sh, LIANG F X, HUANG R G. Progress, challenges and trends in 3D laser scanning point cloud data processing research[J]. Acta Geodaetica et Cartographica Sinica, 2017, 46(10): 1509-1516(in Chinese).
[2]	DONG B G. Research on feature classification technology by fusion of airborne LiDAR point cloud and remote sensing images[D]. Zhengzhou: Information Engineering University, 2013(in Chinese).
[3]	SHI J Q. Research on several key technologies of airborne LiDAR in provincial basic surveying and mapping[D]. Wuhan: Wuhan University, 2014(in Chinese).
[4]	HE Zh B. An applied study of airborne LiDAR technology for digital terrestrial applications[D]. Xi'an: Chang'an University, 2008(in Chinese).
[5]	ZHAO Ch, ZHANG B M, CHEN X W. A building extraction method based on LiDAR point cloud[J]. Bulletin of Surveying and Mapping, 2017, (2): 35-39(in Chinese).
[6]	CHEN Y M. Multi-view 3D reconstruction of building models based on airborne and vehicle-mounted LiDAR data[D]. Nanjing: Nanjing University, 2015(in Chinese).
[7]	ZENG Q H. Airborne LiDAR point cloud data processing and 3D reconstruction of buildings[D]. Shanghai: Shanghai University, 2009(in Chinese).
[8]	LI F, WU Y X, WEI A X. Research progress in airborne LiDAR 3D building model reconstruction[J]. Laser Technology, 2015, 39(1): 23-27(in Chinese).
[9]	LU W X, WAN Y Ch, HE P P. Point cloud extraction and planar segmentation algorithm for buildings in large scenes[J]. Chinese Journal of Lasers, 2015, 42(9): 0914004-(in Chinese).
[10]	XU Y, TONG X, STILLA U. Voxel-based representation of 3D point clouds: Methods, applications, and its potential use in the construction industry[J]. Automation in Construction, 2021, 126(6): 1-26.
[11]	WANG W, REN X L, CHEN X Y. An improved algorithm for segmentation of region-growing color 3D point clouds[J]. Foreign Electronic Measurement Technology, 2018, 37(11): 10-14(in Chinese).
[12]	ZHU J T, WANG L, ZHAO Ch. Complex building roof point cloud segmentation based on region growing algorithm[J]. Remote Sensing for Land & Resources, 2019, 31(4): 20-25(in Chinese).
[13]	BALLARD D H. Generalizing the Hough transform to detect arbitrary shape[J]. Pattern Recognition, 1981, 13(2): 111-122.
[14]	FISCHLER M A, BOLLES R C. Random sample consensus: A paradigm for model fitting with applications to image analysis and automated cartography[J]. Communications of the ACM, 1981, 24(6): 381-395.
[15]	XU L, OJA E. Randomized Hough transform (RHT): Basic mechanisms, algorithms, and computational complexities[J]. CVGIP: Image Understanding, 1993, 57(2): 131-154.
[16]	KIRYATI N, ELDAR Y, BRUCKSTEIN A M. A probabilistic Hough transform[J]. Pattern Recognition, 1991, 24(4): 303-316.
[17]	YLA-JAASKI A, KIRYATI N. Adaptive termination of voting in the probabilistic circular Hough transform[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 1994, 16(9): 911-915.
[18]	KANG Ch L, LAN Y L, WANG N. A combined SIFT and 3D Hough transform for segmentation of building roof point clouds[J]. Remote Sensing Information, 2022, 37(5): 31-37(in Chinese).
[19]	XIA J Z, SUN H M, HU Sh H. 3D laser point cloud clustering method based on image information constraints[J]. Opto-Electronic Engineering, 2023, 50(2): 220148-(in Chinese).
[20]	XU B, JIANG W Sh, SHAN J. Investigation on the weighted RANSAC approaches for building roof plane segmentation from LiDAR point clouds[J]. Remote Sensing, 2015, 8(1): 01005-.
[21]	LI Z, SHAN J. RANSAC-based multi primitive building reconstruction from 3D point clouds[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 185(3): 247-260.
[22]	GONG Y J, PANG Y J, WANG G. Research progress on point cloud segmentation algorithm based on geometric features[J]. Laser Technology, 2022, 46(3): 326-336(in Chinese).
[23]	ZHAO Ch, ZHANG B M, CHEN X W. A high-precision automatic extraction method of building roof surface using point cloud neighborhood information[J]. Acta Geodaetica et Cartographica Sinica, 2017, 46(9): 1123-1134(in Chinese).
[24]	CHEN H, LIANG M, LIU W. An approach to boundary detection for 3D point clouds based on DBSCAN clustering[J]. Pattern Recognition, 2022, 124(4): 108431-.
[25]	QI C R, SU H, MO K, et al. PointNet: Deep learning on point sets for 3D classification and segmentation[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017: 652-660.
[26]	QI C R, YI L, SU H. PointNet++: Deep hierarchical feature learning on point sets in a metric space[J]. Advances in Neural Information Processing Systems, 2017, 30(12): 5105-5114.
[27]	WU W, QI Z, FUXIN L. PointConv: Deep convolutional networks on 3D point clouds[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA: IEEE, 2019: 9621-9630.
[28]	HU Q, YANG B, XIE L, et al. RandLA-Net: Efficient semantic segmentation of large-scale point clouds[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, WA, USA: IEEE, 2020: 11108-11117.
[29]	YI L, ZHAO W, WANG H, et al. GSPN: Generative shape proposal network for 3D instance segmentation in point cloud[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, CA, USA: IEEE, 2019: 3947-3956.
[30]	WANG X, LIU S, SHEN X, et al. Associatively segmenting instances and semantics in point clouds[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, USA: IEEE, 2019: 4096-4105.
[31]	PHAM Q H, NGUYEN T, HUA B S, et al. JSIS3D: Joint semantic-instance segmentation of 3D point clouds with multi-task pointwise networks and multi-value conditional random fields[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Long Beach, USA: IEEE, 2019: 8827-8836.
[32]	JIANG L, ZHAO H, SHI S, et al. PointGroup: Dual-set point grouping for 3D instance segmentation[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 2020: 4867-4876.
[33]	MAHMUD J, PRICE T, BAPAT A, et al. Boundary-aware 3D building reconstruction from a single overhead image[C]// Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. Seattle, USA: IEEE, 2020: 441-451.
[34]	ZHANG C Q, FAN H C. An improved multi-task pointwise network for segmentation of building roofs in airborne laser scanning point clouds[J]. The Photogrammetric Record, 2022, 37(179): 260-284.
[35]	WICHMANN A, AGOUB A, KADA M. RoofN3D: Deep learning training data for 3D building reconstruction[J]. The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, 2018, 42(5): 1191-1198.
[36]	LI L, SONG N, SUN F. Point2Roof: End-to-end 3D building roof modeling from airborne LiDAR point clouds[J]. ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 193(11): 17-28.
[37]	REN M, ZEMEL R S. End-to-end instance segmentation with recurrent attention[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017: 6656-6664.
[38]	LIU S, JIA J, FIDLER S, et al. SGN: Sequential grouping networks for instance segmentation[C]// Proceedings of the IEEE International Conference on Computer Vision. Venice, Italy: IEEE, 2017: 3496-3504.
[39]	ZHUO W, SALZMANN M, HE X, et al. Indoor scene parsing with instance segmentation, semantic labeling and support relationship inference[C]// Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Honolulu, USA: IEEE, 2017: 5429-5437.
[40]	de BRABANDERE B, NEVEN D, van GOOL L. Semantic instance segmentation with a discriminative loss function[J/OL]. (2017-08-08)[2023-08-16]. https://arxiv.org/abs/1708.02551.
[41]	SCHNABEL R, WAHL R, KLEIN R. Efficient RANSAC for point cloud shape detection[C]// Computer Graphics Forum. Oxford, UK: Blackwell Publishing Ltd, 2007: 214-226.
[42]	LAFARGE F, MALLET C. Creating large-scale city models from 3D-point clouds: A robust approach with hybrid representation[J]. International Journal of Computer Vision, 2012, 99(2): 69-85.

parameters	value
batch size	16
learning rate	10^-3
iterations	150
input size	2048×3

clustering radius r	coverage	weighted coverage	precision	recall	F₁-score
0.4	0.9597	0.9775	0.9897	0.9965	0.9930
0.5	0.9618	0.9786	0.9901	0.9972	0.9936
0.6	0.9626	0.9791	0.9900	0.9982	0.9940
0.7	0.9620	0.9785	0.9895	0.9982	0.9938
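The precision, recall, and F₁-score columns in these tables presumably follow the standard definitions based on true positives, false positives, and false negatives. The sketch below shows those definitions only; how points are counted as correct, and whether scores are averaged per building, is not stated in this excerpt, and the counts used here are invented for illustration.

```python
def precision_recall_f1(tp, fp, fn):
    """Standard definitions from true-positive, false-positive, and false-negative counts."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

# Invented counts: 90 points correctly extracted, 10 wrongly extracted, 30 missed.
example_precision, example_recall, example_f1 = precision_recall_f1(tp=90, fp=10, fn=30)
```

Because F₁ is the harmonic mean of precision and recall, it always lies between the two and sits closer to the smaller one.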

methods (synthetic dataset)	instance-level coverage	instance-level weighted coverage	point-level precision	point-level recall	point-level F₁-score
RANSAC^[41]	0.7623	0.7905	0.8423	0.9602	0.8882
region growing^[42]	0.8027	0.8295	0.8706	0.9811	0.9183
PointGroup^[32]	0.8214	0.8708	0.8745	0.9987	0.9313
our work	0.9626	0.9791	0.9900	0.9982	0.9940

methods (RoofN3D)	instance-level coverage	instance-level weighted coverage	point-level precision	point-level recall	point-level F₁-score
RANSAC	0.6331	0.7555	0.8230	0.9715	0.8883
region growing	0.6212	0.7345	0.7952	0.9510	0.8633
PointGroup	0.8082	0.8156	0.8486	0.9989	0.9090
our work	0.8757	0.8801	0.9453	0.9708	0.9573
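The instance-level columns can be read with the definitions commonly used in the instance segmentation literature: coverage is the mean, over ground-truth planes, of the best intersection-over-union (IoU) with any predicted plane, and weighted coverage weights each ground-truth plane by its share of the points. The sketch below assumes these definitions; the exact evaluation protocol is not given in this excerpt, and the labels are invented for illustration.

```python
def coverage_scores(gt_labels, pred_labels):
    """Return (coverage, weighted coverage) for two per-point instance labelings."""
    def groups(labels):
        sets = {}
        for i, lab in enumerate(labels):
            sets.setdefault(lab, set()).add(i)
        return sets

    gt_sets, pred_sets = groups(gt_labels), groups(pred_labels)
    total = len(gt_labels)
    cov = wcov = 0.0
    for gt in gt_sets.values():
        # Best-matching predicted instance for this ground-truth plane.
        best_iou = max(len(gt & pr) / len(gt | pr) for pr in pred_sets.values())
        cov += best_iou / len(gt_sets)
        wcov += best_iou * len(gt) / total
    return cov, wcov

# Invented example: one point of the smaller plane is absorbed by the larger one.
gt = [0, 0, 0, 0, 0, 0, 1, 1, 1, 1]
pred = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]
cov, wcov = coverage_scores(gt, pred)
```

Here the larger plane has IoU 6/7 and the smaller one 3/4, so the weighted score is higher than the unweighted one. Under these definitions, errors on large planes move weighted coverage more than errors on small planes.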

