Generating LOD3 building models from structure-from-motion and semantic segmentation

Bibliographic Details
Published in Automation in construction Vol. 141; p. 104430
Main Authors Pantoja-Rosero, B.G., Achanta, R., Kozinski, M., Fua, P., Perez-Cruz, F., Beyer, K.
Format Journal Article
Language English
Published Elsevier B.V. 01.09.2022
Summary:
•Novel pipeline for automatically reconstructing LOD3 models of free-standing buildings.
•New metrics for evaluating the performance of a LOD reconstruction.
•Benchmark dataset for evaluating the reconstruction of SfM as well as LOD3 models.
•Deep learning models for facade and opening segmentation of free-standing masonry buildings.

This paper describes a pipeline for automatically generating level of detail (LOD) models (digital twins), specifically LOD2 and LOD3, of free-standing buildings. Our approach combines structure from motion (SfM) with deep-learning-based segmentation techniques. Given multiple-view images of a building, we compute a three-dimensional (3D) planar abstraction (LOD2 model) of its point cloud using SfM techniques. To obtain LOD3 models, we use deep learning to perform semantic segmentation of the openings in the two-dimensional (2D) images. Unlike existing approaches, we do not rely on complex input, pre-defined 3D shapes, or manual intervention. To demonstrate the robustness of our method, we show that it can generate 3D building shapes from a collection of building images with no further input. For evaluating reconstructions, we also propose two novel metrics. The first is a Euclidean-distance-based correlation of the 3D building model with the point cloud. The second involves re-projecting 3D model facades onto source photos to determine Dice scores with respect to the ground-truth masks. Finally, we make the code, the image datasets, SfM outputs, and digital twins reported in this work publicly available at github.com/eesd-epfl/LOD3_buildings and doi.org/10.5281/zenodo.6651663. With this work we aim to contribute to research in applications such as construction management, city planning, and mechanical analysis, among others.
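The second metric in the summary relies on the Dice score, which for two binary masks A and B is 2|A ∩ B| / (|A| + |B|). The following is a minimal sketch of that standard formula only, not the authors' implementation (which is in the repository cited above). The function name `dice_score`, the nested-list mask layout, and the toy masks are assumptions made for illustration, as is the convention of returning 1.0 when both masks are empty.

```python
def dice_score(pred_mask, gt_mask):
    """Dice coefficient 2|A ∩ B| / (|A| + |B|) for two binary masks.

    Both masks are assumed to be equally sized 2D lists of 0/1 values
    (a hypothetical layout chosen to keep the sketch dependency-free).
    """
    if len(pred_mask) != len(gt_mask):
        raise ValueError("masks must have the same number of rows")

    intersection = 0  # pixels set in both masks
    pred_total = 0    # pixels set in the re-projected (predicted) mask
    gt_total = 0      # pixels set in the ground-truth mask

    for pred_row, gt_row in zip(pred_mask, gt_mask):
        if len(pred_row) != len(gt_row):
            raise ValueError("mask rows must have the same length")
        for p, g in zip(pred_row, gt_row):
            p, g = bool(p), bool(g)
            intersection += p and g
            pred_total += p
            gt_total += g

    denominator = pred_total + gt_total
    if denominator == 0:
        # Assumed convention: two empty masks agree perfectly.
        return 1.0
    return 2.0 * intersection / denominator


# Toy example: a re-projected opening mask versus a ground-truth mask.
projected_opening_mask = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
ground_truth_mask = [
    [0, 1, 1, 1],
    [0, 1, 1, 1],
    [0, 0, 0, 0],
]

print(dice_score(projected_opening_mask, ground_truth_mask))  # 0.8
```

In the toy example the masks share 4 pixels, with 4 and 6 pixels set respectively, so the score is 2·4 / (4 + 6) = 0.8. A score of 1.0 means the re-projected facade openings coincide exactly with the ground truth, and 0.0 means no overlap.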
ISSN:0926-5805
1872-7891
DOI:10.1016/j.autcon.2022.104430