Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

Philipp Schuegraf; Stefano Zorzi; Friedrich Fraundorfer; Ksenia Bittner

doi:10.1109/JSTARS.2023.3296449

Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

Philipp Schuegraf^*, Stefano Zorzi, Friedrich Fraundorfer, Ksenia Bittner

^*Corresponding author for this work

Institute of Computer Graphics and Vision (7100)

Research output: Contribution to journal › Article › peer-review

Abstract

Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.

Original language	English
Pages (from-to)	7186-7200
Number of pages	15
Journal	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume	16
DOIs	https://doi.org/10.1109/JSTARS.2023.3296449
Publication status	Published - 2023

Keywords

Convolutional neural networks
deep learning
semantic segmentation
supervised learning
urban areas

ASJC Scopus subject areas

Computers in Earth Sciences
Atmospheric Science

Access to Document

10.1109/JSTARS.2023.3296449Licence: CC BY-NC-ND 4.0

Cite this

@article{8d4f6b0983f4495f869bb40c5d11359d,

title = "Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images",

abstract = "Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.",

keywords = "Convolutional neural networks, deep learning, semantic segmentation, supervised learning, urban areas",

author = "Philipp Schuegraf and Stefano Zorzi and Friedrich Fraundorfer and Ksenia Bittner",

note = "Publisher Copyright: {\textcopyright} 2008-2012 IEEE.",

year = "2023",

doi = "10.1109/JSTARS.2023.3296449",

language = "English",

volume = "16",

pages = "7186--7200",

journal = "IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing",

issn = "1939-1404",

publisher = "Institute of Electrical and Electronics Engineers",

}

TY - JOUR

T1 - Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

AU - Schuegraf, Philipp

AU - Zorzi, Stefano

AU - Fraundorfer, Friedrich

AU - Bittner, Ksenia

PY - 2023

Y1 - 2023

N2 - Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.

AB - Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.

KW - Convolutional neural networks

KW - deep learning

KW - semantic segmentation

KW - supervised learning

KW - urban areas

UR - http://www.scopus.com/inward/record.url?scp=85165302287&partnerID=8YFLogxK

U2 - 10.1109/JSTARS.2023.3296449

DO - 10.1109/JSTARS.2023.3296449

M3 - Article

AN - SCOPUS:85165302287

SN - 1939-1404

VL - 16

SP - 7186

EP - 7200

JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ER -

Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this