Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

Philipp Schuegraf; Stefano Zorzi; Friedrich Fraundorfer; Ksenia Bittner

doi:10.1109/JSTARS.2023.3296449

Philipp Schuegraf*, Stefano Zorzi, Friedrich Fraundorfer, Ksenia Bittner

*Corresponding author for this work

Institut für Maschinelles Sehen und Darstellen (7100)

Publication: Contribution to journal › Article › Peer-reviewed

Abstract

Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.
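
As a rough illustration of the second, learning-free stage, the sketch below shows one way building instances could be derived from a building mask and a separation-line mask: remove the separation-line pixels and label the remaining connected regions. The abstract does not specify the actual method, so this is an assumption, and the function name `label_sections` and the toy masks are hypothetical.

```python
from collections import deque


def label_sections(building_mask, separation_mask):
    """Label 4-connected building regions after removing separation-line pixels.

    Both inputs are equally sized 2D lists of 0/1 values (hypothetical format).
    Returns (labels, count), where labels uses 0 for background.
    """
    rows = len(building_mask)
    cols = len(building_mask[0]) if rows else 0
    labels = [[0] * cols for _ in range(rows)]
    count = 0

    def is_section_pixel(y, x):
        # A pixel belongs to a section if it is building and not a separation line.
        return bool(building_mask[y][x]) and not separation_mask[y][x]

    for r in range(rows):
        for c in range(cols):
            if not is_section_pixel(r, c) or labels[r][c]:
                continue
            # Start a new instance and flood-fill it breadth-first.
            count += 1
            labels[r][c] = count
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny][nx] == 0
                            and is_section_pixel(ny, nx)):
                        labels[ny][nx] = count
                        queue.append((ny, nx))
    return labels, count


# Toy example: one building block split by a vertical separation line.
building = [[1, 1, 1, 1, 1],
            [1, 1, 1, 1, 1],
            [0, 0, 0, 0, 0]]
separation = [[0, 0, 1, 0, 0],
              [0, 0, 1, 0, 0],
              [0, 0, 0, 0, 0]]
labels, count = label_sections(building, separation)
print(count)
```

In this toy case the separation line splits the block into two labeled sections. A real pipeline would operate on the FCN's predicted masks and would likely need to reassign the separation-line pixels to neighboring sections afterwards.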

Original language	English
Pages (from-to)	7186-7200
Number of pages	15
Journal	IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Volume	16
DOIs	https://doi.org/10.1109/JSTARS.2023.3296449
Publication status	Published - 2023

ASJC Scopus subject areas

Computers in Earth Sciences
Atmospheric Science

Access to Document

10.1109/JSTARS.2023.3296449 (License: CC BY-NC-ND 4.0)

Cite this

@article{8d4f6b0983f4495f869bb40c5d11359d,

title = "Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images",

abstract = "Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.",

keywords = "Convolutional neural networks, deep learning, semantic segmentation, supervised learning, urban areas",

author = "Philipp Schuegraf and Stefano Zorzi and Friedrich Fraundorfer and Ksenia Bittner",

note = "Publisher Copyright: {\textcopyright} 2008-2012 IEEE.",

year = "2023",

doi = "10.1109/JSTARS.2023.3296449",

language = "English",

volume = "16",

pages = "7186--7200",

journal = "IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing",

issn = "1939-1404",

publisher = "Institute of Electrical and Electronics Engineers",

}

TY - JOUR

T1 - Deep Learning for the Automatic Division of Building Constructions into Sections on Remote Sensing Images

AU - Schuegraf, Philipp

AU - Zorzi, Stefano

AU - Fraundorfer, Friedrich

AU - Bittner, Ksenia

PY - 2023

Y1 - 2023

N2 - Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.

KW - Convolutional neural networks

KW - deep learning

KW - semantic segmentation

KW - supervised learning

KW - urban areas

UR - http://www.scopus.com/inward/record.url?scp=85165302287&partnerID=8YFLogxK

U2 - 10.1109/JSTARS.2023.3296449

DO - 10.1109/JSTARS.2023.3296449

M3 - Article

AN - SCOPUS:85165302287

SN - 1939-1404

VL - 16

SP - 7186

EP - 7200

JO - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

JF - IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

ER -
