Abstract
Urban areas predominantly consist of complex building structures, which are assembled of multiple building sections. From very high resolution remote sensing imagery, not only roof-tops but also the separation lines between them are visible. Since fully convolutional neural network (FCN)-based methods have become the primary choice in segmentation approaches, they have been extensively used for automatic building footprint extraction. But each of the previous works on building segmentation either lacks separation of building blocks into sections or does not produce sections of regular appearance. We propose a two-stage approach to overcome these limitations. The first step segments building and separation lines using an FCN model and the second step produces building instances by using a learning-free method. Our model receives a top-down image and a digital surface model (DSM) patch in two separate encoders. The encoder features are summed before the skip connections, which utilize the encoder features from the current and higher-resolution feature maps. We train our model with regularization losses for building shapes and separation lines on both satellite and aerial imagery. We test our model on a city that was not previously included in the training phase to show that it has the capacity to generalize across different geographical locations and architectural styles. Furthermore, we use our building section instance predictions to generate: 1) vectorized building maps and 2) a level-of-detail-1 DSM.
Original language | English |
---|---|
Pages (from-to) | 7186-7200 |
Number of pages | 15 |
Journal | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
Volume | 16 |
DOIs | |
Publication status | Published - 2023 |
Keywords
- Convolutional neural networks
- deep learning
- semantic segmentation
- supervised learning
- urban areas
ASJC Scopus subject areas
- Computers in Earth Sciences
- Atmospheric Science