Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing

Stefano Zorzi, Friedrich Fraundorfer

Publication: Contribution to book/report/conference proceedings › Conference paper › Peer-reviewed

Abstract

While most state-of-the-art instance segmentation methods produce pixel-wise segmentation masks, numerous applications demand precise vector polygons of detected objects instead of rasterized output. This paper proposes Re:PolyWorld as a remastered and improved version of PolyWorld, a neural network that extracts object vertices from an image and connects them optimally to generate precise polygons. The objective of this work was to overcome weaknesses and shortcomings of the original model and to introduce an improved polygonal representation, yielding a general-purpose method for polygon extraction in images. The architecture has been redesigned to not only exploit vertex features, but to also make use of the visual appearance of edges. To this end, an edge-aware Graph Neural Network predicts the connection strength between each pair of vertices, which is then used to compute the assignment by solving a differentiable optimal transport problem. The proposed redefinition of the polygonal scene turns the method into a powerful generalized approach that can be applied to a large variety of tasks and problem settings, such as building extraction, floorplan reconstruction and even wireframe parsing. Re:PolyWorld not only outperforms the original model on building extraction in aerial images, thanks to the proposed joint analysis of vertices and edges, but also beats the state-of-the-art in multiple other domains.
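
The assignment step mentioned in the abstract can be illustrated with a small sketch. The abstract does not name the solver, so the code below assumes Sinkhorn normalization, a common way to solve entropy-regularized optimal transport. The function name `sinkhorn`, the parameters `n_iters` and `eps`, and the toy score matrix are all hypothetical and are not taken from the paper.

```python
import numpy as np


def _logsumexp(x, axis):
    # Numerically stable log(sum(exp(x))) along one axis, keeping dimensions.
    m = x.max(axis=axis, keepdims=True)
    return m + np.log(np.exp(x - m).sum(axis=axis, keepdims=True))


def sinkhorn(scores, n_iters=50, eps=1.0):
    """Turn a square score matrix into an (approximately) doubly stochastic one.

    scores[i, j] stands in for the predicted connection strength from
    vertex i to vertex j (an assumption made for this illustration).
    """
    log_p = scores / eps
    for _ in range(n_iters):
        log_p = log_p - _logsumexp(log_p, axis=1)  # normalize rows
        log_p = log_p - _logsumexp(log_p, axis=0)  # normalize columns
    return np.exp(log_p)


# Toy example: three vertices whose scores favor the cycle 0 -> 1 -> 2 -> 0.
scores = np.array([[0.0, 5.0, 0.0],
                   [0.0, 0.0, 5.0],
                   [5.0, 0.0, 0.0]])
P = sinkhorn(scores)
next_vertex = P.argmax(axis=1)  # most likely successor of each vertex
```

Every step consists of differentiable operations (subtraction, exponentials, logarithms), so the same loop written in an automatic-differentiation framework would let gradients flow back to the score matrix. That is the sense in which such an assignment is differentiable.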
Original language: English
Title of host publication: 2023 IEEE/CVF International Conference on Computer Vision (ICCV)
Publisher: ACM/IEEE
Pages: 16716-16725
Number of pages: 10
ISBN (Print): 979-8-3503-0719-1
DOIs
Publication status: Published - 6 Oct 2023
Event: 2023 IEEE/CVF International Conference on Computer Vision: ICCV 2023 - Paris, France
Duration: 1 Oct 2023 to 6 Oct 2023

Conference

Conference: 2023 IEEE/CVF International Conference on Computer Vision
Abbreviated title: ICCV 2023
Country/Territory: France
City: Paris
Period: 1/10/23 to 6/10/23

Keywords

  • Instance segmentation
  • Visualization
  • Image edge detection
  • Buildings
  • Computer architecture
  • Feature extraction
  • Graph neural networks

Fields of Expertise

  • Information, Communication & Computing
