Unsupervised single-shot depth estimation using perceptual reconstruction

Christoph Angermann; Matthias Schwab; Markus Haltmeier; Christian Laubichler; Steinbjörn Jónsson

doi:10.1007/s00138-023-01410-5

Unsupervised single-shot depth estimation using perceptual reconstruction

Christoph Angermann^*, Matthias Schwab, Markus Haltmeier, Christian Laubichler, Steinbjörn Jónsson

^*Korrespondierende/r Autor/-in für diese Arbeit

LEC GmbH (98780)

Publikation: Beitrag in einer Fachzeitschrift › Artikel › Begutachtung

Abstract

Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade of machine learning, extensive deployment of deep learning methods to computer vision tasks has yielded approaches that succeed in achieving realistic depth synthesis out of a simple RGB modality. Most of these models are based on paired RGB-depth data and/or the availability of video sequences and stereo images. However, the lack of RGB-depth pairs, video sequences, or stereo images makes depth estimation a challenging task that needs to be explored in more detail. This study builds on recent advances in the field of generative neural networks in order to establish fully unsupervised single-shot depth estimation. Two generators for RGB-to-depth and depth-to-RGB transfer are implemented and simultaneously optimized using the Wasserstein-1 distance, a novel perceptual reconstruction term, and hand-crafted image filters. We comprehensively evaluate the models using a custom-generated industrial surface depth data set as well as the Texas 3D Face Recognition Database, the CelebAMask-HQ database of human portraits and the SURREAL dataset that records body depth. For each evaluation dataset, the proposed method shows a significant increase in depth accuracy compared to state-of-the-art single-image transfer methods.

Originalsprache	englisch
Aufsatznummer	82
Seitenumfang	16
Fachzeitschrift	Machine Vision and Applications
Jahrgang	34
Ausgabenummer	5
DOIs	https://doi.org/10.1007/s00138-023-01410-5
Publikationsstatus	Veröffentlicht - 11 Aug. 2023

ASJC Scopus subject areas

Software
Hardware und Architektur
Maschinelles Sehen und Mustererkennung
Angewandte Informatik

Zugriff auf Dokument

10.1007/s00138-023-01410-5Lizenz: CC BY 4.0

Andere Dateien und Links

Verknüpfung zur Publikation in Scopus

Dieses zitieren

@article{46a43f37a8e64aefb4b972389996aa75,

title = "Unsupervised single-shot depth estimation using perceptual reconstruction",

abstract = "Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade of machine learning, extensive deployment of deep learning methods to computer vision tasks has yielded approaches that succeed in achieving realistic depth synthesis out of a simple RGB modality. Most of these models are based on paired RGB-depth data and/or the availability of video sequences and stereo images. However, the lack of RGB-depth pairs, video sequences, or stereo images makes depth estimation a challenging task that needs to be explored in more detail. This study builds on recent advances in the field of generative neural networks in order to establish fully unsupervised single-shot depth estimation. Two generators for RGB-to-depth and depth-to-RGB transfer are implemented and simultaneously optimized using the Wasserstein-1 distance, a novel perceptual reconstruction term, and hand-crafted image filters. We comprehensively evaluate the models using a custom-generated industrial surface depth data set as well as the Texas 3D Face Recognition Database, the CelebAMask-HQ database of human portraits and the SURREAL dataset that records body depth. For each evaluation dataset, the proposed method shows a significant increase in depth accuracy compared to state-of-the-art single-image transfer methods.",

keywords = "Perceptual similarity, Surface depth, Unsupervised learning, Wasserstein GAN",

author = "Christoph Angermann and Matthias Schwab and Markus Haltmeier and Christian Laubichler and Steinbj{\"o}rn J{\'o}nsson",

note = "Publisher Copyright: {\textcopyright} 2023, The Author(s).",

year = "2023",

month = aug,

day = "11",

doi = "10.1007/s00138-023-01410-5",

language = "English",

volume = "34",

journal = "Machine Vision and Applications",

issn = "0932-8092",

publisher = "Springer Verlag",

number = "5",

}

TY - JOUR

T1 - Unsupervised single-shot depth estimation using perceptual reconstruction

AU - Angermann, Christoph

AU - Schwab, Matthias

AU - Haltmeier, Markus

AU - Laubichler, Christian

AU - Jónsson, Steinbjörn

PY - 2023/8/11

Y1 - 2023/8/11

N2 - Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade of machine learning, extensive deployment of deep learning methods to computer vision tasks has yielded approaches that succeed in achieving realistic depth synthesis out of a simple RGB modality. Most of these models are based on paired RGB-depth data and/or the availability of video sequences and stereo images. However, the lack of RGB-depth pairs, video sequences, or stereo images makes depth estimation a challenging task that needs to be explored in more detail. This study builds on recent advances in the field of generative neural networks in order to establish fully unsupervised single-shot depth estimation. Two generators for RGB-to-depth and depth-to-RGB transfer are implemented and simultaneously optimized using the Wasserstein-1 distance, a novel perceptual reconstruction term, and hand-crafted image filters. We comprehensively evaluate the models using a custom-generated industrial surface depth data set as well as the Texas 3D Face Recognition Database, the CelebAMask-HQ database of human portraits and the SURREAL dataset that records body depth. For each evaluation dataset, the proposed method shows a significant increase in depth accuracy compared to state-of-the-art single-image transfer methods.

AB - Real-time estimation of actual object depth is an essential module for various autonomous system tasks such as 3D reconstruction, scene understanding and condition assessment. During the last decade of machine learning, extensive deployment of deep learning methods to computer vision tasks has yielded approaches that succeed in achieving realistic depth synthesis out of a simple RGB modality. Most of these models are based on paired RGB-depth data and/or the availability of video sequences and stereo images. However, the lack of RGB-depth pairs, video sequences, or stereo images makes depth estimation a challenging task that needs to be explored in more detail. This study builds on recent advances in the field of generative neural networks in order to establish fully unsupervised single-shot depth estimation. Two generators for RGB-to-depth and depth-to-RGB transfer are implemented and simultaneously optimized using the Wasserstein-1 distance, a novel perceptual reconstruction term, and hand-crafted image filters. We comprehensively evaluate the models using a custom-generated industrial surface depth data set as well as the Texas 3D Face Recognition Database, the CelebAMask-HQ database of human portraits and the SURREAL dataset that records body depth. For each evaluation dataset, the proposed method shows a significant increase in depth accuracy compared to state-of-the-art single-image transfer methods.

KW - Perceptual similarity

KW - Surface depth

KW - Unsupervised learning

KW - Wasserstein GAN

UR - http://www.scopus.com/inward/record.url?scp=85168311333&partnerID=8YFLogxK

U2 - 10.1007/s00138-023-01410-5

DO - 10.1007/s00138-023-01410-5

M3 - Article

AN - SCOPUS:85168311333

SN - 0932-8092

VL - 34

JO - Machine Vision and Applications

JF - Machine Vision and Applications

IS - 5

M1 - 82

ER -

Unsupervised single-shot depth estimation using perceptual reconstruction

Abstract

ASJC Scopus subject areas

Zugriff auf Dokument

Andere Dateien und Links

Fingerprint

Dieses zitieren