Accurate Camera Registration in Urban Environments Using High-Level Feature Matching

Anil Armagan; Martin Hirzer; Peter M. Roth; Vincent Lepetit

Institute of Computer Graphics and Vision (7100)

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

We propose a method for accurate camera pose estimation in urban environments from single images and 2D maps made of the surrounding buildings’ outlines. Our approach bridges the gap between learning-based approaches and geometric approaches: We use recent semantic segmentation techniques for extracting the buildings’ edges and the façades’ normals in the images and minimal solvers [14] to compute the camera pose accurately and robustly. We propose two such minimal solvers: one based on three correspondences of buildings’ corners from the image and the 2D map and another one based on two corner correspondences plus one façade correspondence. We show on a challenging dataset that, compared to recent state-of-the-art [1], this approach is both faster and more accurate.

Original language	English
Title of host publication	Proceedings of the British Machine Vision Conference (BMVC)
Publication status	Published - 2017
Event	28th British Machine Vision Conference: BMVC 2017 - London, United Kingdom Duration: 4 Sept 2017 → 7 Sept 2017

Conference

Conference	28th British Machine Vision Conference
Abbreviated title	BMVC 2017
Country/Territory	United Kingdom
City	London
Period	4/09/17 → 7/09/17

Cite this

@inproceedings{0a5d1c54f48e4877ac7a296b90eae54c,

title = "Accurate Camera Registration in Urban Environments Using High-Level Feature Matching",

abstract = "We propose a method for accurate camera pose estimation in urban environments from single images and 2D maps made of the surrounding buildings{\textquoteright} outlines. Our approach bridges the gap between learning-based approaches and geometric approaches: We use recent semantic segmentation techniques for extracting the buildings{\textquoteright} edges and the fa{\c c}ades{\textquoteright} normals in the images and minimal solvers [14] to compute the camera pose accurately and robustly. We propose two such minimal solvers: one based on three correspondences of buildings{\textquoteright} corners from the image and the 2D map and another one based on two corner correspondences plus one fa{\c c}ade correspondence. We show on a challenging dataset that, compared to recent state-of-the-art [1], this approach is both faster and more accurate.",

author = "Anil Armagan and Martin Hirzer and Roth, {Peter M.} and Vincent Lepetit",

year = "2017",

language = "English",

booktitle = "Proceedings of the British Machine Vision Conference (BMVC)",

note = "28th British Machine Vision Conference: BMVC 2017; Conference date: 04-09-2017 through 07-09-2017",

}

TY - GEN

T1 - Accurate Camera Registration in Urban Environments Using High-Level Feature Matching

AU - Armagan, Anil

AU - Hirzer, Martin

AU - Roth, Peter M.

AU - Lepetit, Vincent

PY - 2017

Y1 - 2017

AB - We propose a method for accurate camera pose estimation in urban environments from single images and 2D maps made of the surrounding buildings’ outlines. Our approach bridges the gap between learning-based approaches and geometric approaches: We use recent semantic segmentation techniques for extracting the buildings’ edges and the façades’ normals in the images and minimal solvers [14] to compute the camera pose accurately and robustly. We propose two such minimal solvers: one based on three correspondences of buildings’ corners from the image and the 2D map and another one based on two corner correspondences plus one façade correspondence. We show on a challenging dataset that, compared to recent state-of-the-art [1], this approach is both faster and more accurate.

M3 - Conference paper

BT - Proceedings of the British Machine Vision Conference (BMVC)

T2 - 28th British Machine Vision Conference

Y2 - 4 September 2017 through 7 September 2017

ER -
