Abstract
In this work we introduce S-TREK, a novel local feature extractor that combines a deep keypoint detector, which is both translation and rotation equivariant by design, with a lightweight deep descriptor extractor. We train the S-TREK keypoint detector within a framework inspired by reinforcement learning, where we leverage a sequential procedure to maximize a reward directly related to keypoint repeatability. Our descriptor network is trained following a "detect, then describe" approach, where the descriptor loss is evaluated only at those locations where keypoints have been selected by the already trained detector. Extensive experiments on multiple benchmarks confirm the effectiveness of our proposed method, with S-TREK often outperforming other state-of-the-art methods in terms of repeatability and quality of the recovered poses, especially when dealing with in-plane rotations.
Original language | English |
---|---|
Title of host publication | 2023 IEEE/CVF International Conference on Computer Vision (ICCV) |
Publisher | IEEEXplore |
Pages | 9694-9703 |
DOIs | |
Publication status | Published - 2023 |
Event | 2023 IEEE/CVF International Conference on Computer Vision: ICCV 2023 - Paris, France Duration: 1 Oct 2023 → 6 Oct 2023 |
Conference
Conference | 2023 IEEE/CVF International Conference on Computer Vision |
---|---|
Abbreviated title | ICCV 2023 |
Country/Territory | France |
City | Paris |
Period | 1/10/23 → 6/10/23 |
Keywords
- cs.CV
Fields of Expertise
- Information, Communication & Computing