Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation

Shreyas Hampali, Sayan Deb Sarkar, Mahdi Rad, Vincent Lepetit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

We propose a robust and accurate method for estimating the 3D poses of two hands in close interaction from a single color image. This is a very challenging problem, as large occlusions and many confusions between the joints may happen. State-of-the-art methods solve this problem by regressing a heatmap for each joint, which requires solving two problems simultaneously: localizing the joints and recognizing them. In this work, we propose to separate these tasks by relying on a CNN to first localize joints as 2D keypoints, and on self-attention between the CNN features at these keypoints to associate them with the corresponding hand joint. The resulting architecture, which we call 'Keypoint Transformer', is highly efficient as it achieves state-of-the-art performance with roughly half the number of model parameters on the InterHand2.6M dataset. We also show it can be easily extended to estimate the 3D pose of an object manipulated by one or two hands with high performance. Moreover, we created a new dataset of more than 75,000 images of two hands manipulating an object fully annotated in 3D and will make it publicly available.

Originalspracheenglisch
TitelProceedings - 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2022
Herausgeber (Verlag)IEEE Computer Society Publications
Seiten11080-11090
Seitenumfang11
ISBN (elektronisch)9781665469463
DOIs
PublikationsstatusVeröffentlicht - 2022
Veranstaltung2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition: CVPR 2022 - New Orleans, USA / Vereinigte Staaten
Dauer: 19 Juni 202224 Juni 2022

Konferenz

Konferenz2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition
KurztitelCVPR 2022
Land/GebietUSA / Vereinigte Staaten
OrtNew Orleans
Zeitraum19/06/2224/06/22

ASJC Scopus subject areas

  • Software
  • Maschinelles Sehen und Mustererkennung

Fingerprint

Untersuchen Sie die Forschungsthemen von „Keypoint Transformer: Solving Joint Identification in Challenging Hands and Object Interactions for Accurate 3D Pose Estimation“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren