Abstract
We propose a method to automatically generate high-quality ground-truth annotations for grasping-point prediction and show the usefulness of these annotations by training a deep neural network to predict grasping candidates for objects in a cluttered environment. First, we acquire sequences of RGBD images of a real-world picking scenario and leverage the sequential depth information to extract labels for grasping-point prediction. Afterwards, we train a deep neural network to predict grasping points, establishing a fully automatic pipeline from acquiring data to a trained network without the need for human annotators. Our experiments show that the network trained with automatically generated labels delivers high-quality predictions of grasping candidates, on par with a network trained on human-annotated data. This work lowers the cost and complexity of creating task-specific grasping datasets and makes it easy to expand an existing dataset without additional effort.
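The core labeling idea can be illustrated with a minimal sketch. Assuming that between two consecutive depth frames exactly one object has been picked, the region where depth increased marks the removed object, and its centroid can serve as a grasp-point label. The function name, the threshold value, and the centroid heuristic below are illustrative assumptions, not the paper's actual implementation:

```python
import numpy as np

def grasp_label_from_depth_pair(depth_before, depth_after, min_change=0.01):
    """Hypothetical label extraction from two consecutive depth frames.

    Where an object was removed between frames, the measured depth
    increases (the sensor now sees the surface behind the object).
    The centroid of that region is returned as a grasp-point label.
    """
    # Depth increases (scene gets farther) where an object was removed.
    diff = depth_after - depth_before
    removed = diff > min_change          # binary mask of the removed object
    if not removed.any():
        return None                      # nothing was picked between frames
    ys, xs = np.nonzero(removed)
    # Centroid of the removed-object mask as (x, y) pixel coordinates.
    return int(round(xs.mean())), int(round(ys.mean()))

# Toy example: a 10x10 tabletop scene; an object occupied rows 3..5, cols 4..6.
before = np.full((10, 10), 1.0)          # table plane at 1.0 m depth
before[3:6, 4:7] = 0.9                   # object sits 10 cm above the table
after = np.full((10, 10), 1.0)           # same scene after the object is picked
print(grasp_label_from_depth_pair(before, after))  # → (5, 4)
```

In practice such a heuristic would need frame alignment, noise filtering, and handling of occlusions in clutter; it is only meant to convey how sequential depth yields labels for free.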
Original language | English |
---|---|
Title of host publication | Proceedings of the Joint Austrian Computer Vision and Robotics Workshop 2020 |
Pages | 124–130 |
Publication status | Published - 2020 |
Event | 2020 Joint Austrian Computer Vision and Robotics Workshop - Technische Universität Graz, Austria (cancelled) Duration: 17 Sept 2020 → 18 Sept 2020 |
Conference
Conference | 2020 Joint Austrian Computer Vision and Robotics Workshop |
---|---|
Abbreviated title | ACVRW '20 |
Country/Territory | Austria |
City | cancelled |
Period | 17/09/20 → 18/09/20 |