Deep Learning-Powered Assembly Step Classification for Intricate Machines

Luca Rodiga; Eva Eggeling; Ulrich Krispel; Torsten Ullrich

doi:10.5220/0012376300003660

Publication: Chapter in Book/Report/Conference proceeding › Conference paper › peer-review

Abstract

Augmented Reality-based assistance systems can help qualified technicians by providing them with technical details. However, the applicability is limited by the low availability of real data. In this paper, we focus on synthetic renderings of CAD data. Our objective is to investigate different model architectures within the machine-learning component and compare their performance. The training data consists of CAD renderings from different viewpoints distributed over a sphere around the model. Utilizing the advantages of transfer learning and pre-trained backbones we trained different versions of EfficientNet and EfficientNetV2 on these images for every assembly step in two resolutions. The classification performance was evaluated on a smaller test set of synthetic renderings and a dataset of real-world images of the model. The best Top1-accuracy on the real-world dataset is achieved by the medium-sized EfficientNetV2 with 57.74%, while the best Top5-accuracy is provided by EfficientNetV2 Small. Consequently, our approach has a good classification performance indicating the real-world applicability of such a deep learning classifier in the near future.
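
The Top1- and Top5-accuracy figures quoted in the abstract follow the usual definition: a prediction counts as correct when the true assembly step is among the k highest-scoring classes. The sketch below illustrates that metric only; it is not the authors' evaluation code, and the function name `top_k_accuracy` and all scores and labels are hypothetical values chosen for the example.

```python
from typing import Sequence


def top_k_accuracy(scores: Sequence[Sequence[float]],
                   labels: Sequence[int],
                   k: int) -> float:
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    if len(scores) != len(labels):
        raise ValueError("scores and labels must have the same length")
    if not scores:
        raise ValueError("at least one sample is required")
    hits = 0
    for row, label in zip(scores, labels):
        # Rank class indices by descending score and keep the first k.
        ranked = sorted(range(len(row)), key=lambda c: row[c], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Hypothetical classifier scores for three images over four assembly steps.
# These numbers are made up for illustration and are not from the paper.
scores = [
    [0.1, 0.6, 0.2, 0.1],  # true step: 1
    [0.5, 0.1, 0.3, 0.1],  # true step: 2
    [0.2, 0.2, 0.1, 0.5],  # true step: 0
]
labels = [1, 2, 0]

top1 = top_k_accuracy(scores, labels, k=1)
top2 = top_k_accuracy(scores, labels, k=2)
print(f"Top-1: {top1:.2%}, Top-2: {top2:.2%}")
```

Because Top-k with larger k only ever adds hits, a model can lead on Top5 without leading on Top1, which is consistent with the abstract naming different EfficientNetV2 variants for the two metrics.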

Original language	English
Title of host publication	Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Publisher	SciTePress
Pages	500-507
Number of pages	8
Volume	4, VISAPP
ISBN (electronic)	978-989-758-679-8
DOIs	https://doi.org/10.5220/0012376300003660
Publication status	Published - 2024
Event	19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: VISIGRAPP 2024 - Rome, Italy. Duration: 27 Feb 2024 → 29 Feb 2024

Conference

Conference	19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications
Abbreviated title	VISIGRAPP 2024
Country/Territory	Italy
City	Rome
Period	27/02/24 → 29/02/24

ASJC Scopus subject areas

Computer Graphics and Computer-Aided Design
Computer Vision and Pattern Recognition
Human-Computer Interaction

Access to Document

10.5220/0012376300003660 (License: CC BY-NC-ND 4.0)

Other files and links

Link to publication in Scopus

Cite this

Deep Learning-Powered Assembly Step Classification for Intricate Machines. / Rodiga, Luca; Eggeling, Eva; Krispel, Ulrich; Ullrich, Torsten.
Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. Vol. 4, VISAPP. SciTePress, 2024. pp. 500-507.

Rodiga, L, Eggeling, E, Krispel, U & Ullrich, T 2024, Deep Learning-Powered Assembly Step Classification for Intricate Machines. in Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications. vol. 4, VISAPP, SciTePress, pp. 500-507, 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, Rome, Italy, 27/02/24. https://doi.org/10.5220/0012376300003660

@inproceedings{28b2321ba50b4461a8135c9cb36a0095,

title = "Deep Learning-Powered Assembly Step Classification for Intricate Machines",

abstract = "Augmented Reality-based assistance systems can help qualified technicians by providing them with technical details. However, the applicability is limited by the low availability of real data. In this paper, we focus on synthetic renderings of CAD data. Our objective is to investigate different model architectures within the machine-learning component and compare their performance. The training data consists of CAD renderings from different viewpoints distributed over a sphere around the model. Utilizing the advantages of transfer learning and pre-trained backbones we trained different versions of EfficientNet and EfficientNetV2 on these images for every assembly step in two resolutions. The classification performance was evaluated on a smaller test set of synthetic renderings and a dataset of real-world images of the model. The best Top1-accuracy on the real-world dataset is achieved by the medium-sized EfficientNetV2 with 57.74\%, while the best Top5-accuracy is provided by EfficientNetV2 Small. Consequently, our approach has a good classification performance indicating the real-world applicability of such a deep learning classifier in the near future.",

keywords = "Computer Vision, Deep Learning, Machine Learning",

author = "Luca Rodiga and Eva Eggeling and Ulrich Krispel and Torsten Ullrich",

note = "Publisher Copyright: {\textcopyright} 2024 by SCITEPRESS – Science and Technology Publications, Lda.; 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications : VISIGRAPP 2024, VISIGRAPP 2024 ; Conference date: 27-02-2024 Through 29-02-2024",

year = "2024",

doi = "10.5220/0012376300003660",

language = "English",

volume = "4, VISAPP",

pages = "500--507",

booktitle = "Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications",

publisher = "SciTePress",

address = "Portugal",

}

TY - GEN

T1 - Deep Learning-Powered Assembly Step Classification for Intricate Machines

AU - Rodiga, Luca

AU - Eggeling, Eva

AU - Krispel, Ulrich

AU - Ullrich, Torsten

PY - 2024

Y1 - 2024

AB - Augmented Reality-based assistance systems can help qualified technicians by providing them with technical details. However, the applicability is limited by the low availability of real data. In this paper, we focus on synthetic renderings of CAD data. Our objective is to investigate different model architectures within the machine-learning component and compare their performance. The training data consists of CAD renderings from different viewpoints distributed over a sphere around the model. Utilizing the advantages of transfer learning and pre-trained backbones we trained different versions of EfficientNet and EfficientNetV2 on these images for every assembly step in two resolutions. The classification performance was evaluated on a smaller test set of synthetic renderings and a dataset of real-world images of the model. The best Top1-accuracy on the real-world dataset is achieved by the medium-sized EfficientNetV2 with 57.74%, while the best Top5-accuracy is provided by EfficientNetV2 Small. Consequently, our approach has a good classification performance indicating the real-world applicability of such a deep learning classifier in the near future.

KW - Computer Vision

KW - Deep Learning

KW - Machine Learning

UR - http://www.scopus.com/inward/record.url?scp=85190714573&partnerID=8YFLogxK

U2 - 10.5220/0012376300003660

DO - 10.5220/0012376300003660

M3 - Conference paper

AN - SCOPUS:85190714573

VL - 4, VISAPP

SP - 500

EP - 507

BT - Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications

PB - SciTePress

T2 - 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications

Y2 - 27 February 2024 through 29 February 2024

ER -
