Abstract
Augmented Reality-based assistance systems can help qualified technicians by providing them with technical details. However, the applicability is limited by the low availability of real data. In this paper, we focus on synthetic renderings of CAD data. Our objective is to investigate different model architectures within the machine-learning component and compare their performance. The training data consists of CAD renderings from different viewpoints distributed over a sphere around the model. Utilizing the advantages of transfer learning and pre-trained backbones we trained different versions of EfficientNet and EfficientNetV2 on these images for every assembly step in two resolutions. The classification performance was evaluated on a smaller test set of synthetic renderings and a dataset of real-world images of the model. The best Top1-accuracy on the real-world dataset is achieved by the medium-sized EfficientNetV2 with 57.74%, while the best Top5-accuracy is provided by EfficientNetV2 Small. Consequently, our approach has a good classification performance indicating the real-world applicability of such a deep learning classifier in the near future.
Originalsprache | englisch |
---|---|
Titel | Proceedings of the 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
Herausgeber (Verlag) | SciTePress |
Seiten | 500-507 |
Seitenumfang | 8 |
Band | 4, VISAPP |
ISBN (elektronisch) | 978-989-758-679-8 |
DOIs | |
Publikationsstatus | Veröffentlicht - 2024 |
Veranstaltung | 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications: VISIGRAPP 2024 - Rome, Italien Dauer: 27 Feb. 2024 → 29 Feb. 2024 |
Konferenz
Konferenz | 19th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications |
---|---|
Kurztitel | VISIGRAPP 2024 |
Land/Gebiet | Italien |
Ort | Rome |
Zeitraum | 27/02/24 → 29/02/24 |
ASJC Scopus subject areas
- Computergrafik und computergestütztes Design
- Maschinelles Sehen und Mustererkennung
- Human-computer interaction