Version Control for Speech Corpora

Vlad-Andrei Dumitru*, Matthias Boehm, Martin Hagmüller, Barbara Schuppler

*Korrespondierende/r Autor/-in für diese Arbeit

Publikation: Beitrag in Buch/Bericht/KonferenzbandBeitrag in einem KonferenzbandBegutachtung

Abstract

While the audio recordings of a corpus represent the ground truth, transcriptions are – in the case of manual annotations – subject to human error, and subject to changes related to technology improvements underpinning automated annotation methods. In order to facilitate the dynamic extension of speech corpora, we introduce Speechcake, a tool for centralized version control for speech corpora, enabling the automatic check-in and merging of annotations. It considers typical workflows of phoneticians, linguists and speech technologists, and enables the development of dynamic, collaborative, and perpetually-improving speech corpora.
Originalspracheenglisch
TitelProceedings of the 20th Conference on Natural Language Processing (KONVENS 2024)
Herausgeber (Verlag)Association for Computational Linguistics (ACL)
Seiten303-308
PublikationsstatusVeröffentlicht - 2024
Veranstaltung20th Conference on Natural Language Processing, KONVENS 2024 - Vienna, Österreich
Dauer: 10 Sept. 202413 Sept. 2024

Konferenz

Konferenz20th Conference on Natural Language Processing, KONVENS 2024
Land/GebietÖsterreich
OrtVienna
Zeitraum10/09/2413/09/24

Fingerprint

Untersuchen Sie die Forschungsthemen von „Version Control for Speech Corpora“. Zusammen bilden sie einen einzigartigen Fingerprint.

Dieses zitieren