A German distant speech recognizer based on 3D beamforming and harmonic missing data mask.

Research output: Chapter in Book/Report/Conference proceeding › Conference paper › Peer-reviewed

Abstract

This paper addresses the problem of distant speech recognition in reverberant, noisy conditions using a star-shaped microphone array and missing data techniques. The performance of the system is evaluated on a German database that has been contaminated with noise from an apartment of the DIRHA (Distant Speech Interaction for Robust Home Applications) project. The proposed system is composed of three blocks. First, a beamformer yields an enhanced single-channel signal by filtering the multi-channel signals and then summing them. To optimize the filter weights, we apply convex (CVX) optimization over three spatial dimensions, given the spatiotemporal position of the target speaker as prior knowledge. Second, the beamformer output is used to extract the pitch and to estimate the stationary part of the background noise. Third, the system produces a final noise estimate by combining the stationary noise part with the harmonic noise estimate obtained from the pitch. Finally, the filter-bank representation of the enhanced signal and the corresponding missing data mask obtained from this final noise estimate are sent to the speech recognition back-end. The purpose of this paper is to analyze the impact of employing a beamformer followed by a missing data technique.
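
The first and last stages described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `filter_and_sum` and `missing_data_mask` are hypothetical, the filters are supplied by hand rather than obtained by convex optimization, the noise estimate is taken as given, and the 0 dB SNR threshold for the binary mask is an assumption made only for this example.

```python
from math import log10


def filter_and_sum(channels, filters):
    """Filter each microphone channel with its own FIR filter, then sum.

    channels: list of equal-length sample lists, one per microphone.
    filters:  list of FIR coefficient lists, one per microphone
              (hand-picked here; the paper optimizes them with CVX).
    """
    n = len(channels[0])
    out = [0.0] * n
    for x, h in zip(channels, filters):
        for t in range(n):
            acc = 0.0
            # Causal FIR convolution, truncated to the input length.
            for k, hk in enumerate(h):
                if t - k >= 0:
                    acc += hk * x[t - k]
            out[t] += acc
    return out


def missing_data_mask(noisy_energy, noise_energy, snr_threshold_db=0.0):
    """Binary mask over a filter-bank representation.

    A cell is marked reliable (1) when the estimated local SNR exceeds
    the threshold, otherwise unreliable (0). The threshold value is an
    assumption of this sketch.
    """
    floor = 1e-12  # avoids division by zero and log of zero
    mask = []
    for frame_y, frame_n in zip(noisy_energy, noise_energy):
        row = []
        for y, n in zip(frame_y, frame_n):
            speech = max(y - n, floor)  # subtraction-style speech estimate
            snr_db = 10.0 * log10(speech / max(n, floor))
            row.append(1 if snr_db > snr_threshold_db else 0)
        mask.append(row)
    return mask


# Two microphones: the impulse reaches the second one sample later,
# so the first channel is delayed by one sample before summing.
channels = [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0]]
filters = [[0.0, 0.5], [0.5]]
print(filter_and_sum(channels, filters))  # -> [0.0, 1.0, 0.0, 0.0]

# One frame with two filter-bank channels and a flat noise estimate.
print(missing_data_mask([[10.0, 1.0]], [[1.0, 1.0]]))  # -> [[1, 0]]
```

In the toy input, the two impulses are aligned and add coherently, which is the effect a beamformer steered at the target speaker aims for. The mask marks only the first filter-bank cell as reliable, because the second cell contains no energy above the noise estimate.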
Original language: English
Title: AIA-DAGA 2013: Proceedings of the International Conference on Acoustics
Pages: 2049-2052
ISBN (electronic): 9783939296058
Publication status: Published - 2013
Event: 39. Jahrestagung für Akustik: AIA-DAGA 2013 - Meran, Italy
Duration: 18 March 2013 to 21 March 2013

Conference

Conference: 39. Jahrestagung für Akustik
Short title: DAGA 2013
Country/Territory: Italy
City: Meran
Period: 18/03/13 to 21/03/13

Fields of Expertise

  • Information, Communication & Computing

Treatment code (more detailed classification)

  • Basic - Fundamental (basic research)
  • Application
