Maximum a posteriori speech enhancement based on double spectrum

Pejman Mowlaee; Daniel Scheran; Johannes Stahl; Sean U.N. Wood; W. Bastiaan Kleijn

doi:10.21437/Interspeech.2019-1197

Maximum a posteriori speech enhancement based on double spectrum

Pejman Mowlaee, Daniel Scheran, Johannes Stahl, Sean U.N. Wood, W. Bastiaan Kleijn

Institut für Signalverarbeitung und Sprachkommunikation (4420)

Publikation: Beitrag in Buch/Bericht/Konferenzband › Beitrag in einem Konferenzband › Begutachtung

Abstract

While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

Originalsprache	englisch
Titel	Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019
Seiten	2738-2742
Seitenumfang	5
DOIs	https://doi.org/10.21437/Interspeech.2019-1197
Publikationsstatus	Veröffentlicht - 1 Jan. 2019
Veranstaltung	20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language: INTERSPEECH 2019 - Messe Congress Graz, Graz, Österreich Dauer: 15 Sept. 2019 → 19 Sept. 2019

Konferenz

Konferenz	20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language
Land/Gebiet	Österreich
Ort	Graz
Zeitraum	15/09/19 → 19/09/19

ASJC Scopus subject areas

Sprache und Linguistik
Human-computer interaction
Signalverarbeitung
Software
Modellierung und Simulation

Zugriff auf Dokument

10.21437/Interspeech.2019-1197

Andere Dateien und Links

http://www.scopus.com/inward/record.url?scp=85074717993&partnerID=8YFLogxK

Dieses zitieren

Mowlaee, P, Scheran, D, Stahl, J, Wood, SUN & Bastiaan Kleijn, W 2019, Maximum a posteriori speech enhancement based on double spectrum. in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019. S. 2738-2742, 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language, Graz, Österreich, 15/09/19. https://doi.org/10.21437/Interspeech.2019-1197

@inproceedings{5394373eb363482fac15cb0b43c21e0c,

title = "Maximum a posteriori speech enhancement based on double spectrum",

abstract = "While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.",

keywords = "Double Spectrum, MAP Estimator, Modulation Domain Processing, Speech Enhancement",

author = "Pejman Mowlaee and Daniel Scheran and Johannes Stahl and Wood, {Sean U.N.} and {Bastiaan Kleijn}, W.",

year = "2019",

month = jan,

day = "1",

doi = "10.21437/Interspeech.2019-1197",

language = "English",

pages = "2738--2742",

booktitle = "Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019",

note = "20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language : INTERSPEECH 2019 ; Conference date: 15-09-2019 Through 19-09-2019",

}

TY - GEN

T1 - Maximum a posteriori speech enhancement based on double spectrum

AU - Mowlaee, Pejman

AU - Scheran, Daniel

AU - Stahl, Johannes

AU - Wood, Sean U.N.

AU - Bastiaan Kleijn, W.

PY - 2019/1/1

Y1 - 2019/1/1

N2 - While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

AB - While the acoustic frequency domain has been widely used for speech enhancement, usage of the modulation domain is less common. In this paper, we investigate single-channel speech enhancement in the recently proposed Double Spectrum (DS) framework and provide insights on the statistical properties of speech and noise in the DS domain. Relying on our statistical analysis in the DS, we derive a maximum a posteriori estimator of speech in the DS domain. By means of experiments, we evaluate the speech enhancement performance of the proposed method and relevant benchmarks in the acoustic frequency and modulation domains and show that the proposed method achieves a good balance between noise attenuation and speech distortion for various SNRs and noise types.

KW - Double Spectrum

KW - MAP Estimator

KW - Modulation Domain Processing

KW - Speech Enhancement

UR - http://www.scopus.com/inward/record.url?scp=85074717993&partnerID=8YFLogxK

U2 - 10.21437/Interspeech.2019-1197

DO - 10.21437/Interspeech.2019-1197

M3 - Conference paper

AN - SCOPUS:85074717993

SP - 2738

EP - 2742

BT - Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH 2019

T2 - 20th Annual Conference of the International Speech Communication Association: Crossroads of Speech and Language

Y2 - 15 September 2019 through 19 September 2019

ER -

Maximum a posteriori speech enhancement based on double spectrum

Abstract

Konferenz

ASJC Scopus subject areas

Zugriff auf Dokument

Andere Dateien und Links

Fingerprint

Dieses zitieren