Phase-Processing For Voice Activity Detection: A Statistical Approach

Johannes Stahl, Pejman Mowlaee Beikzadehmahaleh, Josef Kulmer

Research output: Chapter in Book/Report/Conference proceedingConference paperpeer-review

Abstract

Conventional voice activity detectors (VAD) mostly rely on the magnitude of the complex valued DFT spectral coefficients. In this paper, the circular variance of the Discrete Fourier transform (DFT) coefficients is investigated in terms of its ability to represent speech activity in noise. To this end we
model the circular variance as a random variable with different underlying distributions for the speech and the noise class. Based on this, we derive a binary hypothesis test relying only on the
circular variance estimated from the noisy speech. The experimental results show a reasonable VAD performance justifying that amplitude-independent information can characterize speech
in a convenient way.
Original languageEnglish
Title of host publicationEUSIPCO 2016
DOIs
Publication statusPublished - Aug 2016

Fingerprint

Dive into the research topics of 'Phase-Processing For Voice Activity Detection: A Statistical Approach'. Together they form a unique fingerprint.

Cite this