A two-channel minimum mean-square error log-spectral amplitude estimator for speech enhancement

Min Seok Choi, Hong Goo Kang

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

13 Citations (Scopus)

Abstract

This paper proposes a novel two-channel speech enhancement structure based on the minimum mean-square error log-spectral amplitude (MMSE-LSA) estimator. The proposed algorithm exploits the spatial relationship between the two input signals to accurately estimate the noise power spectral density (PSD) required by the MMSE-LSA algorithm. The structure improves noise reduction with less speech distortion, while its complexity is much lower than that of simple cascade structures. Performance is evaluated by automatic speech recognition tests in a car environment. Compared with a simple cascade of two-channel and single-channel algorithms, the proposed algorithm improves the relative recognition rate by 17.5% for high-speed conditions and 14.8% for low-speed conditions.
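For orientation, the single-channel MMSE-LSA gain that such a structure builds on can be sketched as follows. This is the standard textbook form of the gain, G = ξ/(1+ξ) · exp(½·E1(v)) with v = ξγ/(1+ξ), and not the two-channel noise PSD estimator proposed in the paper, which the abstract does not specify. The names `exp_int_e1`, `lsa_gain`, `xi` (a priori SNR) and `gamma` (a posteriori SNR) are illustrative assumptions, and the SNR values are taken as given rather than estimated from two microphone signals.

```python
import math

EULER_GAMMA = 0.5772156649015329


def exp_int_e1(x: float) -> float:
    """Exponential integral E1(x), the integral of exp(-t)/t from x to infinity (x > 0)."""
    if x <= 0.0:
        raise ValueError("E1 is only evaluated here for x > 0")
    if x <= 1.0:
        # Power series: E1(x) = -gamma - ln(x) - sum_{k>=1} (-x)^k / (k * k!)
        total = -EULER_GAMMA - math.log(x)
        term = 1.0  # holds (-x)^k / k!
        for k in range(1, 60):
            term *= -x / k
            total -= term / k
        return total
    # Continued fraction evaluated with the modified Lentz method (x > 1).
    tiny = 1e-300
    b = x + 1.0
    c = 1.0 / tiny
    d = 1.0 / b
    h = d
    for i in range(1, 500):
        a = -float(i * i)
        b += 2.0
        d = 1.0 / (a * d + b)
        c = b + a / c
        delta = c * d
        h *= delta
        if abs(delta - 1.0) < 1e-14:
            break
    return h * math.exp(-x)


def lsa_gain(xi: float, gamma: float) -> float:
    """Textbook MMSE-LSA spectral gain for one frequency bin.

    xi    -- a priori SNR (assumed given; depends on the noise PSD estimate)
    gamma -- a posteriori SNR (noisy power divided by the noise PSD estimate)
    """
    wiener = xi / (1.0 + xi)
    v = wiener * gamma
    return wiener * math.exp(0.5 * exp_int_e1(v))


if __name__ == "__main__":
    # Hypothetical SNR values for a single bin, chosen only for illustration.
    print(lsa_gain(xi=1.0, gamma=2.0))
```

Both SNR terms depend on the noise PSD, which is why the accuracy of the noise PSD estimate matters for this estimator. In this sketch the gain is always at least the Wiener gain ξ/(1+ξ), since the exponential factor is greater than one.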

Original language: English
Title of host publication: 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008
Pages: 152-155
Number of pages: 4
Publication status: Published - 2008
Event: 2008 Hands-free Speech Communication and Microphone Arrays, HSCMA 2008 - Trento, Italy
Duration: 2008 May 6 → 2008 May 8

Publication series

Name: 2008 Hands-free Speech Communication and Microphone Arrays, Proceedings, HSCMA 2008

Other

Other: 2008 Hands-free Speech Communication and Microphone Arrays, HSCMA 2008
Country/Territory: Italy
City: Trento
Period: 08/5/6 → 08/5/8

All Science Journal Classification (ASJC) codes

  • Hardware and Architecture
  • Electrical and Electronic Engineering
  • Communication
