Temporal normalization techniques for transform-type speech coding and application to split-band wideband coders

Kyung Tae Kim, Sung Kyo Jung, Mi Suk Lee, Hong Goo Kang, Dae Hee Youn

Research output: Contribution to conference › Paper › peer-review

Abstract

In this paper, we present an efficient method for coding the upper band (4-7 kHz) in wideband (0.5-7 kHz) speech coding based on a band-split approach. Because the upper-band signal has impulse-like characteristics, it is very difficult to quantize efficiently at low bit rates with transform coding techniques. We propose two temporal normalization techniques, direct temporal energy normalization and frequency-domain linear prediction, to reduce the highly noticeable artifacts that result. Simulation results show that the proposed algorithm successfully encodes the upper-band signal, and that a new split-band wideband coder adopting the proposed technique provides better quality at 20 kbit/s than ITU-T G.722 at 56 kbit/s.
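
As a rough illustration of what direct temporal energy normalization could look like, the sketch below flattens the temporal envelope of one frame by dividing each sub-frame by its RMS gain, so that an impulse-like burst no longer dominates the frame handed to a transform coder. It is not taken from the paper: the function names, the sub-frame count, the use of RMS as the energy measure, and the gain floor are all assumptions made for illustration, and a real coder would also have to quantize and transmit the gains.

```python
import math


def normalize_temporal_energy(frame, num_subframes=4, floor=1e-9):
    """Hypothetical helper: divide each sub-frame by its RMS gain.

    Returns the normalized samples and the per-sub-frame gains that a
    decoder would need in order to restore the original envelope.
    """
    if len(frame) % num_subframes != 0:
        raise ValueError("frame length must be a multiple of num_subframes")
    size = len(frame) // num_subframes
    normalized, gains = [], []
    for start in range(0, len(frame), size):
        sub = frame[start:start + size]
        rms = math.sqrt(sum(x * x for x in sub) / size)
        gain = max(rms, floor)  # floor avoids division by zero in silence
        gains.append(gain)
        normalized.extend(x / gain for x in sub)
    return normalized, gains


def restore_temporal_energy(normalized, gains):
    """Hypothetical inverse: re-apply the sub-frame gains (unquantized here)."""
    size = len(normalized) // len(gains)
    return [x * gains[i // size] for i, x in enumerate(normalized)]


# Toy frame: silence, a strong burst, a weak tail, silence.
frame = [0.0] * 4 + [8.0, -8.0, 8.0, -8.0] + [0.1, -0.1, 0.1, -0.1] + [0.0] * 4
normalized, gains = normalize_temporal_energy(frame)
restored = restore_temporal_energy(normalized, gains)
```

After normalization, the burst and the weak tail both have unit RMS, which is the property that makes the frame easier to quantize uniformly; the envelope information moves into the gains instead.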

Original language: English
Pages: 2661-2664
Number of pages: 4
Publication status: Published - 2004
Event: 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of
Duration: 2004 Oct 4 to 2004 Oct 8

Bibliographical note

Funding Information:
This work was supported by the Electronics and Telecommunications Research Institute (ETRI).

All Science Journal Classification (ASJC) codes

  • Language and Linguistics
  • Linguistics and Language
