Abstract
A novel scheme to analyze the effects of time variability of vocal tract for speaker recognition is proposed. We adopt a pitch synchronous feature extraction method to describe even more detailed characteristics of vocal tract, and decompose it into rapidly varying and slowly varying components with a specified linear filter along with time axis. Speaker identification tasks are performed with weighted combination of two decomposed feature sets and their corresponding models to show the efficiency of each decomposed feature set. Simulation results show that slowly varying components contain more speaker discriminative information than rapidly varying components do.
Original language | English |
---|---|
Pages | 2377-2380 |
Number of pages | 4 |
Publication status | Published - 2004 |
Event | 8th International Conference on Spoken Language Processing, ICSLP 2004 - Jeju, Jeju Island, Korea, Republic of Duration: 2004 Oct 4 → 2004 Oct 8 |
Other
Other | 8th International Conference on Spoken Language Processing, ICSLP 2004 |
---|---|
Country/Territory | Korea, Republic of |
City | Jeju, Jeju Island |
Period | 04/10/4 → 04/10/8 |
All Science Journal Classification (ASJC) codes
- Language and Linguistics
- Linguistics and Language