Phonetically optimized speaker modeling for robust speaker recognition

Bong Jin Lee, Jeung Yoon Choi, Hong Goo Kang

Research output: Contribution to journalArticlepeer-review

4 Citations (Scopus)

Abstract

This paper proposes an efficient method to improve speaker recognition performance by dynamically controlling the ratio of phoneme class information. It utilizes the fact that each phoneme contains different amounts of speaker discriminative information that can be measured by mutual information. After classifying phonemes into five classes, the optimal ratio of each class in both training and testing processes is adjusted using a non-linear optimization technique, i.e., the Nelder-Mead method. Speaker identification results verify that the proposed method achieves 18% improvement in terms of error rate compared to a baseline system.

Original languageEnglish
Pages (from-to)EL100-EL106
JournalJournal of the Acoustical Society of America
Volume126
Issue number3
DOIs
Publication statusPublished - 2009

All Science Journal Classification (ASJC) codes

  • Arts and Humanities (miscellaneous)
  • Acoustics and Ultrasonics

Fingerprint

Dive into the research topics of 'Phonetically optimized speaker modeling for robust speaker recognition'. Together they form a unique fingerprint.

Cite this