TY - GEN
T1 - Emotional Speech Synthesis Based on Style Embedded Tacotron2 Framework
AU - Kwon, Ohsung
AU - Jang, Inseon
AU - Ahn, Chunghyun
AU - Kang, Hong Goo
PY - 2019/6
Y1 - 2019/6
N2 - In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST); where the emotion-related style information is presented by an additional style embedding vector. Although the GST is not a new idea, no one has been utilized the idea for an emotional speech synthesis task. We explicitly combine the GST idea with the Tacotron2 framework to implement an emotional text-to-speech system. The analysis results demonstrate that the proposed GST structure successfully transfers various types of emotional information to the synthesized speech. Subjective listening tests to evaluate the naturalness and emotional expression of synthesized speech are conducted to verify the superiority of the proposed algorithm.
AB - In this paper, we propose a speech synthesis system that effectively generates multiple types of emotional speech using the concept of global style token (GST); where the emotion-related style information is presented by an additional style embedding vector. Although the GST is not a new idea, no one has been utilized the idea for an emotional speech synthesis task. We explicitly combine the GST idea with the Tacotron2 framework to implement an emotional text-to-speech system. The analysis results demonstrate that the proposed GST structure successfully transfers various types of emotional information to the synthesized speech. Subjective listening tests to evaluate the naturalness and emotional expression of synthesized speech are conducted to verify the superiority of the proposed algorithm.
UR - http://www.scopus.com/inward/record.url?scp=85071500622&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85071500622&partnerID=8YFLogxK
U2 - 10.1109/ITC-CSCC.2019.8793393
DO - 10.1109/ITC-CSCC.2019.8793393
M3 - Conference contribution
T3 - 34th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2019
BT - 34th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 34th International Technical Conference on Circuits/Systems, Computers and Communications, ITC-CSCC 2019
Y2 - 23 June 2019 through 26 June 2019
ER -