Abstract
Studies have been conducted to get a clean data from non-stationary noisy signal, which is one of the areas in speech enhancement. Since conventional methods rely on first-order statistics, the effort to eliminate noise using deep learning method is intensive. In the real environment, many types of noises are mixed with the target sound, resulting in difficulty to remove only noises. However, most of previous works modeled a small amount of non-stationary noise, which is hard to be applied in real world. To cope with this problem, we propose a novel deep learning model to enhance the auditory signal with adversarial learning of two types of discriminators. One discriminator learns to distinguish a clean signal from the enhanced one by the generator, and the other is trained to recognize the difference between eliminated noise signal and real noise signal. In other words, the second discriminator learns the waveform of noise. Besides, a novel learning method is proposed to stabilize the unstable adversarial learning process. Compared with the previous works, to verify the performance of the propose model, we use 100 kinds of noise. The experimental results show that the proposed model has better performance than other conventional methods including the state-of-the-art model in removing non-stationary noise. To evaluate the performance of our model, the scale-invariant source-to-noise ratio is used as an objective evaluation metric. The proposed model shows a statistically significant performance of 5.91 compared with other methods in t-test.
Original language | English |
---|---|
Title of host publication | Intelligent Data Engineering and Automated Learning – IDEAL 2019 - 20th International Conference, Proceedings |
Editors | Hujun Yin, Richard Allmendinger, David Camacho, Peter Tino, Antonio J. Tallón-Ballesteros, Ronaldo Menezes |
Publisher | Springer |
Pages | 367-374 |
Number of pages | 8 |
ISBN (Print) | 9783030336066 |
DOIs | |
Publication status | Published - 2019 |
Event | 20th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2019 - Manchester, United Kingdom Duration: 2019 Nov 14 → 2019 Nov 16 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 11871 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 20th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2019 |
---|---|
Country/Territory | United Kingdom |
City | Manchester |
Period | 19/11/14 → 19/11/16 |
Bibliographical note
Funding Information:Acknowledgment. This work was supported by grant funded by 2019 IT promotion fund (Development of AI based Precision Medicine Emergency System) of the Korean government (Ministry of Science and ICT).
Funding Information:
This work was supported by grant funded by 2019 IT promotion fund (Development of AI based Precision Medicine Emergency System) of the Korean government (Ministry of Science and ICT).
Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
All Science Journal Classification (ASJC) codes
- Theoretical Computer Science
- Computer Science(all)