Chinese Journal of Acoustics----Institute Of Acoustics Chinese Academy Of Sciences

Location:Home>Chinese Journal of Acoustics

Improvement of joint optimization of masks and deep recurrent neural networks for monaural speech separation using optimized activation functions(2020 No.3)

Author：

ArticleSource：

Update time：

2024/07/24

Viewed：

Text Size: A A A

Title: Improvement of joint optimization of masks and deep recurrent neural networks for monaural speech separation using optimized activation functions

Author(s): MASOOD Asim; YE Zhongfu;

Affiliation(s): National Engineering Laboratory for Speech and Language Information Processing, University of Science and Technology of China

Abstract: Single channel speech separation was a challenging task for speech separation community for last three decades. It is now possible to separate speeches using deep neural networks (DNN) and deep recurrent neural networks (DRNN) due to deep learning. Researchers are now trying to improve different models of DNN and DRNN for monaural speech separation. In this paper, we have tried to improve existing DRNN and DNN based model for speech separation by using optimized activation functions. Instead of using rectified linear unit (RELU), we have implemented leaky RELU, exponential linear unit, exponential function, inverse square root linear unit and inverse cubic root linear unit (ICRLU) as activation functions. ICRLU and exponential function are new activation functions proposed in this research work. These activation functions have overcome the dying RELU problem. They have achieved better separation results in comparison with RELU function and they have also reduced the computational cost of DNN and DRNN based monaural speech separation.

Copyright ？ 1996 - 2020 Institute of Acoustics, Chinese Academy of Sciences
No. 21 North 4th Ring Road, Haidian District, 100190 Beijing, China
E-mail: ioa@mail.ioa.ac.cn