Achieving stationary speech enhancement in low signal-to-noise ratio (SNR) environments is a challenging problem. Because noise energy is dominant in noisy speech at low SNR level, the existence of numerous obvious random noises may lead neural network to forget some useful information obtained by early training. Moreover, it is difficult for a single neural network to obtain effective speech features and noise features. Therefore, this paper designs to utilize multiple neural networks in two stages to discriminately learn a certain type of noise features and reduce the introduction of interference. Experiment results demonstrate that proposed method leads to consistently better source-to-distortion ratio (SDR) and perceptual evaluation of speech quality (PESQ) than baseline models in low SNR condition. And the results indicate that the method can suppress the forgetting of early information of neural network.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.