Compared with the original speech, the replay attack speech passes through a complex channel mainly composed of a recording device and a playback device, and the frequency response of the channel causes a obvious change to the high and low frequency bands of the original speech spectrum. This paper proposed a Channel Difference Enhancement Cepstral Coefficient (CDECC) feature that enhances the channel frequency response difference, and detects the replay attack speech by enhancing the spectral difference caused by the channel frequency response. Experiments based on the ASVspoof 2017 Challenge data set show that the proposed method has a significant improvement in detection performance compared to the baseline system using Constant Q Cepstral Coefficients (CQCC), and the equal error rate (EER) is reduced by 18.20% under the same conditions, indicating that the performance of the CDECC feature is more effective than that of CQCC and MFCC features in detecting replay attack speech.
Access to the requested content is limited to institutions that have purchased or subscribe to SPIE eBooks.
You are receiving this notice because your organization may not have SPIE eBooks access.*
*Shibboleth/Open Athens users─please
sign in
to access your institution's subscriptions.
To obtain this item, you may purchase the complete book in print or electronic format on
SPIE.org.
INSTITUTIONAL Select your institution to access the SPIE Digital Library.
PERSONAL Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.