Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

Publisher

International Speech and Communication Association

Abstract

Communication system mismatch is a major cause of performance loss in speaker recognition. This paper considers one type of nonlinear communication system mismatch: modulation/demodulation (Mod/DeMod) carrier drift in single sideband (SSB) speech signals. We focus on estimating the frequency offset in SSB speech in order to improve speaker verification performance on the drifted speech. Building on a two-step framework from previous work, we propose using a multi-layered neural network architecture, the stacked denoising autoencoder (SDA), to determine the unique interval of the offset value in the first step. Experimental results demonstrate that the SDA-based system can produce up to a 16.1% relative improvement in frequency offset estimation accuracy. A speaker verification evaluation shows a 65.9% relative improvement in equal error rate (EER) when the SSB speech signal is compensated with the frequency offset value estimated by the proposed method.

Keywords

Pattern recognition systems, Voice frequency, Radio, Single-sideband, Speaker verification, Noise, Sound analyzers, Neural networks (Computer science)

Rights

©2015 ISCA
