Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

Xing, Hua; Liu, Gang; Hansen, John H. L.

Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

Files

JECS-3626-4714.34.pdf (128.18 KB)

Authors

Xing, Hua

Liu, Gang

Hansen, John H. L.

Publisher

International Speech and Communication Association

URI

http://hdl.handle.net/10735.1/5108

Abstract

Communication system mismatch represents a major influence for loss in speaker recognition performance. This paper considers a type of nonlinear communication system mismatch- modulation/ demodulation (Mod/DeMod) carrier drift in single sideband (SSB) speech signals. We focus on the problem of estimating frequency offset in SSB speech in order to improve speaker verification performance of the drifted speech. Based on a two-step framework from previous work, we propose using a multi-layered neural network architecture, stacked denoising autoencoder (SDA), to determine the unique interval of the offset value in the first step. Experimental results demonstrate that the SDA based system can produce up to a +16.1% relative improvement in frequency offset estimation accuracy. A speaker verification evaluation shows a +65.9% relative improvement in EER when SSB speech signal is compensated with the frequency offset value estimated by the proposed method.

Keywords

Pattern recognition systems, Voice frequency, Radio, Single-sideband, Speaker verification, Noise, Sound analyzers, Neural networks (Computer science)

Rights

Collections

Hansen, John H. L.

Full item page

Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

Files

Date

Authors

ORCID

Journal Title

Journal ISSN

Volume Title

Publisher

item.page.doi

URI

Abstract

Description

Keywords

item.page.sponsorship

Rights

Citation

Collections