Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

Xing, Hua; Liu, Gang; Hansen, John H. L.

Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

dc.contributor.VIAF	19968651 (Hansen, JHL)	en_US
dc.contributor.author	Xing, Hua	en_US
dc.contributor.author	Liu, Gang	en_US
dc.contributor.author	Hansen, John H. L.	en_US
dc.contributor.utdAuthor	Xing, Hua
dc.contributor.utdAuthor	Liu, Gang
dc.contributor.utdAuthor	Hansen, John H. L.
dc.date.accessioned	2016-10-27T18:17:41Z
dc.date.available	2016-10-27T18:17:41Z
dc.date.created	2015-09-06	en_US
dc.description.abstract	Communication system mismatch represents a major influence for loss in speaker recognition performance. This paper considers a type of nonlinear communication system mismatch- modulation/ demodulation (Mod/DeMod) carrier drift in single sideband (SSB) speech signals. We focus on the problem of estimating frequency offset in SSB speech in order to improve speaker verification performance of the drifted speech. Based on a two-step framework from previous work, we propose using a multi-layered neural network architecture, stacked denoising autoencoder (SDA), to determine the unique interval of the offset value in the first step. Experimental results demonstrate that the SDA based system can produce up to a +16.1% relative improvement in frequency offset estimation accuracy. A speaker verification evaluation shows a +65.9% relative improvement in EER when SSB speech signal is compensated with the frequency offset value estimated by the proposed method.	en_US
dc.identifier.bibliographicCitation	Xing, H., G. Liu, and J. H. L. Hansen. 2015. "Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification." INTERSPEECH 2015 (16th Annual Conference of the International Speech Communication Association) p. 1156-1160.	en_US
dc.identifier.issn	2308-457X	en_US
dc.identifier.uri	http://hdl.handle.net/10735.1/5108
dc.identifier.volume	2015	en_US
dc.language.iso	en	en_US
dc.publisher	International Speech and Communication Association	en_US
dc.relation.uri	http://www.isca-speech.org/archive/interspeech_2015/i15_1156.html	en_US
dc.rights	©2015 ISCA	en_US
dc.source.journal	INTERSPEECH 2015	en_US
dc.subject	Pattern recognition systems	en_US
dc.subject	Voice frequency	en_US
dc.subject	Radio, Single-sideband	en_US
dc.subject	Speaker verification	en_US
dc.subject	Noise	en_US
dc.subject	Sound analyzers	en_US
dc.subject	Neural networks (Computer science)	en_US
dc.title	Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification	en_US
dc.type.genre	article	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: JECS-3626-4714.34.pdf
Size:: 128.18 KB
Format:: Adobe Portable Document Format
Description:: Article

Download

Collections

Hansen, John H. L.