Frequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verification

dc.contributor.VIAF19968651 (Hansen, JHL)en_US
dc.contributor.authorXing, Huaen_US
dc.contributor.authorLiu, Gangen_US
dc.contributor.authorHansen, John H. L.en_US
dc.contributor.utdAuthorXing, Hua
dc.contributor.utdAuthorLiu, Gang
dc.contributor.utdAuthorHansen, John H. L.
dc.date.accessioned2016-10-27T18:17:41Z
dc.date.available2016-10-27T18:17:41Z
dc.date.created2015-09-06en_US
dc.description.abstractCommunication system mismatch represents a major influence for loss in speaker recognition performance. This paper considers a type of nonlinear communication system mismatch- modulation/ demodulation (Mod/DeMod) carrier drift in single sideband (SSB) speech signals. We focus on the problem of estimating frequency offset in SSB speech in order to improve speaker verification performance of the drifted speech. Based on a two-step framework from previous work, we propose using a multi-layered neural network architecture, stacked denoising autoencoder (SDA), to determine the unique interval of the offset value in the first step. Experimental results demonstrate that the SDA based system can produce up to a +16.1% relative improvement in frequency offset estimation accuracy. A speaker verification evaluation shows a +65.9% relative improvement in EER when SSB speech signal is compensated with the frequency offset value estimated by the proposed method.en_US
dc.identifier.bibliographicCitationXing, H., G. Liu, and J. H. L. Hansen. 2015. "Frequency offset correction in single sideband (SSB) speech by deep neural network for speaker verification." INTERSPEECH 2015 (16th Annual Conference of the International Speech Communication Association) p. 1156-1160.en_US
dc.identifier.issn2308-457Xen_US
dc.identifier.urihttp://hdl.handle.net/10735.1/5108
dc.identifier.volume2015en_US
dc.language.isoenen_US
dc.publisherInternational Speech and Communication Associationen_US
dc.relation.urihttp://www.isca-speech.org/archive/interspeech_2015/i15_1156.htmlen_US
dc.rights©2015 ISCAen_US
dc.source.journalINTERSPEECH 2015en_US
dc.subjectPattern recognition systemsen_US
dc.subjectVoice frequencyen_US
dc.subjectRadio, Single-sidebanden_US
dc.subjectSpeaker verificationen_US
dc.subjectNoiseen_US
dc.subjectSound analyzersen_US
dc.subjectNeural networks (Computer science)en_US
dc.titleFrequency Offset Correction in Single Sideband (SSB) Speech by Deep Neural Network for Speaker Verificationen_US
dc.type.genrearticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JECS-3626-4714.34.pdf
Size:
128.18 KB
Format:
Adobe Portable Document Format
Description:
Article