I-Vector Based Physical Task Stress Detection with Different Fusion Strategies

dc.contributor.VIAF: 19968651 (Hansen, JHL) [en_US]
dc.contributor.author: Zhang, C. [en_US]
dc.contributor.author: Liu, G. [en_US]
dc.contributor.author: Yu, C. [en_US]
dc.contributor.author: Hansen, John H. L. [en_US]
dc.contributor.utdAuthor: Zhang, C.
dc.contributor.utdAuthor: Liu, G.
dc.contributor.utdAuthor: Yu, C.
dc.contributor.utdAuthor: Hansen, John H. L.
dc.date.accessioned: 2016-10-27T18:17:42Z
dc.date.available: 2016-10-27T18:17:42Z
dc.date.created: 2015-09-06 [en_US]
dc.description.abstract: It is common for subjects to produce speech while performing a physical task where speech technology may be used. Variability is introduced into speech because a physical task can influence human speech production. This variability degrades the performance of most speech systems. It is therefore vital to detect speech produced under physical task stress for subsequent algorithm processing. This study presents a method for detecting physical task stress from speech. Inspired by the fact that i-vectors can generally model total factors from speech, a state-of-the-art i-vector framework is investigated with MFCCs and our previously formulated TEO-CB-Auto-Env features for neutral/physical task stress detection. Since MFCCs are derived from a linear speech production model and TEO-CB-Auto-Env features employ a nonlinear operator, these two features are believed to have complementary effects on physical task stress detection. Two alternative fusion strategies (feature-level and score-level fusion) are investigated to validate this hypothesis. Experiments over the UT-Scope Physical Corpus demonstrate that a relative accuracy gain of 2.68% is obtained when fusing i-vectors based on different features. An additional relative performance boost of 6.52% in accuracy is achieved using score-level fusion. [en_US]
dc.description.sponsorship: Funded by AFRL (contract #FA8750-12-1-0188) [en_US]
dc.identifier.bibliographicCitation: Zhang, C., G. Liu, C. Yu, and J. H. L. Hansen. 2015. "I-vector based physical task stress detection with different fusion strategies." INTERSPEECH 2015 (16th Annual Conference of the International Speech Communication Association), p. 2689-2693. [en_US]
dc.identifier.issn: 2308-457X [en_US]
dc.identifier.uri: http://hdl.handle.net/10735.1/5110
dc.identifier.volume: 2015 [en_US]
dc.language.iso: en [en_US]
dc.publisher: International Speech and Communication Association [en_US]
dc.relation.uri: http://www.isca-speech.org/archive/interspeech_2015/i15_2689.html [en_US]
dc.rights: ©2015 ISCA [en_US]
dc.source.journal: INTERSPEECH 2015 [en_US]
dc.subject: Speech perception [en_US]
dc.subject: Vector analysis [en_US]
dc.subject: Stress (Physiology) [en_US]
dc.title: I-Vector Based Physical Task Stress Detection with Different Fusion Strategies [en_US]
dc.type.genre: article [en_US]

Files

Original bundle

Name: JECS-3626-4719.04.pdf
Size: 240.95 KB
Format: Adobe Portable Document Format
Description: Article

License bundle

Name: International Speech Communication Association.pdf
Size: 94.27 KB
Format: Adobe Portable Document Format
Description: