A Speech Processing Strategy Based on Sinusoidal Speech Model for Cochlear Implant Users

dc.contributor.ORCID0000-0002-4676-4917 (Tan, C-T)
dc.contributor.VIAF78509491 (Tan, C-T)
dc.contributor.authorLee, Sungmin
dc.contributor.authorAkbarzadeh, Sara
dc.contributor.authorSingh, Satnam
dc.contributor.authorTan, Chin-Tuan
dc.contributor.utdAuthorLee, Sungmin
dc.contributor.utdAuthorAkbarzadeh, Sara
dc.contributor.utdAuthorSingh, Satnam
dc.contributor.utdAuthorTan, Chin-Tuan
dc.date.accessioned2019-11-18T21:40:21Z
dc.date.available2019-11-18T21:40:21Z
dc.date.created2019-11-12
dc.descriptionDue to copyright restrictions full text access from Treasures at UT Dallas is restricted to current UTD affiliates (use the provided Link to Article).
dc.description.abstractIn sinusoidal modeling (SM), speech signal, which is pseudo-periodic in structure, can be approximated by sinusoids and noise without losing significant speech information. A speech processing strategy based on this sinusoidal speech model will be relevant for encoding electric pulse streams in cochlear implant (CI) processing, where the number of channels available is limited. In this study, 5 normal hearing (NH) listeners and 2 CI users were asked to perform the task of speech recognition and perceived sound quality rating on speech sentences processed in 12 different test conditions. The sinusoidal analysis/synthesis algorithm was limited to 1, 3 or 6 sinusoids from the sentences low-pass filtered at either 1 kHz, 1.5 kHz, 3 kHz, or 6 kHz, re-synthesized as the test conditions. Each of 12 lists of AzBio sentences was randomly chosen and process with one of 12 test conditions, before they were presented to each participant at 65 dB SPL (Sound Pressure Level). Participant was instructed to repeat the sentence as they perceived, and the number of words correctly recognized was scored. They were also asked to rate the perceived sound quality of the sentences including original speech sentence, on the scale of 1 (distorted) to 10 (clean). Both speech recognition score and perceived sound quality rating across all participants increase when the number of sinusoids increases and low-pass filter broadens. Our current finding showed that three sinusoids may be sufficient to elicit the nearly maximum speech intelligibility and quality necessary for both NH and CI listeners. Sinusoidal speech model has the potential in facilitating the basis for a speech processing strategy in CI. ©2018 APSIPA.
dc.description.departmentErik Jonsson School of Engineering and Computer Science
dc.description.departmentSchool of Behavioral and Brain Sciences
dc.identifier.bibliographicCitationLee, S., S. Akbarzadeh, S. Singh, and C. Tuan-Tan. 2019. "A Speech Processing Strategy Based on Sinusoidal Speech Model for Cochlear Implant Users." 2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference: 393-397, doi: 10.23919/APSIPA.2018.8659620
dc.identifier.isbn9789881476852
dc.identifier.urihttps://hdl.handle.net/10735.1/7121
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.
dc.relation.isPartOf2018 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference,
dc.relation.urihttps://dx.doi.org/10.23919/APSIPA.2018.8659620
dc.rights©2018 APSIPA
dc.subjectHearing
dc.subjectCochlear implants
dc.subjectSound—Recording and reproducing
dc.subjectSpeech, Intelligibility of
dc.subjectSpeech processing systems
dc.subjectSpeech perception
dc.subjectHearing impaired
dc.titleA Speech Processing Strategy Based on Sinusoidal Speech Model for Cochlear Implant Users
dc.type.genrearticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
JECS-7046-260413.44-LINK.pdf
Size:
165.02 KB
Format:
Adobe Portable Document Format
Description:
Link to Article