Robust acoustic modeling and front-end design for distant speech recognition
In recent years, there has been a significant increase in the popularity of voice-enabled technologies that use human speech as the primary interface with machines. Recent advancements in acoustic modeling and feature ...
Advances in Audiovisual Speech Processing for Robust Voice Activity Detection and Automatic Speech Recognition
Speech processing systems are widely used in existing commercial applications, including virtual assistants in smartphones and home assistant devices. Speech-based commands provide convenient hands-free functionality for ...
Deep Neural Networks and Model-Based Approaches for Robust Speaker Diarization in Naturalistic Audio Streams
Speaker diarization is an unsupervised task that determines "who spoke and when" within an input audio stream. It consists of four sub-systems: (i) speech activity detection (SAD); (ii) speaker segmentation and modeling; ...