Real-time large vocabulary spontaneous speech recognition for spoken dialog systems

Švec, Jan; Šmídl, Luboš

Title:	Real-time large vocabulary spontaneous speech recognition for spoken dialog systems
Other Titles:	Rozpoznání spojité spontánní řeči s velkým slovníkem a v reálném čase pro dialogové systémy
Authors:	Švec, Jan Šmídl, Luboš
Citation:	ŠVEC, JAN; ŠMÍDL, Luboš. Real-time large vocabulary spontaneous speech recognition for spoken dialog systems. In: Proceeding of the 4th International congres on Image and signal processing (CISP), 15th-17th October 2011, Shanghai. Beijing: IEEE Press, 2011, p. 2458-2463. (Volume 5).ISBN 978-1-4244-9304-3.
Issue Date:	2011
Publisher:	IEEE Press
Document type:	článek article
URI:	http://www.kky.zcu.cz/cs/publications/JanSvec_2011_Real-timeLarge http://hdl.handle.net/11025/17164
ISBN:	978-1-4244-9304-3
Keywords:	rozpoznávání řeči;jazykové modelování;hlasové dialogové systémy
Keywords in different language:	speech recognition;language modelling;spoken dialog system
Abstract:	Článek popisuje modifikaci výchozího systému pro rozpoznávání řeči. Výsledný systém je vhodný pro použití v hlasovém dialogovém systému se smíšenou iniciativou a přirozeným vstupem. Jsou prezentovány tři přístupy pro rozšiření rozpoznávacího slovníku za účelem zajištění schopnosti rozpoznat všechny entity z dané domény. Dále je navržena metoda normalizace nespisovného textu. Experimenty provedené na korpusu spontánní řeči ukazují, že navržená metoda je velmi významná pro jazyky, kde se podstatně liší psaná formální podoba jazyka a obecná nespisnovná řeč. Celková chybovost slov (Word Error Rate) byla redukována o 16.7%.
Abstract in different language:	This paper describes the method for modifying the baseline speech recognition system to be suitable for a use in spoken dialog system with mixed initiative and natural user’s input. We present three approaches for extending the recognition vocabulary to ensure the spoken dialog system is able to recognize all entities in the given domain. The colloquial text normalization method is proposed. The experiments performed on spontaneous speech corpus suggested that the proposed method is very important for languages where the formal written language and a common colloquial speech are very different. The overall word error rate was reduced by 16.7%.
Rights:	© Jan Švec - Luboš Šmídl
Appears in Collections:	Články / Articles (KKY)

Files in This Item:

File	Description	Size	Format
JanSvec_2011_Real-timeLarge.pdf	Plný text	237,78 kB	Adobe PDF	View/Open

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/17164

search

navigation