Title: Using morphological information for robust language modeling in czech ASR system
Other Titles: Využití morfologické informace pro robustní jazykové modelování v českém ASR systému
Authors: Ircing, Pavel
Psutka, Josef
Psutka, Josef V.
Citation: IRCING, Pavel; PSUTKA, Josef; PSUTKA, Josef V. Using morphological information for robust language modeling in czech ASR system. In: Audio, speech, and language processing, 17, 4, p. 840-847. ISSN 1558-7916.
Issue Date: 2009
Publisher: IEEE Press
Document type: článek
article
URI: http://www.kky.zcu.cz/cs/publications/IrcingPavel_2009_UsingMorphological
http://hdl.handle.net/11025/17177
ISSN: 1558-7916
Keywords: syntéza řeči;rozpoznávání;jazykové modelování
Keywords in different language: speech synthesis;recognition;language modelling
Abstract: Článek ukazuje, že použití bohatých morfologických značek v rámci třídového n-gramového jazykového modelu a kombinace tohoto modelu se standardním slovním n-gramovým modelem může zlepšit přesnost rozpoznávání oproti slovnímu modelu v úloze automatického přepisu spontánních českých rozhovorů.
Abstract in different language: Automatic speech recognition, or more precisely language modeling, of the Czech language has to face challenges that are not present in the language modeling of English. Those include mainly the rapid vocabulary growth and closely connected unreliable estimates of the language model parameters. These phenomena are caused mostly by the highly inflectional nature of the Czech language. On the other hand, the rich morphology together with the well-developed automatic systems for morphological tagging can be exploited to reinforce the language model probability estimates. This paper shows that using rich morphological tags within the concept of class-based n-gram language model with many-to-many word-to-class mapping and combination of this model with the standard word-based n-gram can improve the recognition accuracy over the word-based baseline on the task of automatic transcription of unconstrained spontaneous Czech interviews.
Rights: © Pavel Ircing - Josef V. Psutka - Josef Psutka
Appears in Collections:Články / Articles (KKY)

Files in This Item:
File Description SizeFormat 
IrcingPavel_2009_UsingMorphological.pdfPlný text418,48 kBAdobe PDFView/Open


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/17177

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.