Online speaker adaptation of an acoustic model using face recognition

Campr, Pavel; Pražák, Aleš; Psutka, Josef V.; Psutka, Josef

Title:	Online speaker adaptation of an acoustic model using face recognition
Other Titles:	Online adaptace akustického modelu na řečníka s využitím systému pro rozpoznávání obličejů
Authors:	Campr, Pavel Pražák, Aleš Psutka, Josef V. Psutka, Josef
Citation:	CAMPR, Pavel; PRAŽÁK, Aleš; PSUTKA, Josef V.; PSUTKA, Josef. Online speaker adaptation of an acoustic model using face recognition. In: Text, speech and dialogue. Berlin: Springer, 2013, p. 378-385. (Lectures notes in computer science; 8082). ISBN 978-3-642-40584-6.
Issue Date:	2013
Publisher:	Springer
Document type:	článek article
URI:	http://www.kky.zcu.cz/cs/publications/CamprPavel_2013_OnlineSpeaker http://hdl.handle.net/11025/17203
ISBN:	978-3-642-40584-6
Keywords:	akustický model;adaptace na řečníka;rozpoznávání obličeje;multimodální zpracování;automatické rozpoznávání řeči
Keywords in different language:	acoustic model;speaker adaptation;face recognition;multimodal processing;automatic speech recognition
Abstract in different language:	We have proposed and evaluated a novel approach for online speaker adaptation of an acoustic model based on face recognition. Instead of traditionally used audio-based speaker identification we investigated the video modality for the task of speaker detection. A simulated on-line transcription created by a Large-Vocabulary Continuous Speech Recognition (LVCSR) system for online subtitling is evaluated utilizing speaker independent acoustic models, gender dependent models and models of particular speakers. In the experiment, the speaker dependent acoustic models were trained offline, and are switched online based on the decision of a face recognizer, which reducedWord Error Rate (WER) by 12% relatively compared to speaker independent baseline system.
Rights:	© Pavel Campr - Aleš Pražák - Josef V. Psutka - Josef Psutka
Appears in Collections:	Články / Articles (NTIS) Články / Articles (KKY)

Files in This Item:

File	Description	Size	Format
CamprPavel_2013_OnlineSpeaker.pdf	Plný text	264,95 kB	Adobe PDF	View/Open

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/17203

search

navigation