Články / Articles (KKY)

Collection records (sorted by submission date in descending order): 81 to 100 of 174
Kolář, Jáchym; Liu, Yang; Shriberg, Elizabeth
Genre effects on automatic sentence segmentation of speech: A comparison of broadcast news and broadcast conversations

We investigate genre effects on the task of automatic sentence segmentation, focusing on two important domains – broadcast news (BN) and broadcast conversation (BC). We employ an HMM model based on textual and prosodic information and analyze differences in segmentation ac...

Psutka, Josef; Švec, Jan; Psutka, Josef V.; Vaněk, Jan; Pražák, Aleš; Šmídl, Aleš; Ircing, Pavel
System for fast lexical and phonetic spoken term detection in a Czech cultural heritage archive

The main objective of the work presented in this paper was to develop a complete system that would accomplish the original visions of the MALACH project. Those goals were to employ automatic speech recognition and information retrieval techniques to provide improved access to the ...

Psutka, Josef V.; Vaněk, Jan; Psutka, Josef
Speaker-clustered acoustic models evaluated on GPU for on-line subtitling of parliament meetings

This paper describes the effort of building speaker-clustered acoustic models as part of the real-time LVCSR system that has been used for more than a year by Czech TV for automatic subtitling of parliament meetings broadcast on the channel ČT24. Speaker-clustered acoustic models ...

Jurčíček, Filip; Švec, Jan; Zahradil, Jiří; Jelínek, Libor
Use of negative examples in training the HVS semantic model

This paper describes the use of negative examples in training the HVS semantic model. We present a novel initialization of the lexical model using negative examples extracted automatically from a semantic corpus, as well as a description of an algorithm for extracting these examples. We e...

Kanis, Jakub; Müller, Luděk
Using the lemmatization technique for phonetic transcription in text-to-speech system

This paper deals with a lemmatization technique and its use for phonetic transcription of exceptional words. The lemmatizer is based on language morphology and uses a lexicon of basic word forms and a set of inversion derivation rules to acquire lemmatization rules, which are...

Kanis, Jakub; Müller, Luděk
Automatic lemmatizer construction with focus on OOV words lemmatization

This paper deals with the automatic construction of a lemmatizer from a Full Form - Lemma (FFL) training dictionary and with lemmatization of new words unseen in the FFL dictionary, i.e. out-of-vocabulary (OOV) words. Three methods of lemmatization of three kinds of OOV words (missing ...

Kanis, Jakub; Zelinka, Jan; Müller, Luděk
Automatic numbers normalization in inflectional languages

This paper is devoted to the text normalization module in our text-to-speech synthesis system. We focused on converting numerals written as figures into a readable full-length form. Numeral conversion is a significant issue in inflectional languages such as Czech, Russian or Slovak becau...

Kanis, Jakub; Müller, Luděk
Using lemmatization technique for automatic diacritics restoration

This paper is devoted to automatic construction of a lemmatizer from a Full Form - Lemma (FFL) training dictionary, and to lemmatization of new words unseen in the FFL dictionary, i.e. out-of-vocabulary (OOV) words. Three methods of lemmatization of three kinds of OOV words (miss...

Kanis, Jakub; Zahradil, Jiří; Jurčíček, Filip; Müller, Luděk
Czech-sign speech corpus for semantic based machine translation

This paper describes progress in the development of a human-human dialogue corpus for machine translation of spoken language. We have chosen a semantically annotated corpus of phone calls to a train timetable information center. The phone calls consist of inquiries regarding their train...

Kanis, Jakub; Müller, Luděk
Automatic Czech – sign speech translation

This paper is devoted to the problem of automatic translation between Czech and SC in both directions. We introduced our simple monotone phrase-based decoder, SiMPaD, suitable for fast translation, and compared its results with the results of the state-of-the-art phrase-based decoder - ...

Kanis, Jakub
Interactive HamNoSys notation editor for signed speech annotation

This paper discusses experience with annotating signs of signed speech and with creating a domain-specific lexicon. The domain-specific lexicon is primarily proposed for an automatic signed speech synthesizer. The symbolic notation system based on HamNoSys notation has been ad...

Kanis, Jakub; Müller, Luděk
Advances in Czech – signed speech translation

This article describes advances in Czech - Signed Speech translation. A method using a new criterion based on the minimal loss principle for log-linear model phrase extraction was introduced and evaluated against two other criteria. The performance of the phrase table extracted with ...

Kanis, Jakub; Peňáz, Petr; Campr, Pavel; Hrúz, Marek
A methodology for automatic sign language dictionary creation

In this article we present the sign language dictionary being developed by a research team from the University of West Bohemia, Masaryk University and Palacký University. The aim is to create both an explanatory and a translation dictionary with respect to the linguistic...

Kanis, Jakub; Hrúz, Marek; Campr, Pavel
Metodika pro automatizovanou tvorbu slovníku znakového jazyka (A methodology for automated creation of a sign language dictionary)

Kanis, Jakub
On-line slovník znakového jazyka – přístup přes internet (On-line sign language dictionary: access via the Internet)

Kolář, Jáchym; Müller, Luděk
The application of Bayesian information criterion in acoustic model refinement

Automatic speech recognition (ASR) systems usually consist of an acoustic model and a language model. This paper describes a technique for efficient deployment of the acoustic model parameters. The acoustic model typically utilizes Continuous Density Hidden Markov Models (CDHMM). The outpu...

Kolář, Jáchym; Romportl, Jan; Psutka, Josef
The Czech speech and prosody database both for ASR and TTS purposes

This paper describes the preparation of the first large Czech prosodic database, which should be useful both in automatic speech recognition (ASR) and text-to-speech (TTS) synthesis. In the area of ASR we intend to use it for automatic punctuation annotation, in the area of ...

Kolář, Jáchym; Švec, Jan; Psutka, Josef
Automatic punctuation annotation in Czech broadcast news speech

This paper reports our initial experiments with automatic punctuation annotation from speech. We have focused on Czech broadcast news speech. We employed two statistical models: a prosodic model and a language model. The prosodic model expresses relationships between prosodic quantities (such as...

Kolář, Jáchym; Švec, Jan; Strassel, Stephanie; Walker, Christopher; Kozlíková, Dagmar; Psutka, Josef
Czech spontaneous speech corpus with structural metadata

This paper describes a Czech spontaneous speech corpus consisting of radio talk show recordings. As the first complete non-English MDE corpus, it has been annotated with structural metadata information beyond the words that is critical to both increasing transcript readability and allowing ...

Kolář, Jáchym; Shriberg, Elizabeth; Liu, Yang
On speaker-specific prosodic models for automatic dialog act segmentation of multi-party meetings

We explore speaker-specific prosodic modeling for dialog act segmentation of speech from the ICSI Meeting Corpus. We ask whether features beyond pauses help individual speakers, and whether some speakers benefit from prosody models trained on only their speech. We find positive results ...
