Články / Articles (KKY)

Collection records (sorted by submission date, descending): 141 to 160 of 174
Romportl, Jan , Kala, Jiří
Prosody modelling in Czech text-to-speech synthesis

This paper describes data-driven modelling of all three basic prosodic features - fundamental frequency, intensity and segmental duration - in the Czech text-to-speech system ARTIC. The fundamental frequency is generated by a model based on concatenation of automatically acquired intonational pat...

Romportl, Jan
Prosodic phrases and semantic accents in speech corpus for Czech TTS synthesis

We describe a statistical method for assignment of prosodic phrases and semantic accents in read speech data. The method is based on statistical evaluation of listening test data by a maximum-likelihood approach with parameters estimated by an EM algorithm. We also present linguisticall...

Romportl, Jan
Statistical evaluation of prosodic phrases in the Czech language

The present paper understands prosodic phrases as units which take part in constituting the rhythmical structure of speech. Due to very subjective and inconsistent criteria for the prosodic phrase perception there must be an objectively underlain method for prosodic phrase assignment. This...

Savran, Arman , Celiktutan, Oya , Akyol, Aydin , Trojanová, Jana , Dibeklioglu, Hamdi , Esenlik, Semih , Bozkurt, Nesli , Demirkir, Cem , Akagunduz, Erdem , Caliskan, Kerem , Alyuz, Nese , Sankur, Bulent , Ulusoy, Ilkay , Akarun, Lale , Sezgin, Tevfik Metin
3D face recognition performance under adversarial conditions

We address the question of 3D face recognition and expression understanding under adverse conditions like illumination, pose, and accessories. We therefore conduct a campaign to build a 3D face database including systematic variation of poses, different types of occlusions, and a rich s...

Skorkovská, Lucie , Ircing, Pavel
Experiments with automatic query formulation in the extended boolean model

This paper concentrates on experiments with automatic creation of queries from natural language topics, suitable for use in the Extended Boolean information retrieval system. Because of the lack and/or inadequacy of the available methods, we propose a new method, based on pairing t...

Skorkovská, Lucie , Ircing, Pavel , Pražák, Aleš , Lehečka, Jan
Automatic topic identification for large scale language modeling data filtering

The paper presents a module for topic identification that is embedded into a complex system for acquisition and storing large volumes of text data from the Web. The module processes each of the acquired data items and assigns keywords to them from a defined topic hierarch...

Soutner, Daniel , Müller, Luděk
Application of LSTM neural networks in language modelling

Artificial neural networks have become state-of-the-art in the task of language modelling on small corpora. While feed-forward networks are able to take into account only a fixed context length to predict the next word, recurrent neural networks (RNN) can take advantage of all...

Strassel, Stephanie , Kolář, Jáchym , Song, Zhiyi , Barclay, Leila , Glenn, Meghan
Structural metadata annotation: moving beyond English

The goal of metadata extraction (MDE) is to enable technology that can take raw speech-to-text output and refine it into forms that are more useful to humans and to downstream automatic processes. Starting in 2003, a structural metadata annotation task was defined for English ...

Švec, Jan , Jurčíček, Filip , Müller, Luděk
Parameterization of the input in training the HVS semantic parser

The aim of this paper is to present an extension of the hidden vector state semantic parser. First, we describe the statistical semantic parsing and its decomposition into the semantic and the lexical model. Subsequently, we present the original hidden vector state parser. Then,...

Trmal, Jan , Vaněk, Jan , Müller, Luděk , Zelinka, Jan
Independent components for acoustic modeling

In the paper, we present a comparative study of several methods used nowadays in the field of feature and information extraction. We compared several Independent Component Analysis (ICA) algorithms together with the commonly used Principal Component Analysis (PCA) algorithm in two real-world...

Trmal, Jan , Zelinka, Jan , Vaněk, Jan , Müller, Luděk
Silence/speech detection method based on set of decision graphs

In the paper we demonstrate a complex supervised learning method based on binary decision graphs. This method is employed in construction of a silence/speech detector. Performance of the resulting silence/speech detector is compared with performance of common silence/speech detectors...

Trmal, Jan , Zelinka, Jan , Müller, Luděk
Adaptation of a feedforward artificial neural network using a linear transform

In this paper we present a novel method for adaptation of a multi-layer perceptron neural network (MLP ANN). Nowadays, the adaptation of the ANN is usually done as an incremental retraining either of a subset or the complete set of the ANN parameters. However,...

Trmal, Jan , Hrúz, Marek
Evaluation of feature space transforms for Czech sign-language recognition

In the paper we give a brief introduction into sign language recognition and present a particular research task, where the access to MetaCentrum computing facilities was highly beneficial. Although the problem of signed speech recognition is currently being researched into by many resea...

Trmal, Jan , Pražák, Aleš , Loose, Zdeněk , Psutka, Josef
Online TV captioning of Czech parliamentary sessions

In the paper we introduce the on-line captioning system developed by our teams and used by the Czech Television (CTV), the public service broadcaster in the Czech Republic. The research project is targeted at incorporation of speech technologies into the CTV environment. One of...

Trmal, Jan , Zelinka, Jan , Müller, Luděk
On speaker adaptive training of artificial neural networks

In the paper we present two techniques improving the recognition accuracy of multilayer perceptron neural networks (MLP ANN) by means of adopting Speaker Adaptive Training. The use of the MLP ANN, usually in combination with the TRAPS parametrization, includes applications in speech rec...

Trojanová, Jana , Hrúz, Marek , Campr, Pavel , Železný, Miloš
Design and recording of Czech audio-visual database with impaired conditions for continuous speech recognition

In this paper we discuss the design, acquisition and preprocessing of a Czech audio-visual speech corpus. The corpus is intended for training and testing of an existing audio-visual speech recognition system. The name of the database is UWB-07-ICAVR, where ICAVR stands for Impaired Conditi...

Vaněk, Jan , Trmal, Jan , Psutka, Josef V. , Psutka, Josef
Optimization of the Gaussian mixture model evaluation on GPU

In this paper we present a highly optimized implementation of Gaussian mixture acoustic model evaluation algorithm. Evaluation of these likelihoods is one of the most computationally intensive parts of automatic speech recognizers but it can be well-parallelized and offloaded ...

Vaněk, Jan , Trmal, Jan , Psutka, Josef V. , Psutka, Josef
Full covariance Gaussian mixture models evaluation on GPU

Gaussian mixture models (GMMs) are often used in various data processing and classification tasks to model a continuous probability density in a multi-dimensional space. In cases where the dimension of the feature space is relatively high (e.g. in the automatic speech recognition (ASR)...

Vaněk, Jan , Psutka, Josef
Anti-models: an alternative way to discriminative training

Traditional discriminative training methods modify Hidden Markov Model (HMM) parameters obtained via a Maximum Likelihood (ML) criterion based estimator. In this paper, anti-models are introduced instead. The anti-models are used in tandem with ML models to incorporate a discriminative information...

Vaněk, Jan , Psutka, Josef V. , Zelinka, Jan , Pražák, Aleš , Psutka, Josef
Discriminative training of gender-dependent acoustic models

The main goal of this paper is to explore the methods of gender-dependent acoustic modeling that would take the possibility of imperfect function of a gender detector into consideration. Such methods will be beneficial in real-time recognition tasks (e.g. real-time subtitling of meetings)...
