Generování obrazových dat pro účely trénování hlubokých neuronových sítí

Andrlík, Pavel

Title:	Generování obrazových dat pro účely trénování hlubokých neuronových sítí
Other Titles:	Image data generation for the purpose of deep neural network training
Authors:	Andrlík, Pavel
Advisor:	Hrúz Marek, Ing. Ph.D.
Referee:	Neduchal Petr, Ing.
Issue Date:	2020
Publisher:	Západočeská univerzita v Plzni
Document type:	bakalářská práce
URI:	http://hdl.handle.net/11025/41862
Keywords:	generování dat;syntetická data;rozpoznávání textu;zpravodajské relace;neuronové sítě;obrazová data
Keywords in different language:	data generation;synthetic data;text recognition;news sessions;neural network;image data
Abstract:	Cílem bakalářské práce je vytvoření a implementace algoritmu pro generování obrázků s texty, imitující texty zpravodajských relací. Tyto obrázky budou následně sloužit pro trénování umělých neuronových sítí pro rozpoznávání textů v obraze. V první části práce jsou analyzovány zpravodajské relace, respektive je zjišťována struktura a rozložení obrazových dat. Dále jsou analyzovány používané texty a znaky včetně používaných fontů. Zjišťováno je také, jaká se nejčastěji objevují slova, speciální nebo v běžném jazyce neobvyklé znaky a symboly a kombinace těchto neobvyklých znaků s používanými slovy nebo druhy textů, jako jsou například jména, místa, povolání, politické strany a podobně. V druhé části se zabývám konkrétním návrhem algoritmu, jeho zobecněním pro možnost použití u různých stanic provozujících zpravodajské relace a v poslední řadě jeho implementací v konkrétním programovacím jazyce. V poslední části je experimentálně ověřena kvalita dosažených výsledků. Jaccardův index (také IoU) rozpoznaných oblastí s textem s použitím stávající natrénované sítě je průměrně 0,7. Dále jsou tyto výsledky diskutovány a podrobněji rozebrány. Úplně na závěr je diskutováno navrhnutí možných vylepšení a budoucí práce navázané na tuto práci.
Abstract in different language:	The purpose of this bachelor thesis is to create and implement an algorithm for generating images with texts imitating the texts of news sessions. These images will be used to train artificial neural networks for text-in-image recognition. In the first part of the work, the news sessions are analyzed, more precisely the structure and distribution of image data is discovered. In addition to, the used texts and characters, including the used fonts, are analyzed. Ascertained are also the most common words, special or unusual characters and symbols in common language, and combinations of these unusual characters with words or types of text used, such as names, places, professions, political parties, and so on. The second part is focused on a specific design of the algorithm, its generalization for the possibility of using for various news sessions and finally its implementation in a specific programming language. In the last part, the quality of the achieved results is experimentally verified. The Jaccard index (also IoU) of recognized areas with text using the existing trained network is on average 0.7. These results are then discussed and examined. Finally, the proposal of possible improvements and future work related to this work is discussed.
Rights:	Plný text práce je přístupný bez omezení.
Appears in Collections:	Bakalářské práce / Bachelor´s works (KKY)

Files in This Item:

File	Description	Size	Format
BP Pavel Andrlik.pdf	Plný text práce	3,63 MB	Adobe PDF	View/Open
andrlik-p.pdf	Průběh obhajoby práce	305,47 kB	Adobe PDF	View/Open
andrlik-v.pdf	Posudek vedoucího práce	688,1 kB	Adobe PDF	View/Open
andrlik-o.pdf	Posudek oponenta práce	757,56 kB	Adobe PDF	View/Open

Show full item record

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/41862

search

navigation