Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying

Pöllabauer, Thomas; Knauthe, Volker; Boller, André; Kuijper, Arjan; Fellner, Dieter W.

Full metadata record

DC field	Value	Language
dc.contributor.author	Pöllabauer, Thomas
dc.contributor.author	Knauthe, Volker
dc.contributor.author	Boller, André
dc.contributor.author	Kuijper, Arjan
dc.contributor.author	Fellner, Dieter W.
dc.contributor.editor	Skala, Václav
dc.date.accessioned	2024-07-21T09:27:46Z	-
dc.date.available	2024-07-21T09:27:46Z	-
dc.date.issued	2024	-
dc.identifier.citation	Journal of WSCG. 2024, vol. 32, no. 1-2, p. 101-110.	en
dc.identifier.issn	1213-6972
dc.identifier.issn	1213-6980 (CD-ROM)
dc.identifier.issn	1213-6964 (on-line)
dc.identifier.uri	http://hdl.handle.net/11025/57349
dc.format	10 s.	cs_CZ
dc.format	10 s.	cs
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Václav Skala - UNION Agency	cs
dc.rights	© Václav Skala - UNION Agency	cs_CZ
dc.rights	© Václav Skala - UNION Agency	en
dc.subject	strojové učení	cs
dc.subject	detekce objektu	cs
dc.subject	segmentace objektů	cs
dc.subject	hluboké neuronové sítě	cs
dc.title	Fast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keying	en
dc.type	článek	cs
dc.type	article	en
dc.rights.access	openAccess	en
dc.type.version	publishedVersion	-
dc.description.abstract-translated	Deep Neural Networks (DNNs) require large amounts of annotated training data for good performance. Often this data is generated using manual labeling (error-prone and time-consuming) or rendering (requiring geometry and material information). Both approaches are difficult or uneconomic to apply to many small-scale applications. A fast and straightforward approach to acquiring the necessary training data would allow the adoption of deep learning in even the smallest of applications. Chroma keying is the process of replacing a color (usually blue or green) with another background. Instead of chroma keying, we propose luminance keying for fast and straightforward training image acquisition. We deploy a black screen with high light absorption (99.99%) to record roughly 1-minute-long videos of our target objects, circumventing typical problems of chroma keying, such as color bleeding or color overlap between background color and object color. Next, we automatically mask our objects using simple brightness thresholding, removing the need for manual annotation. Finally, we automatically place the objects on random backgrounds and train a 2D object detector. We extensively evaluate performance on the widely used YCB-V object set and compare favourably to other conventional techniques such as rendering, without needing 3D meshes, materials, or any other information about our target objects, and in a fraction of the time needed by other approaches. Our work demonstrates highly accurate training data acquisition that allows training of state-of-the-art networks to start within minutes.	en
dc.subject.translated	machine learning	en
dc.subject.translated	object detection	en
dc.subject.translated	object segmentation	en
dc.subject.translated	deep neural networks	en
dc.identifier.doi	https://www.doi.org/10.24132/JWSCG.2024.11
dc.type.status	Peer-reviewed	en
Appears in collections:	Volume 32, number 1-2 (2024)
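
The abstract describes masking objects recorded in front of a black screen by brightness thresholding and then placing them on random backgrounds. The following is a minimal sketch of that idea, not the authors' implementation. The function names, the Rec. 709 luma weights, and the threshold of 0.05 are all assumptions chosen for illustration; the record does not state which values the paper uses.

```python
import numpy as np

def luminance_mask(frame_rgb, threshold=0.05):
    """Return a boolean mask that is True where a pixel is brighter than threshold.

    Assumption: luma is computed with Rec. 709 weights on values scaled to [0, 1].
    """
    rgb = frame_rgb.astype(np.float32) / 255.0
    luma = rgb @ np.array([0.2126, 0.7152, 0.0722], dtype=np.float32)
    return luma > threshold

def bounding_box(mask):
    """Return (x_min, y_min, x_max, y_max) of the mask, or None if it is empty."""
    ys, xs = np.nonzero(mask)
    if ys.size == 0:
        return None
    return int(xs.min()), int(ys.min()), int(xs.max()), int(ys.max())

def composite(frame_rgb, mask, background_rgb):
    """Keep masked object pixels and take every other pixel from the background."""
    return np.where(mask[..., None], frame_rgb, background_rgb)

# Synthetic stand-in for one video frame: a bright object on a black screen.
frame = np.zeros((6, 8, 3), dtype=np.uint8)
frame[2:5, 3:6] = (200, 120, 40)

mask = luminance_mask(frame)
box = bounding_box(mask)

# Random noise stands in for a randomly chosen background image.
rng = np.random.default_rng(0)
background = rng.integers(0, 256, size=frame.shape, dtype=np.uint8)
training_image = composite(frame, mask, background)
```

Here the mask and the bounding box derived from it would serve as the automatically generated segmentation and detection labels for the composited image. Real footage would likely need a threshold tuned to the camera and lighting.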

Files in this item:

File	Description	Size	Format
C89-2024.pdf	Full text	4.78 MB	Adobe PDF	View/open

Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/57349

All items in DSpace are protected by copyright, with all rights reserved.