Full metadata record
DC poleHodnotaJazyk
dc.contributor.authorPöllabauer, Thomas
dc.contributor.authorKnauthe, Volker
dc.contributor.authorBoller, André
dc.contributor.authorKuijper, Arjan
dc.contributor.authorFellner, Dieter W.
dc.contributor.editorSkala, Václav
dc.date.accessioned2024-07-21T09:27:46Z-
dc.date.available2024-07-21T09:27:46Z-
dc.date.issued2024-
dc.identifier.citationJournal of WSCG. 2024, vol. 32, no. 1-2, p. 101-110.en
dc.identifier.issn1213 – 6972
dc.identifier.issn1213 – 6980 (CD-ROM)
dc.identifier.issn1213 – 6964 (on-line)
dc.identifier.urihttp://hdl.handle.net/11025/57349
dc.format10 s.cs_CZ
dc.format10 s.cs
dc.format.mimetypeapplication/pdf
dc.language.isoenen
dc.publisherVáclav Skala - UNION Agencycs
dc.rights© Václav Skala - UNION Agencycs_CZ
dc.rights© Václav Skala - UNION Agencyen
dc.subjectstrojové učenícs
dc.subjectdetekce objektucs
dc.subjectsegmentace objektůcs
dc.subjecthluboké neuronové sítěcs
dc.titleFast Training Data Acquisition for Object Detection and Segmentation using Black Screen Luminance Keyingen
dc.typečlánekcs
dc.typearticleen
dc.rights.accessopenAccessen
dc.type.versionpublishedVersion-
dc.description.abstract-translatedDeep Neural Networks (DNNs) require large amounts of annotated training data for a good performance. Often this data is generated using manual labeling (error-prone and time-consuming) or rendering (requiring geometry and material information). Both approaches make it difficult or uneconomic to apply them to many small-scale applications. A fast and straightforward approach of acquiring the necessary training data would allow the adoption of deep learning to even the smallest of applications. Chroma keying is the process of replacing a color (usually blue or green) with another background. Instead of chroma keying, we propose luminance keying for fast and straightforward training image acquisition. We deploy a black screen with high light absorption (99.99%) to record roughly 1-minute long videos of our target objects, circumventing typical problems of chroma keying, such as color bleeding or color overlap between background color and object color. Next we automatically mask our objects using simple brightness thresholding, saving the need for manual annotation. Finally, we automatically place the objects on random backgrounds and train a 2D object detector. We do extensive evaluation of the performance on the widely-used YCB-V object set and compare favourably to other conventional techniques such as rendering, without needing 3D meshes, materials or any other information of our target objects and in a fraction of the time needed for other approaches. Our work demonstrates highly accurate training data acquisition allowing to start training state-of-the-art networks within minutesen
dc.subject.translatedmachine learningen
dc.subject.translatedobject detectionen
dc.subject.translatedobject segmentationen
dc.subject.translateddeep neural networksen
dc.identifier.doihttps://www.doi.org/10.24132/JWSCG.2024.11
dc.type.statusPeer-revieweden
Vyskytuje se v kolekcích:Volume 32, number 1-2 (2024)

Soubory připojené k záznamu:
Soubor Popis VelikostFormát 
C89-2024.pdfPlný text4,78 MBAdobe PDFZobrazit/otevřít


Použijte tento identifikátor k citaci nebo jako odkaz na tento záznam: http://hdl.handle.net/11025/57349

Všechny záznamy v DSpace jsou chráněny autorskými právy, všechna práva vyhrazena.