Transfer Learning and Hyperparameter Optimization for Instance Segmentation with RGB-D Images in Reflective Elevator Environments

Reithmeier, Lukas; Krauss, Oliver; Zwettler, Adam Gerald

Full metadata record

DC field	Value	Language
dc.contributor.author	Reithmeier, Lukas
dc.contributor.author	Krauss, Oliver
dc.contributor.author	Zwettler, Adam Gerald
dc.contributor.editor	Skala, Václav
dc.date.accessioned	2021-09-01T07:24:17Z
dc.date.available	2021-09-01T07:24:17Z
dc.date.issued	2021
dc.identifier.citation	WSCG 2021: full papers proceedings: 29. International Conference in Central Europe on Computer Graphics, Visualization and Computer Vision, p. 273-282.	en
dc.identifier.isbn	978-80-86943-34-3
dc.identifier.issn	2464-4617
dc.identifier.issn	2464-4625 (CD/DVD)
dc.identifier.uri	http://hdl.handle.net/11025/45033
dc.format	10 p.	cs
dc.format.mimetype	application/pdf
dc.language.iso	en	en
dc.publisher	Václav Skala - UNION Agency	cs
dc.rights	© Václav Skala - UNION Agency	cs
dc.subject	segmentace instance	cs
dc.subject	RGB-D data	cs
dc.subject	přenosové učení	cs
dc.subject	reflexní prostředí	cs
dc.title	Transfer Learning and Hyperparameter Optimization for Instance Segmentation with RGB-D Images in Reflective Elevator Environments	en
dc.type	conferenceObject	en
dc.type	konferenční příspěvek	cs
dc.rights.access	openAccess	en
dc.type.version	publishedVersion	en
dc.description.abstract-translated	Elevators, a vital means for urban transportation, are generally lacking proper emergency call systems besides an emergency button. In the case of unconscious or otherwise incapacitated passengers this can lead to lethal situations. A camera-based surveillance system with AI-based alerts utilizing an elevator state machine can help passengers unable to initiate an emergency call. In this research work, the applicability of RGB-D images as input for instance segmentation in the highly reflective environment of an elevator cabin is evaluated. For object segmentation, a Region-based Convolution Neural Network (R-CNN) deep learning model is adapted to use depth input data besides RGB by applying transfer learning, hyperparameter optimization and re-training on a newly prepared elevator image dataset. Evaluations prove that with the chosen strategy, the accuracy of R-CNN instance segmentation is applicable on RGB-D data, thereby resolving lack of image quality in the noise affected and reflective elevator cabins. The mean average precision (mAP) of 0.753 is increased to 0.768 after the incorporation of additional depth data and with additional FuseNet-FPN backbone on RGB-D the mAP is further increased to 0.794. With the proposed instance segmentation model, reliable elevator surveillance becomes feasible as first prototypes and on-road tests proof.	en
dc.subject.translated	instance segmentation	en
dc.subject.translated	RGB-D data	en
dc.subject.translated	transfer learning	en
dc.subject.translated	reflective environments	en
dc.identifier.doi	https://doi.org/10.24132/CSRN.2021.3101.30
dc.type.status	Peer-reviewed	en
Appears in collections:	WSCG 2021: Full Papers Proceedings

Files in this item:

File	Description	Size	Format
J71.pdf	Full text	4.43 MB	Adobe PDF


Please use this identifier to cite or link to this item: http://hdl.handle.net/11025/45033

All items in DSpace are protected by copyright, with all rights reserved.
