Data simulation in deep learning-based human recognition

DSpace Repositorium (Manakin basiert)

Zur Kurzanzeige

dc.contributor.advisor Curio, Cristóbal (Prof. Dr.-Ing.)
dc.contributor.author Burgermeister (geb. Ludl), Dennis
dc.date.accessioned 2023-04-25T15:39:37Z
dc.date.available 2023-04-25T15:39:37Z
dc.date.issued 2023-04-25
dc.identifier.uri http://hdl.handle.net/10900/139699
dc.identifier.uri http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-1396990 de_DE
dc.identifier.uri http://dx.doi.org/10.15496/publikation-81046
dc.description.abstract Human recognition is an important part of perception systems, such as those used in autonomous vehicles or robots. These systems often use deep neural networks for this purpose, which rely on large amounts of data that ideally cover various situations, movements, visual appearances, and interactions. However, obtaining such data is typically complex and expensive. In addition to raw data, labels are required to create training data for supervised learning. Thus, manual annotation of bounding boxes, keypoints, orientations, or actions performed is frequently necessary. This work addresses whether the laborious acquisition and creation of data can be simplified through targeted simulation. If data are generated in a simulation, information such as positions, dimensions, orientations, surfaces, and occlusions are already known, and appropriate labels can be generated automatically. A key question is whether deep neural networks, trained with simulated data, can be applied to real data. This work explores the use of simulated training data using examples from the field of pedestrian detection for autonomous vehicles. On the one hand, it is shown how existing systems can be improved by targeted retraining with simulation data, for example to better recognize corner cases. On the other hand, the work focuses on the generation of data that hardly or not occur at all in real standard datasets. It will be demonstrated how training data can be generated by targeted acquisition and combination of motion data and 3D models, which contain finely graded action labels to recognize even complex pedestrian situations. Through the diverse annotation data that simulations provide, it becomes possible to train deep neural networks for a wide variety of tasks with one dataset. In this work, such simulated data is used to train a novel deep multitask network that brings together diverse, previously mostly independently considered but related, tasks such as 2D and 3D human pose recognition and body and orientation estimation. en
dc.language.iso en de_DE
dc.publisher Universität Tübingen de_DE
dc.rights ubt-podok de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de de_DE
dc.rights.uri http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en en
dc.subject.classification Neuronales Netz , Deep learning , Fußgänger , Maschinelles Sehen , Klassifikation de_DE
dc.subject.ddc 004 de_DE
dc.title Data simulation in deep learning-based human recognition en
dc.type PhDThesis de_DE
dcterms.dateAccepted 2023-04-04
utue.publikation.fachbereich Informatik de_DE
utue.publikation.fakultaet 7 Mathematisch-Naturwissenschaftliche Fakultät de_DE
utue.publikation.noppn yes de_DE

Dateien:

Das Dokument erscheint in:

Zur Kurzanzeige