Learning and generalizing behaviors for robots from human demonstration

Fabisch, Alexander

Citation link: https://doi.org/10.26092/elib/382

File	Description	Size	Format
dissertation_pdfa.pdf		18.13 MB	Adobe PDF

Author:	Fabisch, Alexander
Supervisor:	Kirchner, Frank
1st reviewer:	Kirchner, Frank
Additional reviewers:	Rothkopf, Constantin
Abstract:	Behavior learning is a promising alternative to planning and control for behavior generation in robotics. The field is becoming increasingly popular in applications where modeling the environment and the robot is cumbersome, difficult, or even impossible. This dissertation tackles the challenge of learning behaviors for real robots that generalize over task parameters with as few interactions with the environment as possible. It is currently not apparent which problems behavior learning algorithms can solve and which algorithms robotics needs, because there are many related fields: imitation learning, reinforcement learning, self-supervised learning, and black-box optimization. After an extensive literature review, we decide to use methods from imitation learning and policy search to address the challenge. Specifically, we use human demonstrations recorded by motion capture systems and imitation learning with movement primitives to obtain initial behaviors, which we later generalize through contextual policy search. Imitation from motion capture data leads to the correspondence problem: the kinematic and dynamic capabilities of humans and robots are often fundamentally different, and we have to compensate for these differences. This thesis proposes a procedure for automatic embodiment mapping through optimization and policy search and evaluates it with several robotic systems. Contextual policy search algorithms are often not sample-efficient enough to learn directly on real robots. This thesis addresses the issue with active context selection, active training set selection, surrogate models, and manifold learning. The progress is illustrated with several simulated and real robot learning tasks. This part of the thesis also reveals and exploits strong connections between policy search and black-box optimization.
This thesis demonstrates that learning manipulation behaviors is possible within a few hundred episodes directly on a real robot. Furthermore, these new approaches to imitation learning and contextual policy search are integrated into a coherent framework that can be used to learn new behaviors from human motion capture data almost automatically. Corresponding implementations developed during this thesis are available as open source software.
Keywords:	Reinforcement Learning; Imitation Learning; Embodiment Mapping; Contextual Policy Search; Manifold Learning; Robotics
Publication date:	3-Dec-2020
Document type:	Dissertation
Secondary publication:	no
DOI:	10.26092/elib/382
URN:	urn:nbn:de:gbv:46-elib45853
Institution:	Universität Bremen
Faculty:	Fachbereich 03: Mathematik/Informatik (FB 03)
Appears in collections:	Dissertations

Page views:	683 (checked on 3-Apr-2025)
Downloads:	1,440 (checked on 3-Apr-2025)

All resources in this repository are protected by copyright.
