Learning and generalizing behaviors for robots from human demonstration

Fabisch, Alexander

doi:10.26092/elib/382

Zitierlink DOI

10.26092/elib/382

Learning and generalizing behaviors for robots from human demonstration

Veröffentlichungsdatum

2020-12-03

Autoren

Fabisch, Alexander

Betreuer

Kirchner, Frank

Gutachter

Rothkopf, Constantin

Zusammenfassung

Behavior learning is a promising alternative to planning and control for behavior generation in robotics. The field is becoming more and more popular in applications where modeling the environment and the robot is cumbersome, difficult, or maybe even impossible.

Learning behaviors for real robots that generalize over task parameters with as few interactions with the environment as possible is a challenge that this dissertation tackles. Which problems we can currently solve with behavior learning algorithms and which algorithms we need in the domain of robotics is not apparent at the moment as there are many related fields: imitation learning, reinforcement learning, self-supervised learning, and black-box optimization.

After an extensive literature review, we decide to use methods from imitation learning and policy search to address the challenge. Specifically, we use human demonstrations recorded by motion capture systems and imitation learning with movement primitives to obtain initial behaviors that we later on generalize through contextual policy search.

Imitation from motion capture data leads to the correspondence problem: the kinematic and dynamic capabilities of humans and robots are often fundamentally different and, hence, we have to compensate for that. This thesis proposes a procedure for automatic embodiment mapping through optimization and policy search and evaluates it with several robotic systems.

Contextual policy search algorithms are often not sample efficient enough to learn directly on real robots. This thesis tries to solve the issue with active context selection, active training set selection, surrogate models, and manifold learning. The progress is illustrated with several simulated and real robot learning tasks. Strong connections between policy search and black-box optimization are revealed and exploited in this part of the thesis. This thesis demonstrates that learning manipulation behaviors is possible within a few hundred episodes directly on a real robot.

Furthermore, these new approaches to imitation learning and contextual policy search are integrated in a coherent framework that can be used
to learn new behaviors from human motion capture data almost automatically. Corresponding implementations that were developed during this thesis are available in an open source software.

Schlagwörter

Reinforcement Learning

;

Imitation Learning

;

Embodiment Mapping

;

Contextual Policy Search

;

Manifold Learning

;

Robotics

Institution

Universität Bremen

Fachbereich

Fachbereich 03: Mathematik/Informatik (FB 03)

Dokumenttyp

Dissertation

Zweitveröffentlichung

Nein

Sprache

Englisch

Dateien

Name

dissertation_pdfa.pdf

Size

17.71 MB

Format

Adobe PDF

Checksum

(MD5):334384b26021fbd7c1bd072e807d891d