Learning unsupervised disentangled skill latents to adapt unseen task and morphological modifications

Taewoo Kim, Pamul Yadav, Ho Suk, Shiho Kim

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

Learning adaptable policies in the absence of explicit reward signals is a challenging problem in reinforcement learning. We propose an algorithm that disentangles the morphology-aware variables from skill latent space for adapting quickly to unseen morphological changes. Through several task learning experiments using MuJoCo Ant environments, we demonstrate that the agent can perform zero-shot inference and adapt to mild modification in morphology within an expected performance range. Furthermore, in the case of severe unseen morphological damage to the agent's body, with the help of add-on training steps, we can subjoin additional value incorporated into the disentangled latent space without catastrophically destroying the pre-trained network. We observe that the proposed separable-skill based method outperforms prior evolutionary meta-learning-based approaches, and the presented approach opens up research direction toward reinforcement learning for open-world novelty. Our source code is available at:https://github.com/boratw/sd4m.

Original languageEnglish
Article number105367
JournalEngineering Applications of Artificial Intelligence
Volume116
DOIs
Publication statusPublished - 2022 Nov

Bibliographical note

Publisher Copyright:
© 2022 The Author(s)

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Artificial Intelligence
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'Learning unsupervised disentangled skill latents to adapt unseen task and morphological modifications'. Together they form a unique fingerprint.

Cite this