LAST: Language-guided Skill Learning with Temporal Variational Inference

We present an algorithm for skill discovery from expert demonstrations. The algorithm utilizes Large Language Models (LLMs) to first generate an initial segmentation of the trajectories. Following that, our proposed hierarchical variational inference framework incorporates additional information generated from the LLM to discover reusable skills by merging trajectory segments. Additionally, the algorithm introduces a novel auxiliary objective based on the Minimum Description Length principle that helps guide this skill discovery process. Our results demonstrate that agents learned this way discover skills that help accelerate learning on new long-horizon tasks in BabyAI, a gridworld navigation environment, as well as ALFRED, a household simulation environment.

Language-guided Skill Learning with Temporal Variational Inference

ICML 2024

Abstract

Put a chilled apple on the counter

Put a microwaved potato in the sink

Put two credit cards on a table

Skills discovered

Visualize a skill