Abstract

Pre-training Reinforcement Learning (RL) agents in a task-agnostic manner has shown promising results. However, previous works still struggle to discover meaningful skills in high-dimensional state spaces. We approach the problem by leveraging unsupervised skill discovery and self-supervised learning of state representations. We learn a compact latent representation using either variational or contrastive techniques, and demonstrate that both allow a set of basic navigation skills to be learned by maximizing an information-theoretic objective. We assess our method on Minecraft 3D maps of varying complexity. Our results show that representations and conditioned policies learned from pixels suffice for toy examples, but do not scale to realistic and complex maps. We also explore alternative rewards and input observations to overcome these limitations.
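For readers unfamiliar with this line of work, the information-theoretic objective mentioned above is typically a mutual-information term between latent skills and visited states. The sketch below shows one common variational lower bound on such an objective; it illustrates the general family of skill-discovery objectives rather than the exact formulation used here, and the symbols $z$, $s$, $q_\phi$, and $\pi_\theta$ are illustrative.

\[
  \mathcal{I}(S; Z) \;=\; \mathcal{H}(Z) - \mathcal{H}(Z \mid S)
  \;\geq\; \mathbb{E}_{z \sim p(z),\; s \sim \pi_\theta(\cdot \mid z)}
  \big[ \log q_\phi(z \mid s) - \log p(z) \big],
\]

where $p(z)$ is a fixed skill prior, $\pi_\theta$ is the skill-conditioned policy, and $q_\phi(z \mid s)$ is a learned skill discriminator. Maximizing this bound corresponds to the per-step intrinsic reward $r(s, z) = \log q_\phi(z \mid s) - \log p(z)$ commonly used in this family of methods.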