Code Monkey home page Code Monkey logo

language-as-an-abstraction-for-hierarchical-deep-reinforcement-learning's Introduction

Language-as-an-Abstraction-for-Hierarchical-Deep-Reinforcement-Learning

PyTorch implementation of Language as an Abstraction for Hierarchical Deep Reinforcement Learning.
This paper uses language as the abstraction for Hierarchical Reinforcement Learning. Using this approach, agents can learn to solve to diverse, temporally-extended tasks such as object sorting and multi-object rearrangement.

Introduction

The proposed architecture has a 2-layer hierarchical policy with compositional language as the abstraction between the high-level policy and the low-level policy. This repository aims to replicate the results for low-level policy experiments (Figure 4 in the paper).

The experiments include state-observations and raw-pixel observations. This repository only focuses on the state-based representation (Figure 7 in the paper).

The environment and some instructions considered in this work
Low-level policy trying to complete randomly sampled goals

Installation and Running

The paper uses the CLEVR-Robot environment which is built on top of the MuJoCo physics simulator. These libraries are required. PyTorch 1.3 is used.

Simply running the main.py file starts the training.

Future Instruction Relabeling Strategy (Algorithm 4 in the paper) and the Computation graph of the state-based low level policy (Figure 7 in the paper) can be found in util.py

DQN, Instruction Encoder and the f1 network can be found in networks.py

Agent completing an instruction

Details

hal
The part showed by the red box is implemented in this repository. The instructions are sampled randomly from the environment.

References

  • Yiding Jiang, Shixiang Gu, Kevin Murphy, and Chelsea Finn. Language as an Abstraction for Hierarchical Deep Reinforcement Learning. In Workshop on “Structure & Priors in Reinforcement Learning”at ICLR 2019, jun 2019. URL http://arxiv.org/abs/1906.07343.

language-as-an-abstraction-for-hierarchical-deep-reinforcement-learning's People

Contributors

bhiziroglu avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.