Code Monkey home page Code Monkey logo

sergeivolodin / causality-disentanglement-rl Goto Github PK

View Code? Open in Web Editor NEW
1.0 2.0 0.0 178.61 MB

Simple world models lead to good abstractions, Google Cerebra internship 2020/master thesis at EPFL LCN 2021 ⬛◼️▪️🔦

Home Page: https://www.overleaf.com/read/mmmbrrvnkffq

License: Creative Commons Zero v1.0 Universal

Python 88.56% Shell 0.17% Mathematica 11.28%
causality causal-reinforcement-learning abstraction-learning sparsity consciousness-prior tf-agents tensorflow model-sparsity primal-dual model-based-reinforcement-learning

causality-disentanglement-rl's People

Contributors

dependabot[bot] avatar sergeivolodin avatar

Stargazers

 avatar

Watchers

 avatar  avatar

causality-disentanglement-rl's Issues

Create and run hyperparameter study

  • Different losses: reconstruction or matrix norm or both
  • Feature space or observation space fit loss and reconstruction loss
  • Two, three or one optimizer [tried manually]
  • Learning rate, dimension search
  • Annealer grid search
  • Batch norm / no batch norm after decoder
  • Add a metric computing the losses with thresholded model
  • Norm order p
  • Projection or loss term

Train a stock agent on transformed representation to a full return

Now in 761e211 DQN fails to converge on an environment with a 2x2 linear transform, while it's fine without the transform. The reason might be either that there were too few steps given (now 100 steps x 256 episodes, or that there is a problem with the environment being non-Markov).

Concrete step to try: try a linear q-network and see if it converges to different results with and without transform.

Next step: try policy gradients as they do not require to learn the value (which might not exist if the environment is not Markov), only the policy (which is simple -- compare two numbers and choose a corresponding action)

@jbrea adding you to the issue so that you learn faster about the progress.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.