Code Monkey home page Code Monkey logo

cs234's Introduction

My Solution to Assignments of CS234

This is my solution to three assignments of CS234.
CS234: Deep Reinforcement Learning is an interesting class, which teaches you what is the reinforcement learning: Learn to make good sequences of decisions. This class provides some basic knowledge and insights of cutting-edge research in reinforcement learning. More details are as follows:

  • Define the key features of RL vs AI & other ML
  • Define MDP, POMDP, bandit, batch offline RL, online RL
  • Describe the exploration vs exploitation challenge and compare and contrast 2 or more approaches
  • Given an application problem (e.g. from computer vision, robotics, etc) decide if it should be formulated as a RL problem, if yes how to formulate, what algorithm (from class) is best suited to address, and justify an answer
  • Implement several RL algorithms incl. a deep RL approach
  • Describe multiple criteria for analyzing RL algorithms and evaluate algorithms on these metrics: e.g. regret, sample complexity, computational complexity, convergence, etc.
  • List at least two open challenges or hot topics in RL

Note: If you consult my source codes that you may want to incorporate into your algorithm or system, you should clearly cite references in your codes.


Table of Contents

  • Assignment 1
    • Bellman Operator Properties
    • Value Iteration
    • Grid Policies
    • Frozen Lake MDP
    • Frozen Lake Reinforcement Learning
  • Assignment 2
    • Q-learning
    • Linear Approximation
    • Deepmind's DQN
    • (Bonus) Double DQN
    • (Bonus) Dueling DQN
  • Assignment 3
    • R-max algorithm
    • epsilon-greedy q-learning
    • Expected Regret Bounds

Dependencies

  • Anaconda
  • tensorflow>=0.12
  • matplotlib
  • scipy
  • numpy
  • sklearn
  • six

Author

@zlpure

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.