Reinforcement Learning Specialization offered by the University of Alberta and the Alberta Machine Intelligence Institute
The main goals of the first course are:
- Understand the exploration-exploitation tradeoff using multi-armed bandits
- Understand the structure and components of a (finite) Markov Decision Process
- Understand the definition of the state-value function and the action-value function
- Be able to explain how to derive the Bellman equations and the Bellman optimality equations
- Understand the framework of Dynamic Programming (policy evaluation, policy iteration, and generalized policy iteration)
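As a concrete illustration of the exploration-exploitation tradeoff, here is a minimal sketch of an epsilon-greedy agent on a stationary multi-armed bandit, using incremental sample-average action-value estimates. The function name and the Gaussian reward model are illustrative choices, not part of the course materials.

```python
import random

def run_bandit(true_means, steps=10000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent on a stationary multi-armed bandit.

    Action values are estimated with incremental sample averages:
        Q[a] <- Q[a] + (1 / N[a]) * (reward - Q[a])
    """
    rng = random.Random(seed)
    k = len(true_means)
    Q = [0.0] * k  # estimated value of each arm
    N = [0] * k    # number of times each arm was pulled
    total = 0.0
    for _ in range(steps):
        if rng.random() < epsilon:                 # explore: random arm
            a = rng.randrange(k)
        else:                                      # exploit: greedy arm
            a = max(range(k), key=lambda i: Q[i])
        reward = rng.gauss(true_means[a], 1.0)     # noisy reward (assumed Gaussian)
        N[a] += 1
        Q[a] += (reward - Q[a]) / N[a]             # incremental sample average
        total += reward
    return Q, total / steps
```

With a small epsilon the agent spends most pulls on the arm with the highest estimated value while still sampling the others often enough for the estimates to converge.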
The main goals of the second course are:
- Understand prediction problems using Monte Carlo methods
- Understand how Temporal Difference learning works in prediction problems compared to the Monte Carlo method
- Understand different TD learning methods for control problems: Q-learning, SARSA, and Expected SARSA
- Understand the Dyna architecture (Dyna-Q and Dyna-Q+)
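To make the TD control idea concrete, here is a small sketch of tabular Q-learning on a toy deterministic chain environment (the environment, its size, and all hyperparameters are illustrative assumptions, not taken from the course):

```python
import random

def q_learning_chain(n_states=6, episodes=500, alpha=0.5, gamma=0.9,
                     epsilon=0.1, seed=0):
    """Tabular Q-learning on a deterministic chain.

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    Reaching the rightmost state gives reward 1 and ends the episode.
    """
    rng = random.Random(seed)
    Q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        s = 0
        while s != goal:
            # epsilon-greedy behavior policy
            if rng.random() < epsilon:
                a = rng.randrange(2)
            else:
                a = 0 if Q[s][0] > Q[s][1] else 1
            s2 = max(s - 1, 0) if a == 0 else s + 1
            r = 1.0 if s2 == goal else 0.0
            # Q-learning target bootstraps from the greedy next action,
            # regardless of which action the behavior policy will take
            Q[s][a] += alpha * (r + gamma * max(Q[s2]) - Q[s][a])
            s = s2
    return Q
```

Replacing `max(Q[s2])` with the value of the action actually taken next would turn this into SARSA; taking the epsilon-greedy expectation over `Q[s2]` would give Expected SARSA.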
The main goals of the third course are:
- Understand how value functions are approximated using parametrized functions
- Understand what coarse coding for feature generalization is and how to use neural networks for function approximation
- Be able to implement Episodic SARSA
- Understand the average-reward formulation
- Understand how policy-gradient and actor-critic methods work for continuing tasks
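A minimal example of value-function approximation is semi-gradient TD(0) prediction with state aggregation, which can be viewed as the simplest form of coarse coding. The random-walk environment and all constants below are illustrative assumptions:

```python
import random

def semi_gradient_td0(n_states=10, groups=5, episodes=2000,
                      alpha=0.05, gamma=1.0, seed=0):
    """Semi-gradient TD(0) prediction with state aggregation.

    v(s) is approximated by one shared weight per group of states:
        v(s) ~= w[s // group_size]
    Environment: random walk on states 0..n_states-1 under a uniform
    random policy; stepping off the right end gives reward 1, stepping
    off the left end gives 0, and both terminate the episode.
    """
    rng = random.Random(seed)
    size = n_states // groups
    w = [0.0] * groups

    def feature(s):
        return s // size  # index of the single active weight

    for _ in range(episodes):
        s = n_states // 2
        while True:
            s2 = s + (1 if rng.random() < 0.5 else -1)
            if s2 < 0:                     # terminated off the left end
                target, done = 0.0, True
            elif s2 >= n_states:           # terminated off the right end
                target, done = 1.0, True
            else:                          # bootstrap from the next state
                target, done = gamma * w[feature(s2)], False
            # semi-gradient update: only the active weight moves
            w[feature(s)] += alpha * (target - w[feature(s)])
            if done:
                break
            s = s2
    return w
```

The learned weights increase from left to right, approximating the true values, which grow linearly across the chain; replacing the aggregation with overlapping tiles (tile coding) or a neural network changes only the `feature` mapping and the update's gradient.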
The final course aims to implement a complete RL system by:
- Translating the problem setting into an RL framework and identifying a suitable solution method
- Coding our environment
- Coding our agent
Our setting is that we want a lunar lander to land safely on the surface of the moon without crashing.
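The environment-plus-agent structure above can be sketched as an interaction loop with `env_start`/`env_step` and `agent_start`/`agent_step`/`agent_end` hooks. The toy environment and placeholder agent below are stand-ins for illustration only; the real capstone uses the Lunar Lander environment and a learned policy:

```python
class Environment:
    """Toy stand-in environment: five steps, then the episode ends."""

    def env_start(self):
        self.t = 0
        return 0  # initial observation

    def env_step(self, action):
        self.t += 1
        reward = 1.0 if action == 1 else 0.0   # reward the "good" action
        terminal = self.t >= 5                 # fixed episode length
        return reward, self.t, terminal


class Agent:
    """Placeholder agent that always picks action 1 (no learning)."""

    def agent_start(self, state):
        return 1

    def agent_step(self, reward, state):
        return 1

    def agent_end(self, reward):
        pass  # a learning agent would do its final update here


def run_episode(env, agent):
    """Run one episode of agent-environment interaction."""
    state = env.env_start()
    action = agent.agent_start(state)
    total = 0.0
    while True:
        reward, state, terminal = env.env_step(action)
        total += reward
        if terminal:
            agent.agent_end(reward)
            return total
        action = agent.agent_step(reward, state)
```

Keeping the environment and the agent behind this narrow interface lets either side be swapped out, e.g. replacing the toy environment with the lunar lander while reusing the same loop.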