xrsrke / muzero

Implement MuZero from scratch [WORK IN PROGRESS]

License: Apache License 2.0

Python 54.18% Jupyter Notebook 45.12% CSS 0.71%

muzero

Install

pip install -r requirements
pip install -e .

MuZero

Representation function - $h_{\theta}(o_1, o_2, ..., o_t) = s^0$: It takes the history of observations and produces the initial hidden state $s^0$
Prediction function - $f_{\theta}(s^k) = \pi^k, v^k$: It takes a hidden state $s^k$ and predicts the policy $\pi^k$ and the value $v^k$
Dynamics function - $g_{\theta}(s^{k-1}, a^k) = r^k, s^k$: It takes the previous hidden state $s^{k-1}$ and an action $a^k$, and predicts the reward $r^k$ and the next hidden state $s^k$
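
To make the data flow between the three functions concrete, here is a minimal sketch in plain Python. It is not the code in this repository: the toy arithmetic stands in for neural networks, and the names `representation`, `prediction`, `dynamics`, `unroll`, `HIDDEN_SIZE`, and `NUM_ACTIONS` are assumptions made for illustration only.

```python
import math

HIDDEN_SIZE = 4   # assumed size of the hidden state (illustrative)
NUM_ACTIONS = 3   # assumed number of discrete actions (illustrative)

def representation(observations):
    """h: map the observation history to the initial hidden state."""
    # Toy stand-in for a network: average the observations, then squash.
    n = len(observations)
    mean = [sum(o[i] for o in observations) / n for i in range(HIDDEN_SIZE)]
    return [math.tanh(x) for x in mean]

def prediction(state):
    """f: map a hidden state to a policy and a value."""
    logits = [sum(state) * (a + 1) for a in range(NUM_ACTIONS)]
    max_logit = max(logits)
    exps = [math.exp(l - max_logit) for l in logits]  # numerically stable softmax
    total = sum(exps)
    policy = [e / total for e in exps]
    value = math.tanh(sum(state))
    return policy, value

def dynamics(state, action):
    """g: map (previous hidden state, action) to (reward, next hidden state)."""
    next_state = [math.tanh(x + 0.1 * (action + 1)) for x in state]
    reward = sum(next_state) / len(next_state)
    return reward, next_state

def unroll(observations, actions):
    """Encode the observations once, then step the model once per action."""
    state = representation(observations)
    policy, value = prediction(state)
    trajectory = [(None, policy, value)]  # no reward is predicted at the root
    for action in actions:
        reward, state = dynamics(state, action)
        policy, value = prediction(state)
        trajectory.append((reward, policy, value))
    return trajectory

observations = [[0.1, 0.2, 0.3, 0.4], [0.0, 0.1, 0.0, 0.1]]
trajectory = unroll(observations, actions=[0, 2, 1])
```

Note that the real environment is only consulted through `observations`; every later step runs entirely inside the learned model, which is what lets MuZero plan without access to the game rules.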

Components

Monte Carlo Tree Search (MCTS): guides the exploration of the game state space and selects the most promising actions
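
As a rough illustration of how the search could use a learned model, the sketch below runs a simplified MCTS loop (selection, expansion, backup) with a pUCT-style score. It is a sketch under stated assumptions, not this repository's implementation: it omits the value normalization and exploration noise used in the MuZero paper, and `Node`, `ucb_score`, `expand`, `run_mcts`, `toy_prediction`, and `toy_dynamics` are hypothetical names.

```python
import math

class Node:
    def __init__(self, prior):
        self.prior = prior        # policy probability assigned by the parent
        self.visit_count = 0
        self.value_sum = 0.0
        self.reward = 0.0         # reward predicted on the edge into this node
        self.state = None         # hidden state, set when the node is expanded
        self.children = {}        # action -> Node

    def value(self):
        return self.value_sum / self.visit_count if self.visit_count else 0.0

def ucb_score(parent, child, discount, c_puct=1.25):
    # Simplified pUCT: value estimate plus a prior-weighted exploration bonus.
    exploration = c_puct * child.prior * math.sqrt(parent.visit_count) / (1 + child.visit_count)
    q = child.reward + discount * child.value() if child.visit_count > 0 else 0.0
    return q + exploration

def expand(node, state, reward, policy):
    node.state = state
    node.reward = reward
    for action, p in enumerate(policy):
        node.children[action] = Node(prior=p)

def run_mcts(root_state, prediction, dynamics, num_simulations=50, discount=0.99):
    root = Node(prior=1.0)
    policy, _ = prediction(root_state)
    expand(root, root_state, 0.0, policy)
    for _ in range(num_simulations):
        node, path = root, [root]
        # Selection: descend until reaching a node that has not been expanded.
        while node.children:
            parent = node
            action, node = max(parent.children.items(),
                               key=lambda kv: ucb_score(parent, kv[1], discount))
            path.append(node)
        # Expansion: query the learned model instead of a real environment.
        reward, state = dynamics(parent.state, action)
        policy, value = prediction(state)
        expand(node, state, reward, policy)
        # Backup: propagate the discounted value along the search path.
        for visited in reversed(path):
            visited.value_sum += value
            visited.visit_count += 1
            value = visited.reward + discount * value
    return root

def toy_prediction(state):
    # Hypothetical stand-in: uniform policy over two actions, value = state.
    return [0.5, 0.5], state

def toy_dynamics(state, action):
    # Hypothetical stand-in: the reward equals the action index.
    return float(action), state * 0.5

root = run_mcts(0.0, toy_prediction, toy_dynamics, num_simulations=50)
visits = {action: child.visit_count for action, child in root.children.items()}
best_action = max(visits, key=visits.get)
```

Picking the most visited root action, as `best_action` does here, is one common way to turn the search statistics into a move; sampling in proportion to the visit counts is another.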

TODO

A replay buffer shared between agents
Monte Carlo Tree Search

Resources

Resources that I used to implement MuZero:
- MuZero - Mastering Atari, Go, Chess and Shogi by Planning with a Learned Model | RL Paper explained

MCTS
- Recreating DeepMind's AlphaZero - AI Plays Connect 4 - Part 2: Intro to Monte Carlo Tree Search: https://youtu.be/HikhrP5sgQo
