Code Monkey home page Code Monkey logo

lars12llt's Projects

brax icon brax

Massively parallel rigidbody physics simulation on accelerator hardware.

collaq icon collaq

A code implementation for our arXiv paper "Multi-agent Adhoc Team Play using Decompositional Q function"

distributedrl icon distributedrl

A framework for easy prototyping of distributed reinforcement learning algorithms

dqn_zoo icon dqn_zoo

DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (DQN) agent.

driml icon driml

Code for Deep Reinforcement and InfoMax Learning (Neurips 2020)

efficientzero icon efficientzero

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

efficientzerov2 icon efficientzerov2

[ICML 2024, Spotlight] EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data

generativerl icon generativerl

Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).

h-baselines icon h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

jax-rl icon jax-rl

Jax (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.

jaxmarl icon jaxmarl

Multi-Agent Reinforcement Learning with JAX

level-replay icon level-replay

This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the fact that not all levels are equally useful for agents to learn from during training.

lightzero icon lightzero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

mrcl icon mrcl

Code for the NeurIPS19 paper "Meta-Learning Representations for Continual Learning"

muzero icon muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

ntk icon ntk

Code for experiments in my blog post on the Neural Tangent Kernel: https://rajatvd.github.io/NTK

optimalrepresentationrl icon optimalrepresentationrl

An implementation in PyTorch of the paper "A Geometric Perspective on Optimal Representations for Reinforcement Learning" by Bellemare et al

procgen-competition icon procgen-competition

Sample efficiency and generalisation in reinforcement learning using procedural generation.

pytorch-a2c-ppo-acktr-gail icon pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.