High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

competition_olympics-integrated

cumcm2020b

Problem B ("Crossing the Desert") of the 2020 national undergraduate mathematical modeling contest

daydaycode

Online Judge practice problems

deeprl_network

Multi-agent deep reinforcement learning for networked system control.

elegantrl

Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥

gpt_academic

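Provides a graphical interactive interface for GPT/GLM, specially optimized for reading and polishing papers. Modular design supports custom shortcut buttons and function plugins; supports table display in code blocks and dual display of TeX formulas; adds Python and C++ project analysis and self-translation features, plus PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel and local models such as Tsinghua's chatglm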

gym-jsbsim

A reinforcement learning environment for aircraft control using the JSBSim flight dynamics model

ilkit

A clean code base for imitation learning and reinforcement learning, written in PyTorch

image_classification

Image classification

images

jackory.github.io

A beautiful, simple, clean, and responsive Jekyll theme for academics

lightzero

LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.

nju_course_project

A record of projects completed at NJU

omnisafe

OmniSafe is an infrastructural framework for accelerating SafeRL research.

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

plants.vszombies

CUI version of Plants vs. Zombies

ray

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.

rpbt

Implementation of RPPO (Risk-sensitive PPO) and RPBT (Population-based self-play with RPPO)

spinningup

An educational resource to help anyone learn deep reinforcement learning.

statistics-project

Course project for Applied Statistics and the R Language

tcgaiic

Tianchi Artificial Intelligence Technology Innovation Competition, Track 3

tdmpc2

Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"

timechamber

vem

Code accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022, https://arxiv.org/abs/2110.09796)

Yuhua Jiang's Projects
