
Yixuan Huang's Projects

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

bullet3

Focuses mainly on the racecar in pybullet. Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning, etc.

cpo

Constrained Policy Optimization

cs294-112_hws

My solutions to the assignments in UC Berkeley CS294-112: Deep Reinforcement Learning

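deeplearning-500-questions

Deep Learning 500 Questions: explains, in question-and-answer form, frequently raised topics in probability, linear algebra, machine learning, deep learning, computer vision, and related areas, to help the author and any readers who need it. The book has 15 chapters and nearly 200,000 characters. Given the author's limited expertise, readers are kindly asked to point out any shortcomings. To be continued... For collaboration, contact [email protected]. All rights reserved; infringement will be pursued. Tan, 2018.06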

gps

Guided Policy Search

gym

A toolkit for developing and comparing reinforcement learning algorithms.

handful-of-trials

Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

handful-of-trials-pytorch

Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"

mirage-rl-trpo

Fork of https://github.com/ikostrikov/pytorch-trpo with modifications for the paper "The Mirage of Action-Dependent Baselines in Reinforcement Learning".

models

Models and examples built with TensorFlow

ppo

Proximal Policy Optimization implementation with TensorFlow

pytorch-a2c-ppo-acktr-gail

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

quadcopter_sim

Simulation of a quadcopter using pybullet (calculations) and pyqtgraph (visualisation)

rllab

rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.

sornet

Code for SORNet: Spatial Object-Centric Representations for Sequential Manipulation in CoRL 2021 (Best Systems Paper Finalist)

trex-gym

OpenAI Gym environment for a Tyrannosaur, built on pybullet.

trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym

trpo-gae

Trust Region Policy Optimization with Generalized Advantage Estimator
