hsuth1996 Goto Github PK

followers: 3.0 following: 10.0 repos: 53.0 gists: 0.0

Name: XuYipeng

Type: User

Company: Tianjin University

Bio: A PhD candidate of EE in Tianjin university

Location: Tianjin

XuYipeng's Projects

privacyoptimizationsubspace

Privacy-Preserving Distributed Optimization via Subspace Perturbation: A General Framework

pymarl

Python Multi-Agent Reinforcement Learning framework

PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).

pytorch-maddpg

A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)

pytorch-soft-actor-critic

PyTorch implementation of soft actor critic

pytorch_sac

PyTorch implementation of Soft Actor-Critic (SAC)

reinforcement-implementation

Implementation of benchmark RL algorithms

reinforcement-learning-with-tensorflow

Simple Reinforcement learning tutorials, 莫烦Python 中文AI教学

rl-adventure-2

PyTorch0.4 implementation of: actor critic / proximal policy optimization / acer / ddpg / twin dueling ddpg / soft actor critic / generative adversarial imitation learning / hindsight experience replay

rl-baselines-zoo

A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.

rl_vvc_dataset

A Reinforcement Learning-based Volt-VAR Control Dataset

rlkit

Collection of reinforcement learning algorithms

safe-policy-optimization

This is a benchmark repository for safe reinforcement learning algorithms

safety-starter-agents

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

spinningup

An educational resource to help anyone learn deep reinforcement learning.

stablerl_voltagectrl

This repository contains source code necessary to reproduce the results presented in the following paper: Stability Constrained Reinforcement Learning for Real-Time Voltage Control (https://arxiv.org/pdf/2109.14854.pdf)

starcraft

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

supports_for_epc

电力建设论文《基于价值认同的需求侧电能共享分布式交易策略》的支撑文件

supports_for_pst_paper

论文《考虑实时市场联动的电力零售商鲁棒定价策略》的支撑文件，拟发表在《电网技术》杂志。2021年10月23日。

hsuth1996 Goto Github PK

XuYipeng's Projects

Recommend Projects

Recommend Topics

Recommend Org