View Code? Open in Web Editor NEW

Shell 0.02% JavaScript 5.52% C++ 19.01% Python 60.78% C 1.78% Makefile 9.92% HTML 1.00% CMake 1.97%

wtoe's Introduction

Implementation of the paper "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning"

This is the code for "WToE: Learning When to Explore in Multi-Agent Reinforcement Learning". The paper will be published soon.

Environment

Grid Examples: The environment contains two basic grid environment (2-room and 4-room environments), which are implemented in the GRID/ENV file.
Multi-Agent Particle Environments (MPE): A simple multi-agent particle world with a continuous observation and discrete action space, along with some basic simulated physics. Used in the paper Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments. We test our algorithm in six scenarios, including simple_adversary,simple_crypto, simple_push, simple_reference, simple_spread, simple_tag.
MAgent: MAgent is a research platform for many-agent reinforcement learning. Unlike previous research platforms that focus on reinforcement learning research with a single agent or only few agents, MAgent aims at supporting reinforcement learning research that scales up from hundreds to millions of agents.

Quick start

Please follow the instruction of 'README.md' file in different environments to install Python requirements.

Cite our paper

@article{WToE,
  title={WToE: Learning When to Explore in Multi-Agent Reinforcement Learning},
  author={Dong, Shaokang and Mao, Hangyu and Yang, Shangdong and Zhu, Shengyu and Li, Wenbin and Hao, Jianye and Gao, Yang},
  journal={IEEE Transactions on Cybernetics},
  year={2023}
}

Recommend Projects