multi-agent-deep-deterministic-policy-gradients-tensorflow's Introduction

Multi-Agent-Deep-Deterministic-Policy-Gradients-Tensorflow

Solving multiagent agent problem using tensroflow of a simple multi-agent particle world with a continuous observation and discrete action space, along with some basic simulated physics.

The repository heavily relies on philtabor pytorch implementation for Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments

clone Multi Agent Particle Environment(MAPE) as detailed.
cd to multiagent-particle-envs
Create virtual environment and activate it
Install required dependecies

Game rules

1 adversary (red), N good agents (green), N landmarks (usually N=2). All agents observe position of landmarks and other agents. One landmark is the ‘target landmark’ (colored green). Good agents rewarded based on how close one of them is to the target landmark, but negatively rewarded if the adversary is close to target landmark. Adversary is rewarded based on how close it is to the target, but it doesn’t know which landmark is the target landmark. So good agents have to learn to ‘split up’ and cover all landmarks to deceive the adversary.

Using petting zoo

Create python virtual env
install petting-zoo with MPE dependecies pip install 'pettingzoo[mpe]'

Recommend Projects

razisamuely / multi-agent-deep-deterministic-policy-gradients-tensorflow Goto Github PK