Solving multiagent agent problem using tensroflow of a simple multi-agent particle world with a continuous observation and discrete action space, along with some basic simulated physics.
The repository heavily relies on philtabor pytorch implementation for Multi Agent Actor Critic for Mixed Cooperative-Competitive Environments
-
clone Multi Agent Particle Environment(MAPE) as detailed.
-
cd to multiagent-particle-envs
-
Create virtual environment and activate it
-
Install required dependecies
1 adversary (red), N good agents (green), N landmarks (usually N=2). All agents observe position of landmarks and other agents. One landmark is the ‘target landmark’ (colored green). Good agents rewarded based on how close one of them is to the target landmark, but negatively rewarded if the adversary is close to target landmark. Adversary is rewarded based on how close it is to the target, but it doesn’t know which landmark is the target landmark. So good agents have to learn to ‘split up’ and cover all landmarks to deceive the adversary.
-
Create python virtual env
-
install petting-zoo with MPE dependecies
pip install 'pettingzoo[mpe]'