zhengyinan-air / omiga Goto Github PK
View Code? Open in Web Editor NEW[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
[NeurIPS 2023] The official implementation of "Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization"
In the get_episode() function, the rewards have been turned into reward-to-gos, which is not describe in the paper.
for agent_trajectory in episode:
rtgs = 0
for i in reversed(range(len(agent_trajectory))):
rtgs += agent_trajectory[i][3][0]
agent_trajectory[i][3][0] = rtgs
return episode
Can you explain why should turn rewards to reward-to-go? Is this transformation applied for other baselines too?
Sorry to bother you.
I noticed you are using non-default SMAC arguments when initialising the environment. Can you explain why? And maybe provide a list of arguments you changed? Did you also change the SMAC code since you included the SMAC environment in your repository?
Thanks for the assistance.
Hi there, thanks for sharing your code for MAMuJoCo experiments. I was wondering if you plan to release the code to reproduce your experiments on SMAC?
Hi!
In your paper, you compared OMIGA with BCQ-MA, CQL-MA, ICQ and OMAR. I have found the official implementations for ICQ-MA and OMAR, can you share your version for BCQ-MA and CQL-MA or just share the implementations link? This will help me a lot!
Thanks for your work!
Hi, I am trying to download the HalfCheetah datasets from the link you provided. But a few seconds after starting, it always fails. Please advise how to get access to these datasets.
I did successfully manage to download the Ant and Hopper expert datasets.
Inspecting the datasets for 2x4 Ant I notice that the observation dimension for each agent is 113. But the observation dim when agent_obsk=1
is supposed to be 52. Does that mean in your experiments agents got to see the whole state?
Many thanks for the contribution. I look forward to being able to use your datasets.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.