Code Monkey home page Code Monkey logo

Comments (2)

iffiX avatar iffiX commented on June 19, 2024

Hi, thank you for trying out and report these bugs, I will answer them one by one:
[MADDPG]
This problem is caused by API change, the initialization part of MADDPG in the example should be changed to (for v0.4.1):

maddpg = MADDPG(
        [deepcopy(actor) for _ in range(agent_num)],
        [deepcopy(actor) for _ in range(agent_num)],
        [deepcopy(critic) for _ in range(agent_num)],
        [deepcopy(critic) for _ in range(agent_num)],
        t.optim.Adam,
        nn.MSELoss(reduction="sum"),
        critic_visible_actors=[list(range(agent_num))] * agent_num,
    )

[IMPALA]
For IMPALA I just found some serious bugs, first, please move learner group creation in the example outside the if-else like:

    learner_group = world.create_collective_group(ranks=[2, 3])
    if rank in (2, 3):
         ...

Then in machin.parallel.distributed.world, add sorted to these methods of class World:

    def get_ranks(self):
        """
        Returns:
            A list of ranks of all processes.
        """
        return sorted(list(self.rank_name_map.keys()))

    def get_members(self):
        """
        Returns:
            A list of names of all processes.
        """
        return sorted(list(self.rank_name_map.values()))

These are temporary fixes and should be able to solve your problem, the "connect()" problem is caused by incoherent member position over all involving processes, to be short.

[Bugs in the test]
These bugs are all from the auto module and they are caused by incorrect returning annotation, I think these type-checking features are introduced in newer PyTorch versions, I will fix that. Besides, there are also bugs caused by PyTorch lightning altering their APIs, I am currently fixing that. These problems will not affect you if auto module is not used.

from machin.

Vildnex avatar Vildnex commented on June 19, 2024

Thanks for the response, with your help now is working properly :D.

from machin.

Related Issues (18)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.