Code Monkey home page Code Monkey logo

Comments (8)

fabianlim avatar fabianlim commented on July 25, 2024

@kmehant while the fix is as you described, now that #53 is merged, I think it may be best to switch to accelerate, which uses a yaml config defaults file. With yaml explicit encasement of strings using " and ' are not necessary, and it will be more robust to such issues. I suggest the following changes

  1. update the README.md removing instructions for torch.run and replace with accelerate.launch
  2. replace the FSDP JSON with a config yaml like this.
    • BTW I suggest to move fsdp_config.json out of tuning/config (which houses code) into somewhere which only houses config fixtures.

@Ssukriti

from fms-hf-tuning.

Ssukriti avatar Ssukriti commented on July 25, 2024

Thanks Fabian, created issue for README updates #87 . We will prioritize it at earliest

from fms-hf-tuning.

kmehant avatar kmehant commented on July 25, 2024

#80 (comment)

I think it may be best to switch to accelerate, which uses a yaml config defaults file.

Thanks @fabianlim, I am aware of this, isn't accelerate a wrapper over torch.distributed?

I suggest the following changes

I guess @Ssukriti is tracking them in a different issue #87

from fms-hf-tuning.

Ssukriti avatar Ssukriti commented on July 25, 2024

@kmehant I was planning to get to issue #87 in next 2 days as its high priority for our deliverables, but if you are interested and want to contribute instead, feel free to do so. Just let me know so I can plan accordingly :) .
We do need it completed at earliest so we can also start some testing with multi-GPU on our end as well

from fms-hf-tuning.

kmehant avatar kmehant commented on July 25, 2024

#80 (comment)

@Ssukriti I will be glad to raise a PR in a couple of hours.

from fms-hf-tuning.

fabianlim avatar fabianlim commented on July 25, 2024

@kmehant its up to you but I should be able to get to #87 pretty soon.

from fms-hf-tuning.

kmehant avatar kmehant commented on July 25, 2024

@Ssukriti @fabianlim I have raised a PR here #92 Thanks.

from fms-hf-tuning.

fabianlim avatar fabianlim commented on July 25, 2024

@Ssukriti @fabianlim I have raised a PR here #92 Thanks.

@kmehant ok looks like we duplicated work, see #91

from fms-hf-tuning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.