Code Monkey home page Code Monkey logo

Comments (10)

kurianbenoy avatar kurianbenoy commented on July 19, 2024 1

@sayakpaul I am also interested in collaborating with you to write this guide. I was initially planning for a notebook example without custom-training loops, to keep it simple as an example.

I fully agree having an end to end detailed guide about using Keras is necessary.

from keras-io.

sayakpaul avatar sayakpaul commented on July 19, 2024

@kurianbenoy do you plan to cover how one should use TPUs in custom training loops as well? If so, I am willing to collaborate. @fchollet I think an end-to-end tutorial on how one should should use TPUs for custom training loops would be really helpful. WDYT?

from keras-io.

fchollet avatar fchollet commented on July 19, 2024

If you want to write a detailed guide about using TPUs, feel free to send a PR (as guides/training_on_tpu.py. Note that it will be a fairly difficult guide to write. It should cover everything -- fit, callbacks, model saving, custom training loops -- since it will be the official TPU guide :)

from keras-io.

sayakpaul avatar sayakpaul commented on July 19, 2024

@fchollet thanks for passing along. How about we did it in parts to maintain brevity and simplicity? I am also interested to see what @kurianbenoy has to say here.

from keras-io.

sayakpaul avatar sayakpaul commented on July 19, 2024

@kurianbenoy when would you like to start? Here's a notebook we can extensively refer to while writing the part about using TPUs in custom training loops. Let me know your thoughts.

@fchollet would you like the guide in two parts i.e. part I for the simpler one and part II for the more complex one or would you like to see one?

from keras-io.

fchollet avatar fchollet commented on July 19, 2024

@fchollet would you like the guide in two parts i.e. part I for the simpler one and part II for the more complex one or would you like to see one?

Ideally just one guide, but it could have 2 sections :)

from keras-io.

yashk2810 avatar yashk2810 commented on July 19, 2024

Is this what you are looking for? https://www.tensorflow.org/guide/tpu

from keras-io.

sayakpaul avatar sayakpaul commented on July 19, 2024

I'm aware of @yashk2810. But this guide proposed in the PR would cover two scenarios including the things to keep in mind while using TPUs. Two scenarios:

  • The classic fit and compile one.
  • Custom training loops.

from keras-io.

yashk2810 avatar yashk2810 commented on July 19, 2024

That guide covers both those scenarios right?

from keras-io.

sayakpaul avatar sayakpaul commented on July 19, 2024

I agree but the official TensorFlow guide leaves out on the little details that are needed in order to make TPU-based training work. For example:

  • How should one set the batch size that is specifically suited for TPU-based training?
  • How should one go about setting the learning rate?

It also lacks guidance on aggregating the loss after each training step which is absolutely crucial to understand while doing any form of distributed training in general.

from keras-io.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.