Comments (10)
@sayakpaul I am also interested in collaborating with you to write this guide. I was initially planning for a notebook example without custom-training loops, to keep it simple as an example.
I fully agree having an end to end detailed guide about using Keras is necessary.
from keras-io.
@kurianbenoy do you plan to cover how one should use TPUs in custom training loops as well? If so, I am willing to collaborate. @fchollet I think an end-to-end tutorial on how one should should use TPUs for custom training loops would be really helpful. WDYT?
from keras-io.
If you want to write a detailed guide about using TPUs, feel free to send a PR (as guides/training_on_tpu.py
. Note that it will be a fairly difficult guide to write. It should cover everything -- fit, callbacks, model saving, custom training loops -- since it will be the official TPU guide :)
from keras-io.
@fchollet thanks for passing along. How about we did it in parts to maintain brevity and simplicity? I am also interested to see what @kurianbenoy has to say here.
from keras-io.
@kurianbenoy when would you like to start? Here's a notebook we can extensively refer to while writing the part about using TPUs in custom training loops. Let me know your thoughts.
@fchollet would you like the guide in two parts i.e. part I for the simpler one and part II for the more complex one or would you like to see one?
from keras-io.
@fchollet would you like the guide in two parts i.e. part I for the simpler one and part II for the more complex one or would you like to see one?
Ideally just one guide, but it could have 2 sections :)
from keras-io.
Is this what you are looking for? https://www.tensorflow.org/guide/tpu
from keras-io.
I'm aware of @yashk2810. But this guide proposed in the PR would cover two scenarios including the things to keep in mind while using TPUs. Two scenarios:
- The classic
fit
andcompile
one. - Custom training loops.
from keras-io.
That guide covers both those scenarios right?
from keras-io.
I agree but the official TensorFlow guide leaves out on the little details that are needed in order to make TPU-based training work. For example:
- How should one set the batch size that is specifically suited for TPU-based training?
- How should one go about setting the learning rate?
It also lacks guidance on aggregating the loss after each training step which is absolutely crucial to understand while doing any form of distributed training in general.
from keras-io.
Related Issues (20)
- Using the same model on medical Xra dataset HOT 3
- Incorrect Markdown Rendering of Double Hyphen in Installation Documentation HOT 4
- Can I Use This Code With My Own Dataset? HOT 3
- Keras Keypoint detection example doesn't converge HOT 3
- neural_machine_translation_with_keras_nlp Example test script segmentation fault
- using test set as validation set HOT 1
- Bounding box predictions are too small (too narrow) in yolov8 HOT 5
- Keras version and Formatting issue in `MultipleChoice Task with Transfer Learning` example HOT 3
- CTC Batch Cost error while migrating "handwriting_recognition" example to Keras 3 HOT 2
- 'Functional' object has no attribute 'parameters' HOT 3
- Improve documentation on Yolov8 Detector HOT 1
- Keras_NLP Getting Started Tutorial: Mixed Precision Error; AttributeError: 'LossScaleOptimizerV3' HOT 6
- Performance degraded after using the mixup HOT 5
- In keras-io/guides/keras_nlp /transformer_pretraining.py there is an documentation bug HOT 1
- Add example Zero-shot Image Classification with SigLIP / CLIP from scratch using KerasCV and KerasNLP HOT 4
- Add Token Classification / Named Entity Recongnition (NER) Example with KerasNLP HOT 2
- kerascv layers demonstration HOT 3
- Could not find TensorRT HOT 3
- Error occurred in the Named Entity Recognition using Transformers example HOT 6
- Support Distributed Training for Fine-tuning Stable Diffusion Example HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from keras-io.