Comments (4)
Thanks for your message, so actually it is not just that we adjusted the learning rate as a function of iteration, rather than epoch, but actually also did not even train with proper epochs. We simply, at each iteration, sampled 32 examples from the longer ground truth trajectory, randomly and without replacement. So we only have "epochs" on average, but nothing prevented the same sequence to be sampled twice before some other sequence was sample once.
We believed this did not matter in our case because an epoch is about 54k training examples, and we trained with 300k steps batch 32 each, which corresponds to about ~180 epochs, so it probably does not matter if some examples are sampled a bit more than 180 times, and some other a bit less than 180 times.
In the case of a small dataset probably it still does not matter much so long as the number of training iterations is still large compared to the dataset size, and the learning rate decay is very slow (like in our case), but of course if you are getting to limit where each example is only seen a few times, and the learning rate decays quickly, I think it makes sense to do it in the way you are proposing.
from graphcast.
Thank you for providing clarification and detailing the training process. Understanding the methodology behind your model training is insightful. Your point about smaller datasets and quicker learning rate decay suggests that an approach based on epochs might be more suitable in those scenarios.
It seems that the issue has been satisfactorily resolved.
from graphcast.
Hello, I have some questions about model training. Have you tried training models with different resolutions, GraphCast_small (13levels, 1°) and GraphCast (37levels, 0.25°)? How much time and memory does it take to train these two models?
I look forward to your response. Thank you.
Best regards!
from graphcast.
@zhongmengyi I have replied in your separate issue #77
from graphcast.
Related Issues (20)
- How was the Graphcast Small 1.0 degree Reanalysis made? HOT 1
- RESOURCE_EXHAUSTED HOT 1
- prediciton output time HOT 7
- Performance of GraphCast_small vs regular GraphCast vs GraphCast_operational HOT 1
- May I ask for a multiple batch example nc file?
- Training duration of pre-training model HOT 1
- Issues with GraphCast Training – Request for Assistance HOT 3
- Floating Point Error in 1-month forecast HOT 9
- Craphcast HOT 1
- UserWarning: no explicit representation of timezones available for np.datetime64
- Issues with performance of GraphCast operational model HOT 5
- Predictors
- The predictions may have some errors
- Discrepancy between calculated TSI in solar_radiation.py and ERA5 TISR
- JAX problems HOT 1
- Problem of building own dataset HOT 2
- Prediction variables HOT 1
- Comparison of Graphcast and Copernicus HOT 1
- Ensemble Prediction
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from graphcast.