developmentseed / sat-ml-training Goto Github PK
View Code? Open in Web Editor NEWHome Page: https://developmentseed.org/sat-ml-training/
License: Apache License 2.0
Home Page: https://developmentseed.org/sat-ml-training/
License: Apache License 2.0
Add more issues as found and we'll wrap into another update.
@lillythomas and @wildintellect should arrange a call mid Sept with ICIMOD to verify that MS Teams will work, and has all the features we need:
Some of the examples use non-public data. In the short term it's not an issue as the Shared Drive is not Public. Long term when finalizing the examples, we will need to only provide public data for examples, or instructions for how to obtain the needed data.
TODO:
From Slack discussion:
i think we should use an if/else statement for whether you want to use a saved model for predictions, dictated by boolean variable. the default setting should be False.
maybe if statement based on the model existing as a variable, question how do we make sure the weights are saved/reloaded before predictions?
if the saved_model.pb and variables folder are in the saved_model_path then that indicates the model saved out.
maybe we can cross compare the weights by printing them from the in memory trained model and then from the loaded model
Probably won't figure this out, automated upgrade is broken due to customizations and permissions. The only thing this blocks is that launching a local server while developing is broken, so you have to push your notebook to github first, then you can view, launch in colab, or do a PR to put into main branch to get it check it on the website.
Looking at our intro material, I think we need to better guide readers as to what they should actually review before jumping into our interactive lessons. We've got lots of good links but should narrow down the number of pages to review covering the key concepts before using.
http://devseed.com/sat-ml-training/python/background/2020/02/23/IntroMachineLearning.html
@drewbo this session "Large Scale Inference, Cloud Scaling", lesson number 6, on Oct 8, which is the final lesson before office hours is up to you.
@Geoyi mentioned discussing some topics with you and @lillythomas . A few things we thought about in scrum that might be good to include, which are not well covered elsewhere.
Last week we polished most of the pre-read content, namely the ML guide.
Remaining to-do items:
Was working with the Zindi data, it's not really feasible for participants to download the data themselves and then upload to their google drives, could easily take hours. As a workaround we are putting all the data unzipped on Google Shared Drive accessible to participants. This is a common problem in general for ML work.
However it would be really awesome to use Colab to download the data directly to the cloud and unzip.
Alternatives:
This is a wishlist item if we've got all the other important stuff done already, but important to talk about.
https://developmentseed.org/sat-ml-training/Randomforest_cropmapping-with_GEE the code cell with the Export to Drive is missing, but it is in the source code.
Participants are going to need several different accounts. We need to figure out that list and ensure everyone has instructions for each account type they need ahead of time.
Hi,
I have got the code for "supervised machine learning" of pyrasterframes, and the link is "https://rasterframes.io/supervised-learning.html". I have seen the author used 12 ".tiff" to train the machine learning model, however the training set and testing set has not been split. Also, I used the code "x_training_data, x_test_data, y_training_data, y_test_data = train_test_split(x, y, test_size = 0.3)" and hoped to split the training set and testing, but I am not sure which one is "x" and "y". So, could you pls help to give me some suggestions on how to split the training set and testing set for the program on the the link "https://rasterframes.io/supervised-learning.html"? Thanks!
Opening this issue will trigger GitHub Actions to fetch the lastest version of fastpages. More information will be provided in forthcoming comments below.
Create a Page with the listing in order of materials for the HKH training.
This might be done using the sticky feature of fastpages, or a tag + date ordering of the posts, although newest post seems to show up first. Might also be able to include a hidden Markdown page that is hand coded links in the order we want.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.