Comments (6)
@AkhilSinghRana Can you please try this with TF 1.0 and let us know if it works.
from cloudml-samples.
@puneith thanks for your response. It seems to be working with tensorflow1.0 but there are still some warnings related to GPU and float32 variables. Will let you know when the task gets completed. Thanks again.
from cloudml-samples.
@puneith Also do you have any idea how long it takes for the training to complete on cloud ? Or how many iterations will it continue for ? There is no accuracy report in between will it be shown at last! after complete training is done ? It just shows the Loss and step values. Hope it is working fine
from cloudml-samples.
@AkhilSinghRana Number of steps is a specified value, check the defaults if you did not specify. Closing this for now.
from cloudml-samples.
I have the same problem with tft 0.1.10 and tf 1.3. After reading this issue tried to install tf 1.0, but it does not help me.
tensorflow.python.training.basic_session_run_hooks.NanLossDuringTrainingError: NaN loss during training.
from cloudml-samples.
@lukashes Could you provide more complete logs in a gist or something? For what it's worth these examples have not yet been updated to TF 1.3 as CMLE does not yet have a TF 1.3 runtime, but 1.3 should be backwards compat with 1.2. Still without seeing more detailed logs the only recommendation I can make is try TF 1.2.
from cloudml-samples.
Related Issues (20)
- Problem with gcloud local prediction. HOT 5
- Unexpected change of bucket path due to lstrip in PyTorch container example HOT 1
- epoch_acc not defined in census tf-keras example? HOT 3
- Run failed "Creating a custom prediction routine with Keras" HOT 1
- census/estimator/trainer/task.py : 'module' object has no attribute 'logging' HOT 1
- I can't access to my instance HOT 1
- Links in Online Prediction Section result in 404 HOT 1
- Failed to submit the online prediction request HOT 1
- census/tf-keras training tensorflow2 HOT 6
- Adding parameter to execution fires an error HOT 3
- Public google/cloud/ml/v1/job_service.proto is out of sync with json API HOT 2
- error with gcloud init HOT 1
- Many links in readme is broken HOT 1
- Official AIP tutorial has a serious error "Could not find resource: localhost/dense/kernel error" HOT 1
- chore: snippet-bot full scan HOT 1
- Example/Template for Custom Container Online Prediction HOT 1
- ai-platform training : FATAL Flags parsing error: Unknown command line flag 'job_dir'. HOT 2
- [Policy Bot] found one or more issues with this repository.
- Python 3.5 build failing
- Dependency Dashboard
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cloudml-samples.