Comments (5)
First problem is that with -e
it fails with
The Resource 'Microsoft.Compute/virtualMachines/NCF-Trainer' under resource group 'NCF-Tutorial' was not found. For more details please go to https://aka.ms/ARMResourceNotFoundFix
This is because we do
az vm show \
--resource-group ${rg_name} \
--name ${vm_name}
and use the failure of that to determine if we create the VM or not.
from mlfirststeps_azure.
Getting rid of the -e
leads to a new error because of my git settings
Building images:
Deployment failed. Correlation ID: 5fb4219e-e038-46c7-b4f9-d048935d546a. VM has reported a failure when processing extension 'CustomScript'. Error message: "Enable failed: failed to execute command: command terminated with exit status=126
[stdout]
deployment/
deployment/build_pytorch.sh
deployment/docker_bootstrap.sh
[stderr]
/var/lib/waagent/custom-script/download/0/script.sh: ./deployment/docker_bootstrap.sh: /bin/bash^M: bad interpreter: No such file or directory
/var/lib/waagent/custom-script/download/0/script.sh: ./deployment/build_pytorch.sh: /bin/bash^M: bad interpreter: No such file or directory
"
More information on troubleshooting is available at https://aka.ms/VMExtensionCSELinuxTroubleshoot
Whats the best way to automatically ensure that anyone (with my git settings) who clones the repo and just runs the script without doing anything will not hit all these errors?
from mlfirststeps_azure.
After doing dos2unix
on all of the .sh files I now get the error
Building images:
Deployment failed. Correlation ID: 3088f286-14e9-42ff-b7a1-3eccc458f7f1. VM has reported a failure when processing extension 'CustomScript'. Error message: "Enable failed: failed to execute command: command terminated with exit status=126
[stdout]
deployment/
deployment/build_pytorch.sh
deployment/docker_bootstrap.sh
[stderr]
/var/lib/waagent/custom-script/download/0/script.sh: ./deployment/docker_bootstrap.sh: /bin/bash^M: bad interpreter: No such file or directory
/var/lib/waagent/custom-script/download/0/script.sh: ./deployment/build_pytorch.sh: /bin/bash^M: bad interpreter: No such file or directory
"
More information on troubleshooting is available at https://aka.ms/VMExtensionCSELinuxTroubleshoot
from mlfirststeps_azure.
The script isn't automatic. I get this
NOTE: Adding personal ratings to training dataset. Delete ncf/add_personal_ratings.py if this is undesirable
The authenticity of host '23.102.172.28 (23.102.172.28)' can't be established.
ECDSA key fingerprint is SHA256:Jd+LluwAVJwAfIIwa1oMySM6a5gB9mY4dF+0SDvwbZc.
Are you sure you want to continue connecting (yes/no)?
from mlfirststeps_azure.
This is followed by train.sh: No such file or directory
from mlfirststeps_azure.
Related Issues (12)
- Making it easy for the copy and pasters HOT 1
- Deleting the VM instance HOT 2
- Number of GPUs
- Newby-fying the training script
- Making sure the NVIIDA driver extension has finished before doing anything more. HOT 2
- Watching for 'waiting for a lock' HOT 1
- A check for personal ratings?
- Allow for SPOT pricing HOT 1
- Benchmarking results
- Doesn't for work for K80s
- Personalised reccomendations
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlfirststeps_azure.