Code Monkey home page Code Monkey logo

Comments (8)

jiaxiang-wu avatar jiaxiang-wu commented on June 21, 2024

Hi, can you post the error message?

from pocketflow.

herbiezhao avatar herbiezhao commented on June 21, 2024

try "docker pull uber/horovod:0.14.1-tf1.10.0-torch0.4.0-py3.5"

from pocketflow.

MaxwellHan avatar MaxwellHan commented on June 21, 2024

Hi, can you post the error message?
看了教程,我做了以下步骤的操作:
step1. path.conf中指定了cifar数据集的地址
step2. 我用 docker pull uber/horovod:0.15.1-tf1.11.0-torch0.4.1-py3.5 pull下了镜像
step3. 修改scripts/run_docker.sh 文件,把执行nvidia-docker的镜像名从
docker.oa.com/g_tfplus/horovod:python3.5 改为 uber/horovod:0.15.1-tf1.11.0-torch0.4.1-py3.5
step4. 执行./scripts/run_docker.sh nets/resnet_at_cifar10_run.py,进入到了容器的bash命令中。但是既没有调用到gpu也没有运算,请问这样操作有什么问题,或者下一步应该做什么?

from pocketflow.

jiaxiang-wu avatar jiaxiang-wu commented on June 21, 2024

After entering the docker environment, use the following command to start the program:

$ bash main.sh

P.S.: Please use English for future discussion, if possible.

from pocketflow.

MaxwellHan avatar MaxwellHan commented on June 21, 2024

After entering the docker environment, use the following command to start the program:

$ bash main.sh

P.S.: Please use English for future discussion, if possible.

when i get into the docker eviroment and run "./main.sh" ,i got
"Could not find a version that satisfies the requirement docopt"
"Could not find a version that satisfies the requirement hdfs "
......
"Could not find a version that satisfies the requirement pandas" .et
,which means python doesn't have those modules.
Is the docker image "uber/horovod:0.15.1-tf1.11.0-torch0.4.1-py3.5" a wrong image?

My docker is in an internal network enviroment,which mean i can't connect the internet in the docker container, how do i fix this?

from pocketflow.

xieydd avatar xieydd commented on June 21, 2024

I have veen tested in k8s , and use horovod ,it`s ok.

from pocketflow.

herbiezhao avatar herbiezhao commented on June 21, 2024

hehe,because they use it inside tencent,and use internal source,you should use tsinghua or aliyun source instead of tencent internal source. Please modify main.sh with "index-url = https://pypi.tuna.tsinghua.edu.cn/simple" and "trusted-host=mirrors.aliyun.com"

from pocketflow.

jiaxiang-wu avatar jiaxiang-wu commented on June 21, 2024

@herbiezhao
Thanks for pointing this out. For users outside Tencent, you need to modify "index-url" and "trusted-host" to be able to install extra dependencies.

from pocketflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.