Comments (18)
Same problem. I am on the WSL Ubuntu 18.04 NVIDIA 4090 system, same error under same conditions.
have successfully compile PyFlex in the past multiple times. When running compile.sh I get the following error
Also looking for ideas to fix it, thanks a lot.
from cloth-funnels.
I encountered the same issue.
When comparing against the flingbot codebase (on which this codebase is based) I found that Pyflex/lib
is missing and Pyflex/external
is incomplete. My guess is this is a bug caused by the default .gitignore
file that ignores all lib/
folders (see here)
Manually copying the lib/
and external/
folders into this codebase resolved the build error, although I am not 100% sure it was okay to just copy them, in case any modifications were made. Maybe @alpercanberk could confirm this.
from cloth-funnels.
@tlpss no major modifications were made, so copying them over should be fine. let me know if you run into any related issues.
from cloth-funnels.
@tlpss no major modifications were made, so copying them over should be fine. let me know if you run into any related issues.
Can confirm that copying the files worked fine for building the simulator.
I encountered a few other issues during the installation of the codebase, but got the task generation to work in the end. Haven't tried training/running the models.
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它
Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。
I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
from cloth-funnels.
king for ideas to fix it, tha
Have you solved this problem yet?
from cloth-funnels.
I think it should be from Flingbot https://github.com/columbia-ai-robotics/flingbot, but just to make sure @sashank-tirumala could you point us to where you copied the files from?
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
You should take a look at the PyFlex folder from the flingbot codebase here.
That worked for me to compile pyflex and make some modifications to the bindings.
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
You should take a look at the PyFlex folder from the flingbot codebase here.
That worked for me to compile pyflex and make some modifications to the bindings.
Thank you very much for your reply. I have successfully compiled according to your guidance, but there are still some errors reported during the evaluation and training. Have you run through the author's training and evaluation code?
from cloth-funnels.
I think it should be from Flingbot https://github.com/columbia-ai-robotics/flingbot, but just to make sure @sashank-tirumala could you point us to where you copied the files from?
Thank you for your guidance
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
You should take a look at the PyFlex folder from the flingbot codebase here.
That worked for me to compile pyflex and make some modifications to the bindings.Thank you very much for your reply. I have successfully compiled according to your guidance, but there are still some errors reported during the evaluation and training. Have you run through the author's training and evaluation code?
@zcswdt I did encounter a few issues when I tried to run training, but did not look into them as I was mostly interested in the data generation part.
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
You should take a look at the PyFlex folder from the flingbot codebase here.
That worked for me to compile pyflex and make some modifications to the bindings.Thank you very much for your reply. I have successfully compiled according to your guidance, but there are still some errors reported during the evaluation and training. Have you run through the author's training and evaluation code?
@zcswdt I did encounter a few issues when I tried to run training, but did not look into them as I was mostly interested in the data generation part.
May I ask if you have successfully run the code for the author's training section? When I was training the training instructions provided by the author, I found that as the number of training steps increased, the program would consume memory until all memory was consumed, and then the training program was killed. (Process finished with exit code 137 (interrupted by signal 9: SIGKILL).I don't know what caused this, it's really helpless. Please help me, thank you!
from cloth-funnels.
我遇到了同样的问题。
当与 flingbot 代码库(该代码库所基于的)进行比较时,我发现它Pyflex/lib
并不Pyflex/external
完整。.gitignore
我的猜测是这是由忽略所有文件夹的默认文件引起的错误lib/
(请参见此处)
lib/
手动将和文件夹复制external/
到此代码库中解决了构建错误,尽管我不能 100% 确定复制它们是否可以,进行任何修改。@alpercanberk 可以证实这一点。I also encountered the same problem. My system is ubuntu18.04. Can you tell me where to manually copy the lib and external folders?
You should take a look at the PyFlex folder from the flingbot codebase here.
That worked for me to compile pyflex and make some modifications to the bindings.Thank you very much for your reply. I have successfully compiled according to your guidance, but there are still some errors reported during the evaluation and training. Have you run through the author's training and evaluation code?
@zcswdt I did encounter a few issues when I tried to run training, but did not look into them as I was mostly interested in the data generation part.
May I ask if you have successfully run the code for the author's training section? When I was training the training instructions provided by the author, I found that as the number of training steps increased, the program would consume memory until all memory was consumed, and then the training program was killed. (Process finished with exit code 137 (interrupted by signal 9: SIGKILL).I don't know what caused this, it's really helpless. Please help me, thank you!
@zcswdt I'm afraid I won't be able to help.. As mentioned before, I have also had some issues with the training script, but have not looked into them properly as I was focused on the data generation.
from cloth-funnels.
Sorry @zcswdt, I don't have access to my original setup anymore. If you're having memory issues, have you been able to try using fewer processes / shrinking the network?
from cloth-funnels.
Thank you for your reply. Setting the parameter num_processes to 8 will also consume memory. Today, I will adjust it to 1 to see the situation. How to shrink the network? I don't quite understand this
from cloth-funnels.
@zcswdt I'm afraid I won't be able to help.. As mentioned before, I have also had some issues with the training script, but have not looked into them properly as I was focused on the data generation.
Thank you very much for your reply. I have actually completed the training, but it will consume my memory.
from cloth-funnels.
Thank you for your reply. Setting the parameter num_processes to 8 will also consume memory. Today, I will adjust it to 1 to see the situation. How to shrink the network? I don't quite understand this
If the memory issue is due to the neural network being trained simultaneously with the simulation, then you may be able to set the network to have fewer parameters, I doubt it will change things by much
from cloth-funnels.
Thank you for your reply. Setting the parameter num_processes to 8 will also consume memory. Today, I will adjust it to 1 to see the situation. How to shrink the network? I don't quite understand this
If the memory issue is due to the neural network being trained simultaneously with the simulation, then you may be able to set the network to have fewer parameters, I doubt it will change things by much
Thank you very much for your reply. Yesterday, I tried to evaluate the code again and found that it not only consumes memory for training, but also estimates approximately 400 data_size when using the evaluation model you provided When it comes to size, it also consumes my memory and causes the problem of killing the program. And during the evaluation, I added num_ process is also set to 1. I really don't know what to do. Your code is too important to me. Can you help me check it? My computer environment is strictly built according to the requirements on your readme. Of course, if possible, remotely control my computer and check my problem. Thank you very much. I really hope for your help. I have been trying this project for three months. Looking forward to your reply
from cloth-funnels.
Related Issues (9)
- I traped into the cycle, when I run the Demo.
- ray.exceptions.RayActorError: The actor died unexpectedly before finishing this task. HOT 1
- Evaluate Cloth Funnels Error HOT 2
- Dockerfile "FROM" command failed, failed to solve: nvidia/cuda:9.2
- Running compiled pyflex under cuda-11.1 failed, undefined symbol: cudaSetupArgument HOT 7
- Process finished with exit code 137 (interrupted by signal 9: SIGKILL) HOT 1
- about cuda
- memory leak HOT 9
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cloth-funnels.