Comments (5)
Hi!
My understanding of this error message is that PyTorch was only able to reserve 1.88 GiB of memory. Maybe the difference to the 3 GB you see is reserved but unallocated in other processes? Nevertheless, it will be difficult to train models on 3 GB VRAM. We have been able to run on 11 GB, but even then only on very small batch sizes.
Besides closing down other processes that may be using the GPU, what you can try is to downsample the experiments more and use smaller patch sizes, but this will of course be at the expense of output resolution. I would aim for something like a 1200x1200 px resolution (by setting the --scale
argument in xfuse convert
to 1200 divided by the resolution of the image file) and a patch size of maybe 384 or 256 (by setting the attribute patch_size = 384
under the optimization header in the .toml config file passed to xfuse run
). The batch_size
attribute will likely need to be either 2 or 3.
from xfuse.
Thank you very much for your reply, your suggestions are very useful to me. However, when I was about to end the operation, I encountered the following misalignment.
🚨 ERROR : IndexError: tensors used as indices must be long, byte or bool tensors
Traceback (most recent call last):
File "e:\conda\lib\site-packages\xfuse\model\experiment\st\st.py", line 519, in model
data = data[idxs - 1]
IndexError: tensors used as indices must be long, byte or bool tensors
Is there an error in the code? The torch version I use is 1.10.0+cu113.
from xfuse.
Thanks for the report! I failed to reproduce this on my Linux computer with a clean environment and torch 1.10.0+cu102, so it may be Windows-specific. The idxs
tensor originates from here in the dataloader:
xfuse/xfuse/data/slide/data/st_slide.py
Line 169 in 5ac2333
data = data[idxs.long() - 1]
. You are welcome to submit a PR if you find a solution!from xfuse.
Well, I have tried to run XFuse in Windows successfully, but at the end of the run, there are the following warnings. Do I need to ignore these warnings or how to deal with them?
⚠ WARNING : UserWarning (e:\conda\lib\site-packages\xfuse\session\io.py:25): Failed to store session item "covariates".The error returned was: [Errno 2] No such file or directory: '/dev/null'
⚠ WARNING : UserWarning (e:\conda\lib\site-packages\xfuse\session\io.py:25): Failed to store session item "genes".The error returned was: [Errno 2] No such file or directory: '/dev/null'
⚠ WARNING : UserWarning (e:\conda\lib\site-packages\xfuse\session\io.py:25): Failed to store session item "model".The error returned was: [Errno 2] No such file or directory: '/dev/null'
⚠ WARNING : UserWarning (e:\conda\lib\site-packages\xfuse\session\io.py:25): Failed to store session item "training_data".The error returned was: [Errno 2] No such file or directory: '/dev/null'
⚠ WARNING : UserWarning (e:\conda\lib\site-packages\xfuse\session\io.py:25): Failed to store session item "metagene_expansion_strategy".The error returned was: [Errno 2] No such file or directory: '/dev/null'
from xfuse.
These warnings indicate that the model couldn't be saved, so it won't be possible to use the trained model for additional analyses. I have attempted to fix this in #31. If you have time to try it out, any feedback would be great! You can install the fixed version using pip install --force-reinstall --user git+https://github.com/ludvb/xfuse@check-pickle
.
from xfuse.
Related Issues (20)
- Error: ValueError: The parameter loc has invalid values HOT 1
- License? HOT 1
- Error when Running with Windows HOT 1
- error when installing
- AttributeError: 'DataLoader' object has no attribute 'reset_workers' HOT 1
- In silico spatial transcriptomics HOT 1
- Runtime error related to tensor size when running "analysis-gene_maps" HOT 4
- prediction of spatial gene expression
- ModuleNotFoundError: No module named 'xfuse' HOT 12
- When is my model fully trained? HOT 9
- Is it possible for Gene_maps values to use the same scale instead of min-max per gene? HOT 3
- How do I make a tissue mask (with photoshop?) HOT 4
- What is the minimum of sample to train a model on? HOT 1
- Installation of xfuse HOT 11
- TypeError: cannot pickle 'weakref' object related to save_session HOT 2
- How to predict expression from histology in unseen samples? What specific input and configuration files should be prepared?
- Customizing Training Procedure
- RuntimeError: CUDA error: no kernel image is available for execution on the device HOT 2
- RuntimeError: Annotation layer "" is missing HOT 1
- Missing h5 file question
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from xfuse.