Comments (5)
Hello!
D:\traiNNer\codes\models\base_model.py:921: FutureWarning: Non-finite norm encountered in torch.nn.utils.clip_grad_norm_; continuing anyway. Note that the default behavior will change in a future release to error out if a non-finite total norm is encountered. At that point, setting error_if_nonfinite=false will be required to retain the old behavior.
This means the model is probably unstable and some values are becoming InF. Normally gradient clipping will set those values to max at the defined values, but looks like this behavior is changing. You can try lowering the learning rate a bit to see if the error goes away, otherwise, there might be some other configuration causing these values to become infinite.
C:\Python39\lib\site-packages\torch\optim\lr_scheduler.py:129: UserWarning: Detected call of
lr_scheduler.step()
beforeoptimizer.step()
. In PyTorch 1.1.0 and later, you should call them in the opposite order:optimizer.step()
beforelr_scheduler.step()
. Failure to do this will result in PyTorch skipping the first value of the learning rate schedule. See more details at https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
warnings.warn("Detected call oflr_scheduler.step()
beforeoptimizer.step()
. "
This is normal when using autocasting, no problem.
from trainner.
Thanks for the answer! But I still have a problem, the program does not use video card, only video memory and and one epoch lasts for a very long time.
Core of video card dont used or used non correct, but only the processor. (G.Translate)
from trainner.
How long is an epoch taking? Regarding the GPU utilization in Windows task manager, please check: #72
from trainner.
The epoch lasts 250-400 seconds(I don't know whether it is long but I think for a long time).
I load windows task manager screen, and GPU-Z.
Please tell us the video card is normally used? Because the uneven graphic is surprising. (Results from GPU Z and from WIndows Task Manager very different)
UPD. My bad, i dont check this issue(#72), how to include monitoring of cuda in windows task manager?
from trainner.
How many images are you using for training? The time to complete a full epoch depends on how many images you have. More concretely, one epoch means that all the images have been used and it has to load all the images again with the dataloader.
from trainner.
Related Issues (20)
- [Feature Request] Curriculum Training for Augmentations
- Pixel Unshuffle is broken HOT 2
- Video dataloader crashes at 1x scale
- Video learning rate too high
- Add lr_crop_size in config HOT 1
- Update requirements.txt with proper versioning for torch
- Feature request/bug fix: Perform scaling and other operations in linear light HOT 3
- `nearest_aligned` is not aligned HOT 2
- so many bugs in your sftgan implementation HOT 6
- How do i even use this HOT 2
- Is there any way to train video super resolution models using this? HOT 4
- lmdb has no valid image file HOT 3
- how to train for real ESRGAN HOT 1
- GPU usage at 0% during training HOT 7
- I Have No Idea What I Am Doing to Cause This: HOT 5
- "cv2.error: OpenCV(4.7.0) D:\a\opencv-python\opencv-python\opencv\modules\imgproc\src\resize.cpp:4065: error: (-215:Assertion failed) inv_scale_x > 0 in function 'cv::resize'" HOT 1
- ETA is grossly over-estimated. HOT 2
- Pix2Pix 3->1 channel HOT 1
- AttributeError: module 'collections' has no attribute 'Iterable' HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from trainner.