Code Monkey home page Code Monkey logo

Comments (5)

victorca25 avatar victorca25 commented on June 15, 2024

Hello!

D:\traiNNer\codes\models\base_model.py:921: FutureWarning: Non-finite norm encountered in torch.nn.utils.clip_grad_norm_; continuing anyway. Note that the default behavior will change in a future release to error out if a non-finite total norm is encountered. At that point, setting error_if_nonfinite=false will be required to retain the old behavior.

This means the model is probably unstable and some values are becoming InF. Normally gradient clipping will set those values to max at the defined values, but looks like this behavior is changing. You can try lowering the learning rate a bit to see if the error goes away, otherwise, there might be some other configuration causing these values to become infinite.

C:\Python39\lib\site-packages\torch\optim\lr_scheduler.py:129: UserWarning: Detected call of lr_scheduler.step() before optimizer.step(). In PyTorch 1.1.0 and later, you should call them in the opposite order: optimizer.step() before lr_scheduler.step(). Failure to do this will result in PyTorch skipping the first value of the learning rate schedule. See more details at https://pytorch.org/docs/stable/optim.html#how-to-adjust-learning-rate
warnings.warn("Detected call of lr_scheduler.step() before optimizer.step(). "

This is normal when using autocasting, no problem.

from trainner.

cheuS1-n avatar cheuS1-n commented on June 15, 2024

Thanks for the answer! But I still have a problem, the program does not use video card, only video memory and and one epoch lasts for a very long time.
Core of video card dont used or used non correct, but only the processor. (G.Translate)
Знімок екрана 2022-12-13 093533

from trainner.

victorca25 avatar victorca25 commented on June 15, 2024

How long is an epoch taking? Regarding the GPU utilization in Windows task manager, please check: #72

from trainner.

cheuS1-n avatar cheuS1-n commented on June 15, 2024

The epoch lasts 250-400 seconds(I don't know whether it is long but I think for a long time).
I load windows task manager screen, and GPU-Z.
image
image

Please tell us the video card is normally used? Because the uneven graphic is surprising. (Results from GPU Z and from WIndows Task Manager very different)

UPD. My bad, i dont check this issue(#72), how to include monitoring of cuda in windows task manager?

from trainner.

victorca25 avatar victorca25 commented on June 15, 2024

How many images are you using for training? The time to complete a full epoch depends on how many images you have. More concretely, one epoch means that all the images have been used and it has to load all the images again with the dataloader.

from trainner.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.