Comments (9)
I encountered the same problem, and it's been solved by downgrading the nvidia-ml-py to a former version 11.525.112 using pip install nvidia-ml-py==11.525.112
. I hope it's helpful.
from gpustat.
+1 same error
- OS: Windows 10 Enterprise (Version: 2004, OS build: 19041.264)
- NVIDIA Driver version: 536.99
- The name(s) of GPU card: NVIDIA GeForce RTX 4090 x 2
- gpustat version: gpustat 1.1.1
Thanks for the workaround @Lunar13737 , it worked for me.
from gpustat.
@wookayin I think it was most likely 12.535.77 that caused the error, though I'm not 100% sure because I didn't keep a record of it. I downgraded to 11.525.112 which worked, and now 12.535.108 works too.
from gpustat.
Thanks. I can conclude that the root cause of this bug is essentially same as #161: one should use neither nvidia-ml-py=11.535.77
nor broken NVIDIA drivers >= 535.43, < 535.98
.
gpustat
will print warnings when any of these versions of nvml library or driver is detected, so we can close this issue without adding an unnecessary compatibility layer.
from gpustat.
+1 and the workaround with downgrading nvidia-ml-py
did not work for me :(
- OS: Windows 11 Pro N
- NVIDIA Driver Version: 535.98, CUDA Version: 12.2
- GPU: NVIDIA gpuGeForce RTX 4070
- gpustat version: gpustat 1.1.1
Any hints?
from gpustat.
I'd like to reproduce this issue to have a correct fix. But I've never seen the issue.
What we know from #161 (comment):
- nvidia-ml-py=11.535.77 is buggy, only works for 535.43 and 535.86 (the OP's case):
- Does the problem go away if you install nvidia-ml-py==12.535.108? @JensWendt
- It looks like that nvidia-ml-py 12.535.108 should correct all process-information related bugs, reverting the breaking changes in the previous versions. But this is just my guess, I'm not sure. I would need the nvidia-ml-py version installed on the system.
@Lunar13737, @PyroGenesis, @mjmikulski thanks for the datapoints. Could you please try upgrading nvidia-ml-py==12.535.108 and see if the OverflowError is gone?
from gpustat.
Could you please try upgrading nvidia-ml-py==12.535.108 and see if the OverflowError is gone?
@wookayin I can confirm, overflow error does not occur in nvidia-ml-py 12.535.108
from gpustat.
@PyroGenesis Thanks. What was the previous version of nvidia-ml-py that resulted in this bug?
from gpustat.
@wookayin nvidia-ml-py 12.535.108 works for me, no overflow error
from gpustat.
Related Issues (20)
- Some low-level errors (like `pynvml.nvml.NVMLError_LibRmVersionMismatch`) result in nothing printed (std or diagnostic) HOT 1
- UserWarning: Failed to setupterm(kind='xterm'): setupterm: could not find terminfo database HOT 1
- Support anaconda's legacy pynvml package HOT 7
- How to obtain RPM value for the fans ? HOT 2
- Plugin Architecture
- module 'pynvml' has no attribute '_nvmlGetFunctionPointer' HOT 17
- Truncate the "command" when use "-f" HOT 1
- ModuleNotFoundError: No module named '_curses' HOT 2
- ModuleNotFoundError: No module named '_curses' HOT 2
- Process not displayed HOT 3
- make appimage format or binary file οΌit can run everywhere HOT 1
- make appimage format HOT 1
- gpustat reports only the first program on nv driver 535 HOT 4
- Include GDDR6(X) VRAM temperatures HOT 2
- Show CUDA Driver Version in the output HOT 2
- Enhance gpustat to Display Latest CUDA Version Compatible with Current NVIDIA Driver HOT 4
- Even more compact (single-line) output for statusline use HOT 2
- Misreported used memory with the driver 535.129.03 HOT 3
- /usr/bin/gpustat:6: DeprecationWarning: pkg_resources is deprecated as an API. HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
π Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. πππ
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google β€οΈ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from gpustat.