Comments (3)
It is theoretically possible but slightly tricky: since any admin can decide to split the cluster into multiple partitions ('slices'), we would have to filter by partition name every time before running the 'squeue' commands.
This would bloat the code considerably. What we can do instead is add a command line option to filter by partition name. In this case the exporter will extract only the data related to a given partition, including its jobs.
In this case you'll have to run multiple exporters (in theory one for every partition) and just change the listening port with the command line flag.
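To make the idea concrete, here is a minimal sketch of the per-partition filtering described above. It is not the exporter's code, and the filter option itself does not exist yet. It assumes job rows in the shape printed by `squeue -h -o "%P|%T|%u"` (partition, job state, user); the function name `count_jobs_by_state` and the sample rows are hypothetical.

```python
from collections import Counter

# Sample text in the shape assumed from: squeue -h -o "%P|%T|%u"
# (partition | job state | user). The values are made up for illustration.
SQUEUE_SAMPLE = """\
batch|RUNNING|alice
batch|PENDING|bob
gpu|RUNNING|alice
gpu|SUSPENDED|carol
batch|RUNNING|carol
"""


def count_jobs_by_state(squeue_output, partition):
    """Count jobs per state, keeping only rows that belong to `partition`."""
    counts = Counter()
    for line in squeue_output.splitlines():
        fields = line.strip().split("|")
        if len(fields) != 3:
            continue  # skip blank or malformed rows
        part, state, _user = fields
        # Assumption: a job may list several comma-separated partitions.
        if partition in part.split(","):
            counts[state] += 1
    return dict(counts)


if __name__ == "__main__":
    print(count_jobs_by_state(SQUEUE_SAMPLE, "batch"))
```

Each exporter instance would apply one such filter and listen on its own port. Alternatively, `squeue` accepts `-p <partition>` and can do the filtering itself.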
I can try to add this feature in the coming weeks but I cannot say for sure when I'll be able to deliver it.
Regards,
Matteo
from prometheus-slurm-exporter.
Hi,
I have a closely related if not identical question. I want to essentially create a table in the Grafana dashboard that has all of the squeue information. We only have a single partition if that's an issue. Is this possible with the current code base, or would it require a significant update?
Thank you,
Collin
If any of you are still interested in these functionalities, you should check our latest commits.
Now this exporter is able to provide the following additional info:
- Running/suspended jobs per partition, broken down by Slurm account and user.
- Total/allocated/idle CPUs per partition, plus used CPUs per user ID.
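As a rough illustration of the per-partition CPU figures listed above, the sketch below parses lines in the shape printed by `sinfo -h -o "%R,%C"`, where `%C` reports CPUs as allocated/idle/other/total. Whether the exporter collects its data this way is an assumption; the function name `parse_partition_cpus` and the sample values are hypothetical.

```python
# Sample text in the shape assumed from: sinfo -h -o "%R,%C"
# (partition name, then CPUs as allocated/idle/other/total).
# The values are made up for illustration.
SINFO_SAMPLE = """\
batch,12/4/0/16
gpu,8/24/0/32
"""


def parse_partition_cpus(sinfo_output):
    """Return {partition: {"allocated", "idle", "other", "total"}}."""
    result = {}
    for line in sinfo_output.splitlines():
        line = line.strip()
        if "," not in line:
            continue  # skip blank or malformed rows
        name, cpus = line.rsplit(",", 1)
        parts = cpus.split("/")
        if len(parts) != 4:
            continue  # not in allocated/idle/other/total form
        allocated, idle, other, total = (int(p) for p in parts)
        result[name] = {
            "allocated": allocated,
            "idle": idle,
            "other": other,
            "total": total,
        }
    return result


if __name__ == "__main__":
    for partition, cpus in parse_partition_cpus(SINFO_SAMPLE).items():
        print(partition, cpus)
```

Each entry maps naturally onto one gauge per CPU state, labelled with the partition name.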