sagarkar10 / jobqueue Goto Github PK

View Code? Open in Web Editor NEW

0.0 2.0 0.0 13 KB

A simple Job queue to take docker as tasks and process them on a server.

License: MIT License

Jupyter Notebook 76.96% Shell 3.77% Python 19.28%

jobqueue's Introduction

@author Sagar Kar

A RabbitMq based job queying system to run docker as jobs.

Todo:

Requirements: -python3 -pika (pip) -rabbitmq (unix)

Remarks:

Learned about RabbitMQ
Made the docker run work wihout docker compose and understood the underying concept
Difficulty in the Popen Output. Pretty nasty.
The -t arg in docker run caussed the output formatting mess in th subprocess.Popen.PIPE.stdout.readline().

File Structure:

.
├── job_desc (all sample job description holds here)
│   ├── job_desc_1.json
│   └── job_desc_2.json
├── notebooks (scrap notebooks for testing)
│   └── rabbitmqSample.ipynb
├── README.md
└── scripts (the fileterd scripts)
    ├── cpu_usage.sh
    ├── mem_check.sh
    ├── receive.py (worker)
    └── send.py (client)

Project Description:

Job Queue

Create a job queue. It consists of two parts: a worker and a client.

Each job consists of:

a docker image
an array of cmd parameters to pass to the image
a dictionary of environment variables to pass to the image
a dictionary of cpu and memory requirements

The client takes the above details and enqueues the job.

A worker then pops the jobs from the queue and runs them.

Notes:

There are no restrictions on how long a task can run; Some may finish in weeks while others may run for weeks.
Assume jobs are idempotent; Each job should be run at least once.
Each worker should take cpu and memory available as inputs when it's started.
A worker should simultaneously run as many jobs as possible without overrunning either the cpu or the memory available
We must be able to run multiple workers with different cpu and memory availability simultaneously.

Recommend Projects