Images

This repository holds code to create Saturn Docker images.

Default images for customer use

A default image is defined as an image that, upon a fresh customer install, is immediately available to be attached to a Jupyter server or Dask cluster.

All default images should have at least the following packages with appropriate pins, floors, or ceilings. This ensures customers will be able to use Dask, Prefect, and Snowflake in every image.

name: saturn
channels:
- defaults
- conda-forge
dependencies:
- blas=*=mkl
- bokeh
- dask-ml
- dask
- distributed
- ipykernel
- ipywidgets
- matplotlib
- numpy
- pandas
- pip
- prefect
- pyarrow
- python=3.7
- python-graphviz
- s3fs
- scikit-learn
- scipy
- voila
- xgboost
- pip:
  - dask-saturn
  - prefect-saturn
  - snowflake-connector-python
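
A check like the following could confirm that an image's environment file still declares the required packages. This is a minimal sketch, not tooling from this repository: the `REQUIRED` set is an illustrative subset of the list above, the function names are hypothetical, and the line-based parsing only handles the simple layout shown here (a full YAML parser would be more robust).

```python
import re

# Illustrative subset of the required packages listed above (assumption:
# extend this set to match whatever the team actually enforces).
REQUIRED = {"dask", "distributed", "prefect", "dask-saturn",
            "prefect-saturn", "snowflake-connector-python"}


def declared_packages(env_text):
    """Collect package names from the `dependencies` section of an environment file."""
    names = set()
    in_dependencies = False
    for line in env_text.splitlines():
        # An unindented `key:` line starts a new top-level section.
        top_level_key = re.match(r"([A-Za-z_][\w-]*):", line)
        if top_level_key:
            in_dependencies = top_level_key.group(1) == "dependencies"
            continue
        item = line.strip()
        if not in_dependencies or not item.startswith("- "):
            continue
        spec = item[2:].strip()
        if spec.endswith(":"):
            # `- pip:` only introduces the nested pip list; it is not a package.
            continue
        # Drop version pins, floors, and ceilings such as `=3.7` or `>=1.0`.
        names.add(re.split(r"[=<>!~\s\[]", spec, maxsplit=1)[0].lower())
    return names


def missing_packages(env_text, required=REQUIRED):
    """Return the required packages that the environment file does not declare."""
    return sorted(set(required) - declared_packages(env_text))
```

Calling `missing_packages` on the text of an `environment.yml` returns an empty list when every required package is present.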

We need to keep images as small as possible, because image size directly impacts instance spin-up time.

The images include:

  • saturn: Data analysis, machine learning, and parallel processing with Dask
  • saturn-rapids: GPU acceleration with RAPIDS (GPU instance recommended)
  • saturn-tensorflow: Deep learning with TensorFlow (GPU instance recommended)
  • saturn-pytorch: Deep learning with PyTorch (GPU instance recommended)
  • examples-cpu: For running the examples-cpu project
  • examples-gpu: For running the examples-gpu project (GPU instance recommended)
  • saturn-geospatial: Geospatial IO, analysis and visualization

Adding a new image definition

Each image is stored in its own subdirectory. That subdirectory should have at least a Dockerfile and .dockerignore.
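
The layout rule above can be checked mechanically. The sketch below is hypothetical and not part of this repository: it assumes every non-hidden top-level directory is an image directory, which may not hold if the repository also contains helper folders.

```python
from pathlib import Path

# Files each image subdirectory is expected to contain, per the rule above.
REQUIRED_FILES = ("Dockerfile", ".dockerignore")


def incomplete_image_dirs(repo_root):
    """Map each image subdirectory to the required files it is missing."""
    problems = {}
    for subdir in sorted(Path(repo_root).iterdir()):
        # Assumption: skip plain files and hidden directories such as .git.
        if not subdir.is_dir() or subdir.name.startswith("."):
            continue
        missing = [name for name in REQUIRED_FILES
                   if not (subdir / name).is_file()]
        if missing:
            problems[subdir.name] = missing
    return problems
```

An empty dictionary means every image directory has both files.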

Dockerfile

A script that defines how to build the image.

For complete details on how to write Dockerfiles, see the official Docker documentation.

.dockerignore

Similar to .gitignore, .dockerignore is used to prevent unwanted files from being bundled in an image. For a good explanation of this, see "Do Not Ignore .dockerignore".

The images in this repository use .dockerignore files like this:

*
!app.py
!environment.yml

That syntax says "ignore everything EXCEPT app.py and environment.yml".
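
To make the "ignore everything, then re-include" behavior concrete, here is a simplified model of how such rules are evaluated. It is only a sketch: Docker matches patterns with Go's `filepath.Match` rules plus some extensions, whereas this example uses Python's `fnmatchcase`, so edge cases (nested paths, `**`) can differ. The function name `is_ignored` is hypothetical.

```python
from fnmatch import fnmatchcase


def is_ignored(path, patterns):
    """Return True if `path` would be left out of the build context.

    Simplified model: rules are checked in order, the last matching rule
    wins, and a leading "!" re-includes a previously ignored path.
    """
    ignored = False
    for pattern in patterns:
        negated = pattern.startswith("!")
        glob = pattern[1:] if negated else pattern
        if fnmatchcase(path, glob):
            ignored = not negated
    return ignored


# The rules used by the images in this repository.
PATTERNS = ["*", "!app.py", "!environment.yml"]

for name in ("app.py", "environment.yml", "README.md"):
    print(name, "ignored" if is_ignored(name, PATTERNS) else "kept")
```

Because the last matching rule wins, the order matters: placing `*` after the `!` lines would ignore everything again.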

For complete details on how to write .dockerignore files, see the Docker documentation.

R Images

There are two R base images. Each installs R, RStudio, and Python, and sets up the required environment variables, including support for reticulate.

  • saturnbase-rstudio - Built in the same manner as saturnbase but without JupyterLab.
  • saturnbase-rstudio-gpu-11.1 - Built using rocker/ml as the starting point so that GPUs can be supported. The rocker/ml image is copyrighted by the Rocker Project.

The R images that build from these two base images can then add R packages or Python packages as needed.
