Configuration
User adds the following to their vagga file:

```yaml
mixins:
- vagga/deploy.yaml

commands:
  deploy: !CapsuleCommand
    run:
    - vagga
    - _script
    - https://github.com/.../deployment_script
    - --destination=http://internal.network/your-deployment.json
    - --containers="python,assets"
```
Then they can run `vagga deploy staging` or `vagga deploy production`.
How Does it Work?
1. `CapsuleCommand` downloads a script
2. Then the script downloads `your-deployment.json`
3. Then the script generates `vagga/deploy.yaml` according to `your-deployment.json`
4. Then it runs `ciruela upload` to the servers described in the json
5. Then it pushes a list of images to the verwalter HTTP API
Details follow. The important points here:

- Step (4) is configurable. We might allow rsync, or use `docker import` and `docker push` instead.
- Step (5) is also configurable: it might not use verwalter, or it might put the metadata into intermediate storage and create a release from multiple repositories individually uploaded to the servers.
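The five steps above can be sketched as a small script skeleton. This is a minimal sketch, not the real `deployment_script`: the function names, the `ciruela upload` flags, and the API request shape are all illustrative assumptions.

```python
import json
import urllib.request


def fetch_deployment_spec(url):
    """Step 2: download your-deployment.json from the --destination URL."""
    with urllib.request.urlopen(url) as resp:
        return json.load(resp)


def build_upload_command(servers, image_dir):
    """Step 4: construct a `ciruela upload` invocation for the servers
    listed in the json (the exact CLI flags here are assumptions)."""
    cmd = ["ciruela", "upload"]
    for server in servers:
        cmd += ["--server", server]
    cmd.append(image_dir)
    return cmd


def push_versions(api_endpoint, versions):
    """Step 5: push the process-name -> container-name mapping to the
    verwalter HTTP API (endpoint shape is an assumption)."""
    body = json.dumps(versions).encode()
    req = urllib.request.Request(
        api_endpoint, data=body,
        headers={"Content-Type": "application/json"})
    return urllib.request.urlopen(req)
```

The generation of `vagga/deploy.yaml` (step 3) sits between these calls and is covered in the next section.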
What is in vagga/deploy.yaml
Basically, it wraps each container into:

```yaml
containers:
  xxx-deploy:
    setup:
    - !SubConfig
      path: vagga.yaml
      container: xxx
    - !Tar
      url: https://github.com/.../container-cooker.tar
      script: "./container-cooker"
```
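A generator for this wrapper is tiny. Here is a hedged sketch; the YAML tags are represented as plain dict keys (a real implementation would register custom tags with a YAML library), and `make_deploy_config` is a hypothetical name:

```python
def make_deploy_config(containers, cooker_url):
    """Wrap each container `xxx` from vagga.yaml into an `xxx-deploy`
    container that re-runs its setup and then runs container-cooker."""
    out = {"containers": {}}
    for name in containers:
        out["containers"][name + "-deploy"] = {
            "setup": [
                {"!SubConfig": {"path": "vagga.yaml", "container": name}},
                {"!Tar": {"url": cooker_url,
                          "script": "./container-cooker"}},
            ],
        }
    return out
```

For the `--containers="python,assets"` example above, this would produce `python-deploy` and `assets-deploy` entries.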
What Does "container-cooker" Do?
Note: the name container-cooker is just for explanatory purposes.
It validates configs and fixes things that commonly lead to mistakes:

- `!EnsureDir` for all the volumes
- Find lithos (or maybe other) configs, check which ones belong to this container (probably by looking at the executable), and copy them into the container
- Put configs, or metadata extracted from them, in some well-known place so that verwalter can find them
- Maybe optimize some things in the container: clean common tmp folders which vagga doesn't clean by default, reset timestamps, and recompile `*.pyc` files (the latter makes containers reproducible)
- Might execute some vulnerability checks, or extract package lists so it's easier to run vulnerability checks later
- Might generate some lithos configs from the vagga config
- Make `hosts` and `resolv.conf` symlinks
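The first fix in the list, deriving `!EnsureDir` steps from a container's volumes, can be sketched in a few lines. This assumes the cooker works on an already-parsed config dict; the function name and the volume layout are illustrative:

```python
def ensure_dir_steps(config):
    """Derive an !EnsureDir setup step for every volume mount point
    found in a parsed config, so the mount points are guaranteed to
    exist inside the container."""
    volumes = config.get("volumes", {})
    # sorted() keeps the generated steps deterministic between builds
    return [{"!EnsureDir": path} for path in sorted(volumes)]
```

The other fixes (copying lithos configs, resetting timestamps, making the `hosts`/`resolv.conf` symlinks) would be similar small, mechanical passes over the container tree.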
What Does your-deployment.json Contain?
It should describe the full deployment target; here is a non-exhaustive list of things:

1. A validator for lithos' metadata. We expect that every deployment can have its own scripting in verwalter, so it might need more or less metadata. Still, validation of metadata is super-useful. [*]
2. Additional checks for the config, e.g. it may require always having `PYTHONIOENCODING` if the executable is pythonic, or an `/app_version` file in the root of the container
3. An entry point for ciruela or another way of uploading images
4. An API endpoint for verwalter
5. Conventions on the configuration files: which are staging and which are production, so you can just name the environment.
All of these things except the hostnames (3, 4) could be a hardcoded convention, but I feel that would be too restrictive and would not take advantage of verwalter's full power (or would make it less convenient if metadata is not validated properly).
No keys/secrets/passwords are contained in the json. Keys are passed through environment variables.
[*] Not sure which validator to use, though; maybe json_schema or livr
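To make the list concrete, a hypothetical `your-deployment.json` might look like the following. Every key name here is an assumption for illustration; the real schema is exactly what this section leaves open:

```json
{
  "metadata_validator": {"comment": "json_schema or livr rules go here"},
  "config_checks": ["require-pythonioencoding", "require-app-version-file"],
  "upload": {"method": "ciruela", "servers": ["srv1.internal", "srv2.internal"]},
  "verwalter_api": "http://verwalter.internal/v1/",
  "environments": {"staging": "config/staging", "production": "config/production"}
}
```

Note that, per the point above, no secrets appear here; credentials would arrive via environment variables.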
How Does Verwalter Work?
Currently, verwalter relies on having all the needed metadata extracted from the repo/image/config and put into a "runtime" folder. While we're trying to move most things into the container itself, we still need one thing: a link between the containers which constitute a new version. I.e. a version `v1.2.3` might have containers `app.v1.2.3` and `redis.023fed`; that is, the redis container is versioned by hash and only updated when its configuration changes.
So the thing pushed into verwalter will be basically a dict:

```json
{
  "app-web-staging": "app.v1.2.3",
  "app-celery-staging": "app.v1.2.3",
  "redis": "redis.023fed"
}
```
I.e. a mapping of process name to its container name. The other metadata/configuration files are stored in the image itself under some convention (not a real one, to be determined):

```
/lithos-configs/app-web-staging.yaml
```
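Under such a convention, resolving each process in the pushed dict to its config file is mechanical. A sketch, assuming the path layout shown above (which the text explicitly marks as not final):

```python
def lithos_config_path(process_name):
    """Where verwalter would look up the lithos config for a process,
    following the (to-be-determined) /lithos-configs convention."""
    return "/lithos-configs/{}.yaml".format(process_name)


def config_paths(versions):
    """Resolve every process in the pushed version dict to its config path."""
    return {proc: lithos_config_path(proc) for proc in versions}
```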
And presumably, verwalter needs to figure out a few things:

- Which machines in the cluster have this image
- Get the lithos config from the image (by accessing ciruela itself) and extract metadata from it (by metadata I mean both the `metadata` key and useful things such as `memory-limit` or `cpu-shares`; maybe even display the whole thing in the GUI)
This should be enough for verwalter to do the work. Note: it's up to the scheduler whether to enable the version immediately, wait for more machines to fetch the image, only upgrade existing processes or run new ones right from the point their config is pushed, add new services to nginx automatically or not, and so on.
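The metadata-extraction step above can be sketched as a pure function over a parsed lithos config. The key names mirror the text (`metadata`, `memory-limit`, `cpu-shares`); treat the merging behavior as an assumption about what verwalter would want:

```python
def extract_metadata(lithos_config):
    """Pull out the pieces verwalter cares about from a parsed lithos
    config: the explicit `metadata` key plus resource-limit keys."""
    interesting = ("memory-limit", "cpu-shares")
    # start from the user-supplied metadata mapping, if any
    meta = dict(lithos_config.get("metadata", {}))
    # fold in the resource limits verwalter may want to display or use
    for key in interesting:
        if key in lithos_config:
            meta[key] = lithos_config[key]
    return meta
```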
Notes
- Versioning of deployment might be suboptimal if configs are copied by container-cooker. But since these are deployment containers, it's usually enough to put `!GitDescribe` into the command to make them rebuild often enough (basically every commit). We don't want to put it in by default because you might want database containers which are not restarted on each deploy. Another option is to explicitly opt out of versioning on the script's command line
- Caching of the json is unclear, but basically it can be cached for a dry run and never cached for deployment (i.e. you can check configs on an airplane, but obviously not deploy)
- At the end of the day, you can fork both the deployment-script and container-cooker and provide a very different deployment tool with the very same interface, say, one that packs and deploys to Heroku or an AMI
/cc @anti-social, @popravich