Code Monkey home page Code Monkey logo

criu-het's Introduction

More information can be found on the Wiki

CRIUHET -- A project to implement checkpoint/restore functionality for Linux on heterogeneous-ISA platforms

'criu-het' (het for heterogeneous) allows to create a checkpoint to an architecture that is different from the current one. Currently only aarch64 and x86_64 are supported. Only binaries compiled witht the popcorn-compiler (branch criu) are supported.

criu-het omes in two forms crit-in-criu (basically a patch to original CRIU binary) or heterogenous-simplified (a modification to CRIT). Additionally, criu-het includes scripts to simplify its usage: you can use criu-het -h to see the option added by criu-het, which is basically just one "--arch", the architecture to which your application will be restarted!

To install criu-het refer to the INSTALL.md file.

criu-het is based on CRIU (see below after the Example), and it was developed by Antonio Barbalace and Mohamed L. Karaoui.

Checkpoint/Restore Example (uses two bash windows):

[Before using criu-het, it is highly recommanded to try a homogeneous (maybe on the same machine) checkpoint/restore using simply criu]

Note: all criut-het commands may require root access ('sudo')

checkpoint (dump):

#To run a popcorn process on needs a binary for all supported arch (popcorn_x86-64 and popcorn_aarch64)
#In addition to the one of current arch (popcorn-hello: a copy of popcorn_x86-64)
bash0 $ ls
popcorn-hello popcorn-hello_x86-64 popccorn_aarch64
# start popcorn-hello on bash0
bash0 $ ./popcorn-hello


# on bash1 we use ps to find the pid of popcorn-hello
bash1 $ ps -C popcorn-hello
...
22851  pts/2  00:00 ./popcorn-hello
# we checkpoint the given process
bash1 $ criu-het dump --arch aarch64 -j -t 22851

# The dump. In addition to the normal dump. One can find the dump of aarch64 in aarch64' folder.
bash1 $ ls
arch64/  core-22851.img  fdinfo-2.img  ... tty-info.img

Restore:

# before restoring, we cp the target file to the target architecture
bash0 $ cp popcorn-hello_aarch64 popcorn-hello
# Note: the above step is currently done by criu-het...

bash1 $ #ssh to remote node
bash1 $ #cd to the dump folder and into the 'aarch64' folder
#Note: both systems are supposed to have the same filesystem
#Otherwise one needed to copy the dump and the binaries (into the same paths)

#Restoring the task on aarch64
bash1 $ ciru-het restore -j

CRIU -- A project to implement checkpoint/restore functionality for Linux

CRIU (stands for Checkpoint and Restore in Userspace) is a utility to checkpoint/restore Linux tasks.

Using this tool, you can freeze a running application (or part of it) and checkpoint it to a hard drive as a collection of files. You can then use the files to restore and run the application from the point it was frozen at. The distinctive feature of the CRIU project is that it is mainly implemented in user space. There are some more projects doing C/R for Linux, and so far CRIU appears to be the most feature-rich and up-to-date with the kernel.

The project started as the way to do live migration for OpenVZ Linux containers, but later grew to more sophisticated and flexible tool. It is currently used by (integrated into) OpenVZ, LXC/LXD, Docker, and other software, project gets tremendous help from the community, and its packages are included into many Linux distributions.

The project home is at http://criu.org. This wiki contains all the knowledge base for CRIU we have. Pages worth starting with are:

A video tour on basic CRIU features

CRIU introduction

Advanced features

As main usage for CRIU is live migration, there's a library for it called P.Haul. Also the project exposes two cool core features as standalone libraries. These are libcompel for parasite code injection and libsoccr for TCP connections checkpoint-restore.

Live migration

True live migration using CRIU is possible, but doing all the steps by hands might be complicated. The phaul sub-project provides a Go library that encapsulates most of the complexity. This library and the Go bindings for CRIU are stored in the go-criu repository.

Parasite code injection

In order to get state of the running process CRIU needs to make this process execute some code, that would fetch the required information. To make this happen without killing the application itself, CRIU uses the parasite code injection technique, which is also available as a standalone library called libcompel.

TCP sockets checkpoint-restore

One of the CRIU features is the ability to save and restore state of a TCP socket without breaking the connection. This functionality is considered to be useful by itself, and we have it available as the libsoccr library.

How to contribute

CRIU project is (almost) the never-ending story, because we have to always keep up with the Linux kernel supporting checkpoint and restore for all the features it provides. Thus we're looking for contributors of all kinds -- feedback, bug reports, testing, coding, writing, etc. Here are some useful hints to get involved.

Licence

The project is licensed under GPLv2 (though files sitting in the lib/ directory are LGPLv2.1).

criu-het's People

Contributors

0x7f454c46 avatar abarbala avatar adrianreber avatar alexkvp avatar aryabinin avatar avagin avatar carnil avatar cometzero avatar compor avatar covracer avatar dima-anet avatar eabatalov avatar efiop avatar fbocharov avatar filbranden avatar hqhq avatar intelfx avatar isilence avatar koct9i avatar kolyshkin avatar ligurio avatar moharaka avatar nixprime avatar oleg-nesterov avatar pstradomski avatar rppt avatar rst0git avatar snorch avatar veruu avatar xemul avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Forkers

bwry

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.