The Tracing Plane and Baggage

1 Introduction

The Tracing Plane is a layered design for context propagation in distributed systems. The tracing plane enables interoperability between systems and tracing applications. It is designed to provide a simple "narrow waist" for tracing, much like how TCP/IP provides a narrow waist for the internet.

The Tracing Plane comprises a library for system instrumentation (Transit Layer) and a library for specifying and generating contexts (Baggage Buffers). These libraries are outlined in more detail below.

Baggage is our name for general purpose request context in distributed systems, and Baggage is implemented by the Tracing Plane. Though many systems already have request contexts -- e.g., Go's context package; Span contexts in Zipkin, OpenTracing and Dapper; request tags in Census; etc. -- none of them are general purpose. What this means is that if I instrument my distributed system to pass around Zipkin span contents, then later wish to use Census, I must reinstrument everything in order to pass around Census tags. That sucks.

This repository contains our Java reference implementation for the Tracing Plane and Baggage. This is an active research project at Brown University by Jonathan Mace and Prof. Rodrigo Fonseca. It is motivated by many years of collective experience in end-to-end tracing and numerous tracing-related research projects including X-Trace, Quanto, Retro, Pivot Tracing. You can also check out our research group's GitHub. Keep an eye out for our research paper on Baggage, which will appear later in 2017!

Useful Links

Javadoc for this repository: jonathanmace.github.io/tracingplane/doc/javadoc/index.html

Example Zipkin / OpenTracing Tracers that are backed by Baggage Buffers: github.com/JonathanMace/tracingplane-opentracing

1 Introduction
2 Overview of The Tracing Plane
- 2.1 Tracing Plane Outer Layers
  - 2.1.1 Transit Layer (for System Developers)
  - 2.1.2 Baggage Buffers (for Tracing Applications)
- 2.2 Tracing Plane Inner Layers
3. Building and Compiling
Old notes and thoughts

TODO

I need more details TODO (FAQ for researchers, tracing application devs, system devs, and curious observers)
Getting started - downloading, prerequisites, and building TODO
Simple example - baggage buffers TODO
Tutorial - instrument your system TODO
Overview of APIs for each layer TODO
Project Status TODO

2 Overview of The Tracing Plane

The Tracing Plane has four layers, illustrated in green in the figure below. Depending on who you are, your entry point to the Tracing Plane differs.

System developers use the Transit Layer APIs to instrument their system to pass baggage around.

Tracing application developers use the Baggage Buffers IDL to generate contexts and APIs for their tracing application.

In the middle, the Atom and Baggage layers provide generic interfaces that together enable a multitude of different kinds of tracing applications to coexist.

The above figure illustrates how the transit and atom layers are the minimum requirement for a system to be 'Tracing Plane enabled'. The Atom Layer is a very simple, straightforward, and generic representation of context, with simple rules for how to propagate it.

The Baggage Layer defines the Baggage Protocol, a way to lay out data for the atom layer. The Baggage Buffers layer provides an IDL for defining custom contexts in an easy way, and automatically generates the correct Baggage Protocol representation of the custom context.

2.1 Tracing Plane Outer Layers

There are two target audiences for the Tracing Plane.

First is the System Developers who write components of the distributed system and must instrument their system to pass contexts around. The Transit Layer at the bottom of the stack is the entry point for this audience.

Second is Tracing Application Developers who write tracing applications such as Zipkin, X-Trace, and many others. These developers want to pass metadata through many system components, across application, process, and system boundaries. The Baggage Buffers layer at the top of the stack is the entry point for this audience.

2.1.1 Transit Layer (for System Developers)

The Transit Layer has just one purpose: abstract the task of system instrumentation so that it only has to be done once. System instrumentation is the most laborious part of tracing. You have to modify every system component to make sure request contexts are passed around -- for example, passed to new threads when they're created, included in continuations and thread pool queues, serialized to RPC headers, etc.

Many systems already have this kind of instrumentation -- they already pass around request IDs, span contexts, tags, or other metadata. However, in every case we have ever seen, the metadata passed around is tightly bound to the system, making it difficult or impossible to easily extend it to add new fields or change its behavior.

The Transit Layer is an instrumentation abstraction that makes no attempt to interpret the contents or meaning of the baggage being carried. This lets you reuse existing instrumentation whenever you want to deploy a new tracing application. Instrumentation reuse overcomes an enormous barrier to entry -- we cannot overstate how useful this is!

To the transit layer, baggage is only ever an opaque object or byte array. When system developers instrument their system, they only need to consider where requests go -- they do not need to think about how to manipulate and update request contexts while requests execute.

The Transit Layer API is a set of static methods of the class Baggage (Javadoc). These methods provide a simple means to set and remove Baggage in a thread-local variable, create copies, and serialize Baggage.

See the (not yet written sorry) Transit Layer Tutorial for examples and usage of Transit Layer APIs.

2.1.2 Baggage Buffers (for Tracing Applications)

Baggage Buffers is an Interface Definition Language for generating tracing context interfaces. It makes it super easy to specify data that you want to be propagated in your system. Baggage Buffers is similar to protocol buffers in terms of syntax and usage -- first, you write a baggage buffers definition, eg xtrace.bb:

package edu.brown.xtrace;

bag XTraceMetadata {
	fixed64 taskId = 1;
	set<fixed64> parentEventIds = 2;
}

The Baggage Buffers Compiler is a command line tool that generates source files

bbc --java_out="target/generated_sources" src/main/baggage/xtrace.bb

The compiler generates source files with interfaces for accessing Baggage, eg XTraceMetadata.java (Javadoc)

public class XTraceMetadata implements Bag {
    public Long taskId = null;
    public Set<Long> parentEventIds = null;
    ...
    public static XTraceMetadata get();
    public static void set(XTraceMetadata xTraceMetadata);
    ...
}

Within the generated source files are two important accessor methods. These accessor methods interface with the baggage being carried in the current thread (controlled by the transit layer). get() accesses the baggage set in the current Thread, finds, and returns the XTraceMetadata bag. Similarly, set(..) sets the XTraceMetadata bag being carried in the current thread's Baggage.

That's all you need to be able to start propagating XTraceMetadata! A service in one part of a large, complicated system can toss some XTraceMetadata into a Baggage instance, and will faithfully receive it back (possibly merged with other XTraceMetadata instaces).

See the (not yet written sorry) Baggage Buffers Tutorial and Language Guide for examples and usage of Baggage Buffers. For now example.bb shows the supported fields and types (though numerous types are specified but not yet implemented eg counters, clocks, min, max, avg, sum, etc.... they will be by february)

2.2 Tracing Plane Inner Layers

Under the covers, Baggage Buffers interacts with the Baggage Layer, which defines a protocol for seamlessly composing potentially many contexts that may be present from different tracing applications.

In order to do this and also make it extremely simple for systems to propagate contexts, the baggage layer builds on top of the atom layer. The logic of the atom layer is extremely simple, but sufficient to support everything described so far.

2.2.1 Atom Layer

As described in Section 2.1.1, the Transit Layer abstracts the task of system instrumentation so that it only has to be done once. To the transit layer, and to system developers using Transit Layer APIs, baggage is only ever an opaque object or byte array. As a result, the transit layer delegates logic for the following two tasks:

Dividing and combining contexts when executions branch and rejoin. If baggage is just a cryptic array of bytes [ 0x08, 0xAF, ...], how are you supposed to take two different arrays and merge them into one?
Enforcing capacity restrictions on baggage. Again, if baggage is just a cryptic array of bytes, how can you ditch some of the bytes if the array is too big?

The Atom Layer provides a simple implementation of branch, join, serialize and trimming logic for the Transit Layer and is designed to support our principle goal: a general purpose request context.

The Atom Layer represents baggage as an array of atoms where an atom is an array of bytes. Atoms can have arbitrary length. The Atom Layer implements operations as follows:

Serialization and Deserialization: length prefix the bytes of each atom
Branch: each branch receives its a copy of the atoms with no modifications
Join: merge the two arrays of atoms using lexicographic comparison
Trim: drop atoms from the end of the array of atoms until size requirement is met; then append the overflow marker (more below)

A full description of the Atom Layer with examples can be found here.

The Atom Layer Javadoc also provides more information in comments.

2.2.2 Baggage Layer

The Baggage Layer specifies and implements the Baggage Protocol. The Baggage Protocol specifies the data format and layout for atoms such that:

Different tracing applications can put atoms in the baggage and get them back without interference from other tracing applications
Tracing applications can utilize a variety of data types including primitives, sets, maps, counters, clocks, and any state-based conflict-free replicated datatype.
If overflow occurs, we know exactly which tracing applications were affect and which were not. This means we can also implement inexact datatypes such as approximate counters.

The full outline of the Baggage Protocol can be found here. A summary of what the Baggage Protocol offers is:

Atom representations for tree-structured data, where each node in the tree can contain data and have children.
Consistent merging of atoms of tree-structured data, such that merge(A, B) contains all nodes from both A and B, but does not duplicate identicial nodes that occur in both.
As a result, if an execution repeatedly branches and joins again, merged baggage does not blow up with duplicate elements.

The Baggage Layer Javadoc 1, 2 also provides more information in comments.

3. Building and Compiling

Clone this repository and build with maven:

mvn clean package install

Currently, the project is set up to first build the Baggage Buffers Compiler, which requires Scala to build.

After building, the dist folder will contain built jars and dependencies, which you can place on your classpath.

Additionally, the Baggage Buffers Compiler will be built in resources/bbc.jar. To run the compiler, invoke java -jar resources/bbc.jar

Old notes and thoughts

=== Why is this problem hard? ===

Change is the norm -- components change all the time, hard to keep up, only a few involved
Lots of different tasks need e2e propagation and propagate different things
Executions aren't simple! Not linear, but graphs

=== What do we propose? ===

Ultimate goal: generic protocol for propagating context that:

enables multiple participants simultaneously and opaquely
dynamic and adaptable at runtime
handles graph structure without requiring knowledge of the data being carried
supports many different data types

chen0031 / tracingplane Goto Github PK

tracingplane's Introduction

The Tracing Plane and Baggage

1 Introduction

Useful Links

Table of Contents

TODO

2 Overview of The Tracing Plane

2.1 Tracing Plane Outer Layers

2.1.1 Transit Layer (for System Developers)

2.1.2 Baggage Buffers (for Tracing Applications)

2.2 Tracing Plane Inner Layers

2.2.1 Atom Layer

2.2.2 Baggage Layer

3. Building and Compiling

Old notes and thoughts

tracingplane's People

Contributors

Stargazers

Watchers

Recommend Projects

Recommend Topics

Recommend Org