A Dolev-Yao-model-guided fuzzer for TLS

License: MIT License

GDB 0.01% Rust 91.57% C 0.49% Shell 1.90% Python 1.97% Perl 3.77% sed 0.06% Just 0.15% Nix 0.08%

fuzzer tls symbolic tls13 fuzzing tls12

tlspuffin's Introduction

tlspuffin

TLS Protocol Under FuzzINg

A Dolev-Yao guided fuzzer for TLS

Developed at LORIA, Inria, France and Trail of Bits, USA

Master Thesis | Thesis Presentation | Documentation

Disclaimer: The term "symbolic-model-guided" should not be confused with symbolic execution or concolic fuzzing.

Description

Fuzzing implementations of cryptographic protocols is challenging. In contrast to traditional fuzzing of file formats, cryptographic protocols require a specific flow of cryptographic and mutually dependent messages to reach deep protocol states. The specification of the TLS protocol describes sound flows of messages and cryptographic operations.

Although the specification has been formally verified multiple times with significant results, a gap remains because implementations of the protocol have not undergone the same logical analysis. Because the development of cryptographic protocols is error-prone, multiple security vulnerabilities have been discovered in implementations of TLS which are not present in its specification.

Inspired by symbolic protocol verification, we present a reference implementation of a fuzzer named tlspuffin, which employs a concrete semantics to execute TLS 1.2 and 1.3 symbolic traces. Attacks which mix TLS versions are also in scope of this implementation. This method allows us to use a genetic fuzzing algorithm to fuzz protocol flows, which is described by the following three stages.

By mutating traces we can deviate from the specification to test logical flaws.
Selection of interesting protocol flows advances the fuzzing procedure.
A security violation oracle supervises executions for the absence of vulnerabilities.
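The three stages can be sketched as a small loop. This is only a toy illustration: the `Trace` alias, the two mutators, `toy_target`, and the use of trace length as a stand-in for coverage are all hypothetical and are not taken from tlspuffin's code, which builds on LibAFL.

```rust
/// Hypothetical stand-in for a symbolic trace: a list of message names.
type Trace = Vec<&'static str>;
/// A mutator derives a new trace from an existing one.
type Mutator = fn(&Trace) -> Trace;

/// What the harness reports after executing one trace (toy model).
struct Execution {
    /// Stand-in for edge coverage; a real fuzzer reads a coverage map.
    coverage: usize,
    /// Set by the security violation oracle.
    violation: bool,
}

/// Mutator: drop the second message of the trace, if there is one.
fn skip_step(trace: &Trace) -> Trace {
    let mut mutated = trace.clone();
    if mutated.len() > 1 {
        mutated.remove(1);
    }
    mutated
}

/// Mutator: send the last message a second time.
fn repeat_last(trace: &Trace) -> Trace {
    let mut mutated = trace.clone();
    if let Some(last) = trace.last() {
        mutated.push(*last);
    }
    mutated
}

/// Toy target with a planted flaw: it accepts Finished without a Certificate.
fn toy_target(trace: &Trace) -> Execution {
    let authenticated = trace.contains(&"Certificate");
    let finished = trace.contains(&"Finished");
    Execution {
        coverage: trace.len(),
        violation: finished && !authenticated,
    }
}

/// Runs the mutate / select / check loop for a bounded number of rounds.
fn fuzz(
    seed: Trace,
    mutators: &[Mutator],
    execute: fn(&Trace) -> Execution,
    rounds: usize,
) -> Option<Trace> {
    let mut best_coverage = execute(&seed).coverage;
    let mut corpus = vec![seed];
    for _ in 0..rounds {
        for parent in corpus.clone() {
            for mutate in mutators {
                // Stage 1: mutate a trace to deviate from the specification.
                let candidate = mutate(&parent);
                let execution = execute(&candidate);
                // Stage 3: the oracle flags a security violation.
                if execution.violation {
                    return Some(candidate);
                }
                // Stage 2: keep traces that reach new behaviour.
                if execution.coverage > best_coverage {
                    best_coverage = execution.coverage;
                    corpus.push(candidate);
                }
            }
        }
    }
    None
}
```

Calling `fuzz` with the seed `["ClientHello", "Certificate", "Finished"]` and both mutators returns the trace in which the certificate step was skipped, because the toy target's planted flaw is triggered by it.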

The novel approach allows rediscovering known vulnerabilities which are out of scope for classical bit-level fuzzers. This shows that it is capable of reaching critical protocol states. Despite the promising methodology, no new vulnerabilities were found by tlspuffin. This can be explained by the fact that the implementation effort of TLS protocol primitives and extensions is high and not all features of the specification have been implemented. Nonetheless, the approach is promising in terms of quickly reaching high edge coverage, the expressiveness of executable protocol traces, and a stable and extensible implementation.

Features

Uses the LibAFL fuzzing framework
Fuzzer which is inspired by the Dolev-Yao symbolic model used in protocol verification
Domain specific mutators for Protocol Fuzzing!
Supported Libraries Under Test:
- OpenSSL 1.0.1f, 1.0.2u, 1.1.1k
- LibreSSL 3.3.3
- wolfSSL 5.1.0 - 5.4.0
Reproducible for each LUT. We use Git submodules to link to forks that are in the tlspuffin organisation
70% Test Coverage
Written in Rust!

Dependencies

build-essential (make, gcc)
clang
graphviz

OpenSSL 1.0:

makedepend from the xutils-dev package

WolfSSL:

autoconf
libtool

For the python tlspuffin-analyzer:

libyajl-dev
wheel from Python pip

Building

Build the project:

git clone https://github.com/tlspuffin/tlspuffin.git
cd tlspuffin
git submodule update --init --recursive
cargo build

Running

Fuzz using three clients:

cargo run --bin tlspuffin -- --cores 0-3

Note: After switching the Library Under Test or its version do a clean rebuild (cargo clean). For example when switching from OpenSSL 1.0.1 to 1.1.1.

Testing

cargo test

Command-line Interface

The syntax for the command-line of tlspuffin is:

tlspuffin [⟨options⟩] [⟨sub-commands⟩]

Global Options

Before we explain each sub-command, we first go over the options in the following.

-c, --cores ⟨spec⟩

This option specifies on which cores the fuzzer should assign its worker processes. It can either be specified as a list by using commas "0,1,2,7" or as a range "0-7". By default, it runs just on core 0.
-i, --max-iters ⟨i⟩

This option bounds the number of iterations the fuzzer performs. If omitted, the fuzzer runs indefinitely.
-p, --port ⟨n⟩

The initial communication between the fuzzer broker and workers happens over TCP/IP. Therefore, the broker requires a port allocation. The default port is 1337.
-s, --seed ⟨n⟩

Defines an initial seed for the PRNG used for mutations. Note that this does not make the fuzzing deterministic, because of randomness introduced by the multiprocessing.
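To make the two accepted forms of the --cores specification concrete, here is a minimal sketch of a parser for them. The function `parse_cores` is a hypothetical helper written for this illustration; it is not tlspuffin's or LibAFL's actual parsing code.

```rust
/// Parses a core specification such as "0,1,2,7" or "0-7" into core ids.
/// Hypothetical helper for illustration; not tlspuffin's actual parser.
fn parse_cores(spec: &str) -> Result<Vec<usize>, String> {
    let mut cores = Vec::new();
    for part in spec.split(',') {
        let part = part.trim();
        if let Some((start, end)) = part.split_once('-') {
            // Range form: both ends are inclusive.
            let start: usize = start
                .trim()
                .parse()
                .map_err(|_| format!("invalid range start in '{}'", part))?;
            let end: usize = end
                .trim()
                .parse()
                .map_err(|_| format!("invalid range end in '{}'", part))?;
            if start > end {
                return Err(format!("empty range '{}'", part));
            }
            cores.extend(start..=end);
        } else {
            // List form: a single core id.
            let id: usize = part
                .parse()
                .map_err(|_| format!("invalid core id '{}'", part))?;
            cores.push(id);
        }
    }
    Ok(cores)
}
```

With this sketch, "0-3" yields the cores 0, 1, 2 and 3, and "0,1,2,7" yields exactly those four ids. The sketch also accepts mixed input such as "0-2,5", which goes beyond what the option description promises.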

Sub-commands

Now we will go over the sub-commands execute, plot, experiment, and seed.

execute ⟨input⟩

This sub-command executes a single trace persisted in a file. The path to the file is provided by the ⟨input⟩ argument.
plot ⟨input⟩ ⟨format⟩ ⟨output_prefix⟩

This sub-command plots the trace stored at ⟨input⟩ in the format specified by ⟨format⟩. The created graphics are stored at a path provided by ⟨output_prefix⟩. The option --multiple can be provided to create for each step in the trace a separate file. If the option --tree is given, then only a single graphic which contains all steps is produced.
experiment

This sub-command initiates an experiment. Experiments are stored in a directory named experiments/ in the current working directory, with one directory per experiment. The title and description of the experiment can be specified with --title ⟨t⟩ and --description ⟨d⟩ respectively. Both strings are persisted in the metadata of the experiment, together with the current commit hash of tlspuffin, the version and the current date and time.
seed

This sub-command serializes the default seed corpus in a directory named corpus/ in the current working directory. The default corpus is defined in the source code of tlspuffin using the trace DSL.

Rust Setup

Install rustup.

The toolchain will be automatically downloaded when building this project. See ./rust-toolchain.toml for more details about the toolchain.

Make sure that you have the clang compiler installed. Optionally, also install llvm to have additional tools like sancov available. Also make sure that the usual build tools such as make and gcc are installed; they may be needed to build OpenSSL.

Advanced Features

Running with ASAN

ASAN_OPTIONS=abort_on_error=1 \
    cargo run --bin tlspuffin --features asan -- --cores 0-3

It is important to enable abort_on_error, else the fuzzer workers fail to restart on crashes.

Compiling with ASAN using rustc

RUSTFLAGS=-Zsanitizer=address cargo +nightly build --target x86_64-unknown-linux-gnu --bin tlspuffin -p tlspuffin --release --features wolfssl530

Generate Corpus Seeds

cargo run --bin tlspuffin -- seed

Plot Symbolic Traces

To plot SVGs do the following:

cargo run --bin tlspuffin -- plot corpus/seed_client_attacker12.trace svg ./plots/seed_client_attacker12

Note: This requires that the dot binary is on your path.

Note: The utility tools/plot-corpus.sh plots a whole directory.

Execute a Symbolic Trace (with ASAN)

To analyze a crash, you can execute the trace which crashes the testing harness:

cargo run --bin tlspuffin -- execute test.trace

To do the same with ASAN enabled:

ASAN_OPTIONS=detect_leaks=0 \
      cargo run --bin tlspuffin --features asan -- execute test.trace

Crash Deduplication

The following script creates log files for each crash and parses ASAN crashes to group crashes together.

tools/analyze-crashes.sh

Benchmarking

There is a benchmark in benchmark.rs which compares the execution of the dynamic functions to executing them directly. You can run it using:

cargo bench
xdg-open target/criterion/report/index.html

Documentation

This generates the documentation for this crate and opens the browser. This also includes the documentation of every dependency like LibAFL or rustls.

cargo doc --open

You can also view the up-to-date documentation here.

tlspuffin's Issues

Execute the whole TLS handshake using (extended) traces

We want to be able to execute a TLS handshake successfully by only moving the state machine of OpenSSL forward until the handshake is established.

Search for CVEs which give Ideas about Mutators

Like:

SKIP
...

Add certificates to fuzzing space

Not all code of OpenSSL is triggered by using a single static RSA certificate.

Try to include a set of certificates.

Implement a trace which outputs and inputs terms

It must be possible to get data out of it. This will probably be rustls messages.
It must also be possible to inject data. This could come from the attacker.

This task includes refactoring as we do not need the same kind of agents as in the current code.
We should also rename Send to Output and Expect to Input.

Add a function to Trace which checks whether it is consistent

Are only agents referenced which exist?
Are only steps referenced which exist?
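The two questions above could be answered by a check along the following lines. This is a sketch over a hypothetical, heavily simplified trace model: the types `AgentName`, `Action`, `Step`, `Trace` and `ConsistencyError` are invented for the example and do not mirror tlspuffin's real definitions. It also assumes that a step may only reference strictly earlier steps, which is stricter than merely requiring that the referenced step exists.

```rust
use std::collections::HashSet;

/// Hypothetical, simplified trace model; the real tlspuffin types differ.
#[derive(Debug, Clone, Copy, PartialEq, Eq, Hash)]
struct AgentName(u8);

#[derive(Debug)]
enum Action {
    /// The agent emits a message that the attacker can observe.
    Output,
    /// The agent receives data built from the outputs of the listed steps.
    Input { uses_steps: Vec<usize> },
}

#[derive(Debug)]
struct Step {
    agent: AgentName,
    action: Action,
}

#[derive(Debug)]
struct Trace {
    agents: Vec<AgentName>,
    steps: Vec<Step>,
}

#[derive(Debug, PartialEq)]
enum ConsistencyError {
    UnknownAgent { step: usize, agent: AgentName },
    UnknownStep { step: usize, referenced: usize },
}

impl Trace {
    /// Checks that every step refers only to declared agents and earlier steps.
    fn check_consistency(&self) -> Result<(), ConsistencyError> {
        let known: HashSet<AgentName> = self.agents.iter().copied().collect();
        for (index, step) in self.steps.iter().enumerate() {
            if !known.contains(&step.agent) {
                return Err(ConsistencyError::UnknownAgent {
                    step: index,
                    agent: step.agent,
                });
            }
            if let Action::Input { uses_steps } = &step.action {
                for &referenced in uses_steps {
                    // Assumption: only earlier steps can have produced data,
                    // so this also rejects indices past the end of the trace.
                    if referenced >= index {
                        return Err(ConsistencyError::UnknownStep {
                            step: index,
                            referenced,
                        });
                    }
                }
            }
        }
        Ok(())
    }
}
```

Returning a typed error instead of a plain boolean lets a mutator or a test report which step broke the trace.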

Add mutators for traces

This means we want to be able to add input and output steps, as well as mutate who receives which handle.

Implement a trace in which the attacker is the client

Ignore the TLS client and implement the handshake ourselves.

After that send an encrypted Finished from the client to the server:

After sending CH and Finished the handshake is finished.

Use only rustls Messages

Refactor such that we only deal with buffers of rustls messages. Right now we only want to send rustls messages. This means we do not need to store them as byte buffers.

This may simplify the code, as the parsing of packets happens right at the interface between OpenSSL and Rust.

See whether we can refactor VariableData

The VariableData and the data which flows into operations are basically the same. We could probably remove VariableData and replace it with Any.

Another way would be to add an enum VariableData. That would also remove the need for Box<> inside the dynamic function type, because the enum would be copyable/cloneable.
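The trade-off between the two options can be illustrated with a small sketch. The variants of `VariableData` and both accessor functions are hypothetical and were chosen only for the example; the actual data types handled by tlspuffin are not listed here.

```rust
use std::any::Any;

/// Closed set of data kinds (hypothetical variants). The enum can derive
/// Clone, so values can be duplicated without a trait object.
#[derive(Debug, Clone, PartialEq)]
enum VariableData {
    Random(Vec<u8>),
    SessionId(Vec<u8>),
    CipherSuite(u16),
}

/// With the enum, extracting a value is an ordinary pattern match.
fn as_cipher_suite(data: &VariableData) -> Option<u16> {
    match data {
        VariableData::CipherSuite(id) => Some(*id),
        _ => None,
    }
}

/// With `dyn Any`, the set of types is open, but every access is a runtime
/// downcast and a plain `Box<dyn Any>` cannot be cloned.
fn as_any_cipher_suite(data: &dyn Any) -> Option<u16> {
    data.downcast_ref::<u16>().copied()
}
```

When calling `as_any_cipher_suite` with a boxed value, pass `&*boxed` so that the downcast inspects the boxed value and not the `Box` itself.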

Add OpenSSL to the projects

The project is added via an openssl crate and a git submodule to a fork of it.

Control TLS Handshake Transcript

In TLS we keep a transcript hash of all actions. Maybe the attacker needs some kind of "memory" from which we can calculate a transcript hash.

Unable to stop LibAFL broker with Ctrl+C

Add a way to deconstruct messages

Right now we manually deconstruct each message in the old Expect steps. We need a procedure to deconstruct it.

Maybe use generators?

Trace to trigger a Vulnerability

We want to test whether the attacker is able to detect simple implementation vulnerabilities in TLS 1.3.

There are two ways:

Choose a known vulnerability which we know can be triggered by modifying fields in a TLS message
Introduce a vulnerability which can be triggered by an attacker

We do not yet need a way of automatically detecting it. This is done e.g. in https://gitlab.inria.fr/mammann/tlspuffin/-/issues/15

Implementation of a term algebra

A term looks like the following: f(g(a, b, c)). There is a basic implementation which allows the following:

Evaluate it using concrete implementations
Make it possible to dynamically build terms during runtime.

Some possibilities for ways of doing this:

Use a message (enum) type from which the types can be derived
- The defined operations take care of getting the correct type
Use higher level apis like KeyScheduleHandshake to implement operations: Needs exposure here https://github.com/ctz/rustls/blob/d03bf27e0b520fe73c901d0027bab12753a42bb6/rustls/src/lib.rs#L279
Automatically get all functions from rustls and make it possible to call them (through Map<str, fn>)
- Statically analyze rustls and get all possible ways of calling functions
- This would eliminate the need to define all operations manually
- Also very hard because attacker capabilities are not exposed easily. Some need context which can not easily be provided (deep static understanding is required)
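A term such as f(g(a, b, c)) and its evaluation through a map from symbol names to functions could look as follows. This is a minimal sketch under strong assumptions: all values are plain byte strings, and the symbols `concat` and `reverse` are toy functions invented for the example, not rustls operations.

```rust
use std::collections::HashMap;

/// A term is either a variable or a function symbol applied to sub-terms.
#[derive(Debug, Clone)]
enum Term {
    Variable(String),
    Application(String, Vec<Term>),
}

/// A concrete implementation of a function symbol over byte strings.
type ConcreteFn = fn(&[Vec<u8>]) -> Result<Vec<u8>, String>;

/// Toy function: concatenates all arguments.
fn concat(args: &[Vec<u8>]) -> Result<Vec<u8>, String> {
    Ok(args.concat())
}

/// Toy function: reverses its single argument.
fn reverse(args: &[Vec<u8>]) -> Result<Vec<u8>, String> {
    match args {
        [single] => Ok(single.iter().rev().copied().collect()),
        _ => Err(format!("reverse expects 1 argument, got {}", args.len())),
    }
}

/// Evaluates a term bottom-up using the concrete implementations.
fn evaluate(
    term: &Term,
    functions: &HashMap<&str, ConcreteFn>,
    variables: &HashMap<&str, Vec<u8>>,
) -> Result<Vec<u8>, String> {
    match term {
        Term::Variable(name) => variables
            .get(name.as_str())
            .cloned()
            .ok_or_else(|| format!("unbound variable {}", name)),
        Term::Application(symbol, args) => {
            let function = functions
                .get(symbol.as_str())
                .ok_or_else(|| format!("unknown function {}", symbol))?;
            // Evaluate the arguments first, stopping at the first error.
            let values = args
                .iter()
                .map(|arg| evaluate(arg, functions, variables))
                .collect::<Result<Vec<_>, _>>()?;
            function(&values)
        }
    }
}
```

Because terms are ordinary values, they can be built and rewritten at runtime, which is what a term mutator needs. The price of the string-keyed map is that arity and type errors only surface during evaluation.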

Bug Oracle: Channel Binding/Unique Channel Identifier

https://gitlab.inria.fr/mammann/tlspuffin/-/issues/15

Def.:

Unique Channel Identifier: If a client session and a server session have the same identifier cid, then all other parameters in these sessions must match (same cid, offer_C, mode_S, pk_C, pk_S, psk, k_c, k_s, psk').
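As a sketch, an oracle for this property compares the client's and the server's view of a session. The struct `SessionView`, its field types, and the reading of the final psk entry as a separate value (named `psk_next` here) are assumptions made for this illustration.

```rust
/// Hypothetical record of the parameters one party associates with a session.
#[derive(Debug, Clone, PartialEq, Eq)]
struct SessionView {
    cid: Vec<u8>,
    offer_c: Vec<u8>,
    mode_s: Vec<u8>,
    pk_c: Option<Vec<u8>>,
    pk_s: Option<Vec<u8>>,
    psk: Option<Vec<u8>>,
    k_c: Vec<u8>,
    k_s: Vec<u8>,
    /// Assumption: the last entry of the definition is a distinct PSK value.
    psk_next: Vec<u8>,
}

/// Returns true if both sessions share a cid but disagree on any parameter.
fn violates_unique_channel_identifier(client: &SessionView, server: &SessionView) -> bool {
    client.cid == server.cid && client != server
}
```

Sessions with different identifiers are never flagged, since the property only constrains sessions that share a cid.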

Model attacker knowledge

The attacker needs to have knowledge available to be able to generate terms.

This could be as simple as a vector of variables with specific types and corresponding handles to VariableData.
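One possible shape for such a vector is sketched below, with the index into the vector acting as the handle. The names `DataType`, `KnownVariable` and `AttackerKnowledge` are hypothetical, and the data is stored as raw bytes only to keep the example short.

```rust
/// Hypothetical type tags for the data the attacker can learn.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum DataType {
    Random,
    SessionId,
    CipherSuite,
}

/// One learned value together with its type tag.
#[derive(Debug, Clone)]
struct KnownVariable {
    data_type: DataType,
    data: Vec<u8>,
}

/// The attacker's knowledge: a growing vector of typed variables.
#[derive(Debug, Default)]
struct AttackerKnowledge {
    variables: Vec<KnownVariable>,
}

impl AttackerKnowledge {
    /// Stores newly observed data and returns its handle (the vector index).
    fn learn(&mut self, data_type: DataType, data: Vec<u8>) -> usize {
        self.variables.push(KnownVariable { data_type, data });
        self.variables.len() - 1
    }

    /// Returns the handles of all known variables of the requested type.
    fn handles_of(&self, data_type: DataType) -> Vec<usize> {
        self.variables
            .iter()
            .enumerate()
            .filter(|(_, variable)| variable.data_type == data_type)
            .map(|(handle, _)| handle)
            .collect()
    }

    /// Looks up the data behind a handle, if the handle is valid.
    fn get(&self, handle: usize) -> Option<&[u8]> {
        self.variables.get(handle).map(|variable| variable.data.as_slice())
    }
}
```

A term generator could then ask `handles_of` for every variable of the type an argument position requires and pick one of the returned handles.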

Add mutators for terms (attacker)

Add a config for TLS

Cipher Suite
Keys

Have about 5 configs with different keys.

Use procedural macros to generate dynamic functions from functions

Maybe we can use macros to speed up the calling of dynamic functions: https://doc.rust-lang.org/book/ch19-06-macros.html#procedural-macros-for-generating-code-from-attributes

This does not eliminate the use of Box because we still need a dynamic trait. To get rid of this we would need an enum type.

Check whether we need to serialize certain parts for LibAFL

LibAFL stores seeds on disk. It may require some serializable format for the trace and terms.

Expose every API automatically in rustls

The rust-analyzer tool allows rewriting Rust code based on syntax trees: rust-analyzer ssr 'fn $a() -> $d { $e } ==>> pub fn $a() -> $d { $e }'

Unfortunately this fails right now: rust-lang/rust-analyzer#5868

Other deprecated tool: https://github.com/google/rerast/

Add comments on Rust modules

Do not use Payload, but use Vec<u8>

Vec is more generic and can later be mapped to Payload, PayloadU16 etc.

Wrap all functions in a Result<Data, FnError>

Detect Deep Structure of Messages

When OpenSSL outputs a TLS message, we can only see which fields are there. We are not able to look deeper and learn which secrets have been used to compute the message. This would be needed to detect secrecy violations.

Here are some notes:

keylogging callback
-> follow flow into bitstrings

dynamic analysis ->

find constructors and destructors in source code manually
When creating a TLS message, log which information was used in which order
- e.g. session->session_id, ssl->server_random, hs->secret, derive_secret , hs->client_handshake_secret, hs->secret, derive_secret , hs->server_handshake_secret
Instrument these variables/functions which are traceable via XRay
Build terms out of these logs via heuristics.

https://llvm.org/docs/XRay.html#xray-runtime-library

https://github.com/llvm/llvm-project/tree/d480f968ad8b56d3ee4a6b6df5532d485b0ad01e/compiler-rt/lib/xray

Compile OpenSSL via clang

For:

Add libfuzzer sanitizer

Make sure OpenSSL and function symbols are deterministic

Implement a server attacker trace

Right now we only have a client as an attacker (seed), which does the computations necessary.

Why is 2nd CCS missing?

Rename Operation to ConcreteFunction

There is no need for another term.

Implement a trace with Session Resumption

https://datatracker.ietf.org/doc/html/rfc8446#section-2.2

Remove logical checks in rustls message parsing

tlspuffin/rustls@clone-message...maxammann:remove-checks

Search by fn read(r: &mut Reader) -> Option< in IntelliJ

Discussion: Model Variables as dyn Trait or Enum?

https://www.possiblerust.com/guide/enum-or-trait-object

Pro Trait:

Open Set
No code generation

Pro Enum:

Fast
Cloneable and no Heap required

The VariableData and the data which flows into operations are basically the same. We could probably remove VariableData and replace it with Any.

Another way would be to add an enum VariableData. That would also remove the need for Box<> inside the dynamic function type, because the enum would be copyable/cloneable.

Add a puffin logo for the project

Add a nice-looking puffin!

Setup LibAFL

LibAFL needs to be set up:

Allow the fuzzer to execute a trace
Serialize traces
Define a dummy mutator, which does nothing

Add launcher for LibAFL

https://github.com/maxammann/LibAFL/blob/bfbaa7ae83cfab6d94853b6be4081a54f772d921/fuzzers/libfuzzer_libpng_launcher/src/lib.rs#L170

Add a trace DSL: Improve macro syntax for traces

Even with var! and app! the syntax is still verbose and hard to get right. Maybe we can improve on this. Maybe we should have just one macro with multiple cases.

Serialize Box<DynamicFunction> and TypeShape in type_helper.rs

Evaluate if MultiMessage is required

I think we also could just have multiple In/Out steps

Add openssl options to fuzzing space

Right now we use SslOptions::ENABLE_MIDDLEBOX_COMPAT. Maybe we can increase the coverage by allowing the fuzzer to explore some more options

CI/CD

Run tests
Generate documentation: https://mammann.gitlabpages.inria.fr/tlspuffin/tlspuffin/

Path Coverage
Node Coverage to some extent
Different Protocol Runs (probably similar to Path Coverage)
New Types of messages/alerts discovered?

Bug Oracle: Authentication

Detect an authentication violation