mozilla / neqo Goto Github PK

View Code? Open in Web Editor NEW

1.8K 38.0 115.0 7.27 MB

Neqo, an implementation of QUIC in Rust

Home Page: https://firefox-source-docs.mozilla.org/networking/http/http3.html

License: Apache License 2.0

C 0.13% Rust 99.17% Dockerfile 0.07% Shell 0.33% Python 0.29%

firefox ietf mozilla quic rust http3

neqo's Introduction

Neqo, an Implementation of QUIC in Rust

To build Neqo:

cargo build

This will use a system-installed NSS library if it is new enough. (See "Build with Separate NSS/NSPR" below if NSS is not installed or it is deemed too old.)

To run test HTTP/3 programs (neqo-client and neqo-server):

./target/debug/neqo-server '[::]:12345'
./target/debug/neqo-client 'https://[::]:12345/'

Build with separate NSS/NSPR

You can clone NSS and NSPR into the same directory and export an environment variable called NSS_DIR pointing to NSS. This causes the build to use the existing NSS checkout. However, in order to run anything that depends on NSS, you need to set an environment as follows:

Linux

export LD_LIBRARY_PATH="$(dirname "$(find . -name libssl3.so -print | head -1)")"

macOS

export DYLD_LIBRARY_PATH="$(dirname "$(find . -name libssl3.dylib -print | head -1)")"

Note: If you did not already compile NSS separately, you need to have Mercurial (hg), installed. NSS builds require GYP and Ninja to be installed.

Debugging Neqo

QUIC logging

Enable generation of QLOG logs with:

target/debug/neqo-server '[::]:12345' --qlog-dir .
target/debug/neqo-client 'https://[::]:12345/' --qlog-dir .

You can of course specify a different directory for the QLOG files. You can upload QLOG files to qvis to visualize the flows.

Using `SSLKEYLOGFILE` to decrypt Wireshark logs

You can export TLS keys by setting the SSLKEYLOGFILE environment variable to a filename to instruct NSS to dump keys in the standard format to enable decryption by Wireshark and other tools.

Using RUST_LOG effectively

As documented in the env_logger documentation, the RUST_LOG environment variable can be used to selectively enable log messages from Rust code. This works for Neqo's command line tools, as well as for when Neqo is incorporated into Gecko, although Gecko needs to be built in debug mode.

Some examples:

```
RUST_LOG=neqo_transport::dump ./mach run
```
lists sent and received QUIC packets and their frames' contents only.
```
RUST_LOG=neqo_transport=debug,neqo_http3=trace,info ./mach run
```
sets a debug log level for transport, trace level for http3, and info log level for all other Rust crates, both Neqo and others used by Gecko.
```
RUST_LOG=neqo=trace,error ./mach run
```
sets trace level for all modules starting with neqo, and sets error as minimum log level for other unrelated Rust log messages.

Trying in-development Neqo code in Gecko

In a checked-out copy of Gecko source, set [patches.*] values for the four Neqo crates to local versions in the root Cargo.toml. For example, if Neqo was checked out to /home/alice/git/neqo, add the following lines to the root Cargo.toml.

[patch."https://github.com/mozilla/neqo"]
neqo-http3 = { path = "/home/alice/git/neqo/neqo-http3" }
neqo-transport = { path = "/home/alice/git/neqo/neqo-transport" }
neqo-common = { path = "/home/alice/git/neqo/neqo-common" }
neqo-qpack = { path = "/home/alice/git/neqo/neqo-qpack" }
neqo-crypto = { path = "/home/alice/git/neqo/neqo-crypto" }

Then run the following:

./mach vendor rust

Compile Gecko as usual with

./mach build

Note: Using newer Neqo code with Gecko may also require changes (likely to neqo_glue) if something has changed.

neqo's People

Stargazers

Watchers

Forkers

mozilla-github-standards beurdouche quininer agrover isgasho ralph-mcteggart o0ignition0o stephendonner alex nanne007 kfabryczny neuroradiology harshulmca17 ekr 0x00a5 ddragana anshulmalik emilio manishearth cambricorp glandium mirefly nezdolik tomsnunes ayoalfonso undef1nd surender786 vonasek mahakbansal2019 martinthomson hilalisadev abbcdfin hanaasagi mdlglobal-atlassian-net fimbault peskyp juniorhsu global-localhost global19-atlassian-net acidburn0zzz chaosstudygroup kevinmiles sshyran kershawchang age-rs sinhasantos hawkinsw 16yuki0702 10allday-software cszwkyle makotokato clarkguan pdx-trader cnguoyj sme-ek awfeequdng jaydenelliott mb hugefiver 12101111 xirdigh hixio-mh nhnt11 redstrike heftig strogo metavai ajunlonglive noxazer-fr grubba27 standardgalactic caglaryucekaya dlrobertson mikeling swift-quic jimblandy jschwartzentruber mmmarcos lingchar xerg-sid dmgolembiowski highpon mayyasunil qiaoqiao321 slavapeshkin suryatmodulus marcblanchet forestofrain edgul tosunkaya binadamu-isiyoonekana matcha1024 jesup pyjcode frisoft lpardue acreskeymoz belhadjd caffeelake salaoiuid

neqo's Issues

Use constants in loss recovery tests

There are lots of constants here, like Duration::from_micros(94_609). The test would be far more readable with constants. Separating this into T1, T2, or STEP1, STEP2 and so forth would help a lot.

Originally posted by @martinthomson in #39

Coalesce 0-RTT with first client Initial

This would allow us to avoid wasting bytes on padding.

This is very tricky though. We don't currently determine what our 0-RTT parameters are until after we've generated - and constructed a packet - for the ClientHello. In order to do this, I think that we would need to have the set_resumption_token() function also have TLS generate the first CRYPTO frames. Then we'd have access to all of the 0-RTT state before calling generating any packets with process(). I think that this all just "works", but we'll need to be careful not to generate the ClientHello twice.

accessor for latest_rtt

#39 (comment)

Key update

Server class

We probably need a server class at the transport layer that manages routing of packets to different connections, based on connection ID.

I don't understand why this isn't a bug: detect_lost_packets()

        // Packets with packet numbers before this are deemed lost.
        let lost_pn = self
            .space_mut(pn_space)
            .largest_acked
            .saturating_sub(PACKET_THRESHOLD);

        qdebug!(
            [self]
            "detect lost packets - time={}, pn={}",
            lost_send_time,
            lost_pn
        );

        let packet_space = self.space_mut(pn_space);

        let mut lost = Vec::new();
        for (pn, packet) in &packet_space.sent_packets {
            // Mark packet as lost, or set time when it should be marked.
            if *pn <= packet_space.largest_acked {
                if packet.time_sent <= lost_send_time || *pn <= lost_pn {
                    qdebug!("lost={}", pn);
                    lost.push(*pn);

I just need to think about if using 1, 2, or 3 as an example value for .largest_acked, all do the correct thing. I have a hunch there's a bug if largest_acked saturates to 0 and then pn is small.

Resumption

http3 client/server don't like too-big data.

I think the server might need to be a little smarter to deal with when the connection's send buffer (64KiB) is full.

Send blocked signal when qpack is blocked

In addition to the interface in #31, we should add a means of forcing the transport to send an appropriate blocked frame.

http3: server-initiated bidi stream should be connection error

I think I just made it just do stop sending, but no it should be a connection error.

Http3Connection::new() should not take a Connection as param

Maybe just make one internally?

client: work with privateoctopus server

This currently fails:

RUST_LOG=trace ./target/debug/neqo-http3-client http://test.privateoctopus.com:4433/ --db ./neqo-crypto/db

but client is still waiting for something instead of exiting. The ConnectionClose should be exposed in some (both?) event APIs so the client code can see it and do the right thing?

Add interface to query stream flow control

In h3, it's possible that the qpack encoder stream could get blocked. When that happens, we will want to send literals rather than block. If we block on the encoder stream, we also have to block the request stream. In order to test this, we need a way to ask the stream how much flow control credit it has.

This should also check the connection-level flow control.

Refactor generators

Right now, generators are just a little dynamic -- there are three up until the connection enters Closing state, and then the three get replaced by the one CloseGenerator.

This could be improved by being a little more dynamic and fine-grained. For example, right now StreamGenerator iterates through streams and each stream presents available data in lowest to highest offset so retransmit ranges are first, but between streams, new data for one stream could go out before retransmitted bytes for another, which is unfortunate.

This just all needs a rethink and refactor.

Current Qpack is a very simple one

the qpack algorithm needs optimization.

Filter incoming frame types by epoch

We shouldn't accept STREAM in Initial packets. Just as an example.

Test reception and handling of flow control "blocked" messages

@martinthomson says: It would be good to have a test that verified that we were getting "blocked" messages appropriately.

Convert now() to Instant::now()

Save HTTP state with session tickets

In #45 I added transport parameters to the "resumption token" that the client hands back to applications when resuming so that the transport can know what its bounds are for 0-RTT. However, HTTP doesn't do that.

quicwg/base-drafts#2790 points out that SETTINGS and session tickets don't always arrive in the right order for this to happen. That probably means withholding resumption tokens until both arrive. That implies that there might need to be another state involved or some sort of notification arrangement so that applications know when a token is available. Right now, the crypto and transport pieces don't have any way to signal the availability of a resumption token, but that might need to be added.

Firefox integration (placeholder)

dragana: I will try to do this, by Monday, but the time it too short.

Use lazy_static instead of OnceResult

See #45 review comments.

btw lazy_static is already used in gecko so we should not hesitate to use it in Neqo if it's useful.

Do the massive Huffman decode tables help?

huffman_decode_helper.rs is pretty big. In theory, this is to help make the decoding process more efficient, but we don't have any evidence that a simpler design is significantly less efficient. These files are impossible to review, so they really have to justify their existence pretty well.

merge h3/.9 client and server versions (client)

0-RTT

panic in set_loss_detection_timer

RUST_BACKTRACE=1 ./target/debug/neqo-client http://127.0.0.1:4433/6600000 --db ./neqo-crypto/db causes:

thread 'main' panicked at 'attempt to multiply with overflow', /builddir/build/BUILD/rustc-1.34.2-src/src/libcore/num/mod.rs:3516:24
stack backtrace:
   0: std::sys::unix::backtrace::tracing::imp::unwind_backtrace
   1: std::sys_common::backtrace::_print
   2: std::panicking::default_hook::{{closure}}
   3: std::panicking::default_hook
   4: std::panicking::rust_panic_with_hook
   5: std::panicking::continue_panic_fmt
   6: rust_begin_unwind
   7: core::panicking::panic_fmt
   8: core::panicking::panic
   9: core::num::<impl u64>::pow
             at /builddir/build/BUILD/rustc-1.34.2-src/src/libcore/num/mod.rs:3516
  10: neqo_transport::connection::LossRecovery::set_loss_detection_timer
             at neqo-transport/src/connection.rs:2787
  11: neqo_transport::connection::LossRecovery::on_packet_sent
             at neqo-transport/src/connection.rs:2633
  12: neqo_transport::connection::Connection::output_path
             at neqo-transport/src/connection.rs:1276
  13: neqo_transport::connection::Connection::output
             at neqo-transport/src/connection.rs:1160
  14: neqo_transport::connection::Connection::process_output
             at neqo-transport/src/connection.rs:934
  15: neqo_http3::connection::Http3Connection::process_output
             at neqo-http3/src/connection.rs:347
  16: neqo_client::process_loop
             at neqo-client/src/main.rs:152
  17: neqo_client::client
             at neqo-client/src/main.rs:261
  18: neqo_client::main
             at neqo-client/src/main.rs:291
  19: std::rt::lang_start::{{closure}}
             at /builddir/build/BUILD/rustc-1.34.2-src/src/libstd/rt.rs:64
  20: std::panicking::try::do_call
  21: __rust_maybe_catch_panic
  22: std::rt::lang_start_internal
  23: std::rt::lang_start
             at /builddir/build/BUILD/rustc-1.34.2-src/src/libstd/rt.rs:64
  24: main
  25: __libc_start_main
  26: _start

with PTO 807073 and pto_count 134 immediately prior.

transport: handle connection close better

We're currently transitioning state and sending an event but we should also probably be cleaning up streams, and maybe sending events that they have been closed/reset as well? Refer to the spec for desired behavior.

Need a programmatic way to terminate neqo-http3-server

For testing purposes, it would be convenient if there was an automatable way to shut down the test server. @ddragana maybe could can say what you think the most useful way might be? we can't send a SIGINT via kill() because... it's not cross platform, is that why?

Clarify loss delay calculation steps

#39 (comment)

It's looking a little LISPy, so maybe make the calculation steps a little more explicit.

Generate stateless reset

This probably needs a separate class: as a server you want to hold this globally (for all connections), but for a client you want to have this per-connection.

Block sending when there is no flow control credit

If we don't prevent buffering, we could create a deadlock. By blocking writes, applications (see h3) can abort writes when a stream is blocked, which is important if there is an inter-stream dependency.

Right now, all writes are buffered up to TX_STREAM_BUFFER, which means that deadlocks are quite possible.

Retry -- client role

Send first Initial multiple times

We appear to only send an Initial from the client once. We should probably try a few times.

deploy neqo-server on internet (either on AWS or one of our personal servers, even.)

Timers that end up being set in the past

The recovery spec (A.8) does say something about "This algorithm may result in the timer being set in the past, particularly if timers wake up late. Timers set in the past SHOULD fire immediately."

Should we just be ensuring the caller re-calls us, or should we go ahead and assert this never happens?

Originally posted by @agrover in #39

Well, for our API, we can't pass out a negative Duration for the delay time, so if loss recovery sets a timer in the past, we will probably clamp to 0 (I hope we don't underflow). In that case, I think that it is our responsibility to drive the state forward before we return.

congestion control

possible bug in get_earliest_loss_time

Both branches do the same thing.

            if loss_time == 0 {
                loss_time = packet_space.loss_time;
                pn_space = *space;
            } else if packet_space.loss_time != 0 && packet_space.loss_time < loss_time {
                loss_time = packet_space.loss_time;
                pn_space = *space;
            }

@ddragana what do you think?

Simplify type in http3::Connection

#39 (comment)

Retry -- Server

like #15 but for server role.

Detect stateless reset

Rewrite to avoid usage of SliceDeque crate

transport send stream uses SliceDeque to get a single slice of bytes to send, even though the buffer is circular and therefore can sometimes require two slices if the buffer wraps. (It does this with virtual memory magic.)

This is cool but probably not worth the added dependency. We should rewrite to use a regulat VecDeque. Either the users of next_bytes must handle two slices, or maybe we're ok with next_bytes not returning the entire range of available bytes if it wraps (could lead to two smaller stream frames instead of one, perhaps?)

Put this on Phabricator

This depends on #1 being resolved/

Process (thanks to glob): file a bug here with the information listed here.

Decide on final name for this code

Yes it's a bikeshed, but it seems like some things are gated on resolving this. Here's some possibilities I'll throw out there:

Neqo (current working name)
Quico
Minqo
Sable (a Mink-like creature)
Polecat (same as above)
Quickness (QUIC + NSS)
Morq (Mozilla-Originated Rust-based Quic)
Marq (Mozilla-Authored Rust-based Quic)
Jiffy

Un-Hack NSS version

#45 needed some NSS changes and these were taking a long time to make it through review, so @martinthomson put in a temp hack. This should be undone once the needed NSS changes are approved, but I don't want to hold up #45 any longer.

Fix Clippy issues in neqo-crypto

Disabled running clippy on some files in neqo-crypto, see lib.rs. It would be great to re-enable it, but I don't have enough knowledge of the code to determine if Clippy is finding things that should be fixed or not.

refactor Connection

Break out self-contained things into their own files
- streamId/StreamIdx structs
- FlowMgr
- Events
- Generators
get StreamIndexes out of struct Connection

@martinthomson any more?

Http3 client API vs server API

We currently have some APIs in the Http3Connection code that are exclusively for the client, and also others that are just server. Although they both share a great deal of common code, I'd like to raise the question as to whether at an API level these should be distinct -- a Http3Client class and an Http3Server class.

CODE_OF_CONDUCT.md file missing

As of January 1 2019, Mozilla requires that all GitHub projects include this CODE_OF_CONDUCT.md file in the project root. The file has two parts:

Required Text - All text under the headings Community Participation Guidelines and How to Report, are required, and should not be altered.
Optional Text - The Project Specific Etiquette heading provides a space to speak more specifically about ways people can work effectively and inclusively together. Some examples of those can be found on the Firefox Debugger project, and Common Voice. (The optional part is commented out in the raw template file, and will not be visible until you modify and uncomment that part.)

If you have any questions about this file, or Code of Conduct policies and procedures, please see Mozilla-GitHub-Standards or email [email protected].

(Message COC001)

Start HTTP over again if 0-RTT is rejected

Right now, the transport has some provisions for 0-RTT failing, but HTTP does not. It needs to throw everything out and start over. If we consider HTTP to have exclusive use of the transport, then it doesn't need to worry too much about getting an incompatible ALPN value, but we might need to worry about which of the different HTTP ALPN values are chosen.

Establish common time base for tests using Once

#39 (comment)

This changes the tests significantly. It exposes the tests to timing variance that the fixed value didn't. If we can fix the value here, that will help a lot.

I like that this now uses now() instead of 0 in a lot of places, but I think that we want a constant function. In doing the 0-RTT code, I need a lot more control over the way that we manage timers and tying this to the system clock is unworkable.

I know that it's hard to get a concrete Instant instance, but maybe we can use Once for picking that point in time.

Implement idle timeout

see -transport 10.2.

mozilla / neqo Goto Github PK

neqo's Introduction

Neqo, an Implementation of QUIC in Rust

Build with separate NSS/NSPR

Linux

macOS

Debugging Neqo

QUIC logging

Using SSLKEYLOGFILE to decrypt Wireshark logs

Using RUST_LOG effectively

Trying in-development Neqo code in Gecko

neqo's People

Stargazers

Watchers

Forkers

neqo's Issues

Recommend Projects

Recommend Topics

Recommend Org

Using `SSLKEYLOGFILE` to decrypt Wireshark logs