Skip to content
This repository has been archived by the owner on Oct 4, 2019. It is now read-only.

RLPx Encryption

ia edited this page Mar 5, 2018 · 11 revisions

Establishing a connection

This section describes how a peer session is created.

It is assumed that the underlying transport layer has an already established connection between peers. A session is established in two phases. The first phase is peer authentication with an encryption handshake (or crypto handshake) and the second is the base protocol handshake.

The purpose of peer authentication is to establish a secure communication channel by setting up an encrypted, authenticated message stream. And the purpose of the encryption handshake is to exchange ephemeral keys to set initial values for this secure session.

The second phase is to negotiate supported protocols, checking versions and network Ids and described in the DEVp2p wire protocol specification.

From the point of view of one peer we will distinguish connections in two ways. If the connection was initiated by a peer, we say that they are the initiator, and the other peer is receiver. The word remote is used to describe the 'other' peer in a connection when talking from a point of view of a node.

The other distinction is whether the remote peer is known or new. A known peer is one which has previously been connected and for which a corresponding session token is remembered. Session tokens are assumed to be persisted across sessions.

The protocol is somewhat different depending on whether the peer is initiator or receiver on the one hand and whether the remote peer is known or new on the other.

Encryption handshake

Creating a secure connection consists of the following steps.

  1. Initiator sends an authentication message to receiver
  2. Receiver responds with an authentication response message and sets up a secure session
  3. Initiator checks receiver's response and establishes a secure session
  4. Receiver and Initiator then send base protocol handshake on the established secure channel.

Either side may disconnect if authentication fails or if the protocol handshake isn’t appropriate.

If the handshake fails, if and only if initiating a connection TO a known peer, then the nodes information should be removed from the node table and the connection MUST NOT be reattempted. Due to the limited IPv4 space and common ISP practices, this is likely a common and normal occurrence, therefore, no other action should occur. If a handshake fails for a connection which is received, no action pertaining to the node table should occur.

All packets following the encryption handshake, including protocol negotiation handshake in step 4, are framed, encrypted and authenticated using the setup negotiated during the encryption handshake.

If the handshakes succeed, the fixed array of protocols supported by both peers will run on the connection paralelly to send and receive messages. Once established, packets are encapsulated as frames which are encrypted using AES-256 in CTR mode. Initial values for the message authentication and cipher are never the same, key material for the session is derived via a KDF and ECDHE-derived shared-secret. ECC uses secp256k1 curve (ECP). It is the purpose of the encryption handshake to negotiate these key values for a new secure session.

Encryption handshake spec

Initiator handshake

Initiator sends the following handshake.

initiator-handshake := ECIES.Encrypt(remote-pubkey, auth)

The initiator handshake is the authentication message auth encrypted with the remote peer's known public key using ECIES (Elliptic Curve Integrated Encryption System). Using AES256 in CTR mode with full MAC, the ECIES encrypted payload will have the following structure:

Offset Name Description
  0 | `ecies-pubkey`   |  65 byte representation of the pubkey needed for symmetric cipher
 65 | `AES-initial-vector` | 16 byte initial block for AES
 81 | `cipher-text`         | ECIES ciphertext size identical to plain text (uses AES-256 CTR, blocksize 16)
 275 | `ecies-mac` | ECIES message authentication code (256 bit)
 307 | **Total**

This authentication message serves to prove the following:

  • the owner of the private key wants a connection with the node controlling a particular public key
  • that they want the connection now based on common history
  • ...

and has the following structure:

Offset Name Description
  0 | `signature`   | `Sign(ecdhe-random-key, shared-secret ^ init_nonce)`
 65 | `ephemeral-key-mac`         | `SHA3(initiator-ecdhe-random-pubkey)` (256 bit)
 97 | `permanent-public-key` | NodeId public key (512 bit)
161 | `initiator-nonce` | a random key generated by the initiator for this session
193 | `known-peer` | flag indicating if a token was used as shared secret, 0x00: no, 0x01: yes 
194 | **Total

The signature part is there to check if both parties agree on the same shared secret. Initiator signs the shared secret with an ephemeral key using standard ECDSA with P256 Curve. The signature length is therefore exactly 65 bytes.

signature := Sign(ecdhe-random-key, shared-secret ^ init_nonce) 

The shared secret is xor-ed with a random nonce to make signatures non-reusable. Since the nonce is in the decrypted auth message and the receiver owns and agrees on a shared secret with initiator, the message text can be reconstructed by the receiver. Therefore the receiver is able to verify the signature and recover initiator's ephemeral public key using EC Recover.

What the shared secret actually is depends on whether the remote peer is a known peer or not. If a remote's public key is associated with a last session token, then that token is used as shared secret. If the peer is new, the initial shared secret is generated by ECDH based on the nodes' public keys.

shared-secret = session_token // for known peer
shared-secret = SSK(initiator-privkey, receiver-pubkey) // for new peers

Where SSK(initiator-privkey, receiver-pubkey) is a symmetric shared secret key as given by ECDH.

Whether a token was used is indicated in the last byte (known-peer) of the handshake.

ephemeral-key-mac is to authenticate the signature and is a Sha3 hash of initiator's ephemeral public key.

permanent-public-key is used to facilitate lookup of a token, if IP address and port is not reliable.

initiator-nonce is a 32 byte random nonce, used to derive session key material.

known-peer is a byte indicating if a previous session token is used (0x00 if not, 0x01 if yes), so basically signal whether the initiator recognises the receiver as a known peer.

Receiver handshake

Receiver receives the initiator handshake and decrypts it with its own public key. If the initiator indicated the use of a session token (last byte is set to 0x01) and the receiver finds it (under persistent-public-key or otherwise), then it can recover initiator's ephemeral public key with ECrecover by supplying the plain text which is shared_secret^initiator-nonce:

initiator-ecdhe-random-pubkey = ECRECOVER(signature, shared_secret^initiator-nonce)

once this is done, the recipient has all information in order to set up the secure session. See below.

The receiver handshake is sent encrypted with the initiator's public key using ECIES:

receiver-handshake = ECIES.Encrypt(initiator-pubkey, receiver-handshake)
Offset Name Description
  0 | `ecies-pubkey`   |  65 byte representation of the pubkey needed for symmetric cipher
 65 | `AES-initial-vector` | 16 byte initial block for AES
 81 | `cipher-text`         | ECIES ciphertext size identical to plain text (uses AES-256 CTR, blocksize 16)
 178 | `ecies-mac` | ECIES message authentication code (256 bit)
 210 | **Total**

where receiver-handshake has the following structure:

Offset Name Description
0 receiver-ecdhe-random-pubkey newly generated ephemeral key to serve as basis for session-secret
64 receiver-nonce a randomly generated nonce
96 receiver-known-peer if receiver has found session token for initiator (0x00) or not (0x01)
97 Total

It is somewhat unclear what the expected behaviour is if initiator submits an auth with session token, but receiver does not remember it and responds with 0x00. Should the connection be terminated or is there a fallback to shared secret?

Originator needs to inspect the receiver handshake response to recover receiver-ecdhe-random-pubkey and receiver-nonce. And this completes the key exchange.

Creating a new secure session

Once the encryption handshake's been completed, both parties can calculate the same initial values for encryption and authentication.

First we need the remote ephemeral key to use it with ECDH. From the ecdhe key we derive a shared secret.

ecdhe-shared-secret = ecdh.agree(ecdhe-random-key, remote-ecdhe-random-pubkey)
shared-secret = sha3(ecdhe-shared-secret || sha3(receiver-nonce || initiator-nonce))

The session token is updated to be the sha3 hash of the new shared secret and need to be remembered.

token = sha3(shared-secret)

It is unclear at exactly what point we should update this token, ie., shall we somehow wait for the baseprotocol handshake to complete as well?

The other parameters are only relevant for this session:

aes-secret = sha3(ecdhe-shared-secret || shared-secret)
mac-secret = sha3(ecdhe-shared-secret || aes-secret)
initiator-mac = sha3(mac-secret^receiver-nonce || init-auth)
receiver-mac = sha3(mac-secret^initiator-nonce || init-auth)

Note that if any of these calculations do not match for the peers, frame authentication will fail immediately, so the connection will be terminated.

Encryption and authentication

The connection is secured by encryption and authentication. Encryption and authentication is done frame by frame separately for headers and payload.

Authentication is using Keyed-Hash Message Authentication Code (HMAC FIPS198) using SHA256. An HMAC is set up for incoming (ingress) and outgoing (egress) traffic.

Egress MAC is updated with the plaintext of the outgoing frame payload and the checksum is sent appended to the frame (in cleartext).

Ingress MAC is updated with the plaintext of the decrypted frame payload and the checksum is verified against the MAC received appended to the frame. If they do not match, the datastream has been tampered with, the connection must be terminated. If they match, it proves the authenticity of the frame since an attacker would need the entire history of this session as well as the ephemeral keys exchanged during the handshake even.

For the MACs to match, we need to ensure that initiator's egress MAC is identical to receiver's ingress MAC, and initiator's ingress MAC is identical to receiver's egress MAC. So given the key definitions at the end of the handshake, initiator initialises its egress-mac with initiator-mac and its ingress-mac with receiver-mac, whereas receiver initialises its egress-mac with receiver-mac and its ingress-mac with initiator-mac.

Encryption uses the AES-256 block cipher in CTR mode using aes-secret as key and initial vector ?