feat: stun protocol & stun connection #9

lchenut · 2024-03-08T13:10:49Z

Presentation

This PR is a part of the stack to create the nim-libp2p webrtc-direct transport (defined here: https://github.com/libp2p/specs/blob/master/webrtc/webrtc-direct.md)
We implement a small version of the STUN protocol following https://datatracker.ietf.org/doc/html/rfc5389
As the RFC tells us: STUN serves as a tool for other protocols in dealing with NAT traversal, thus we also implement a small version of the ICE Lite protocol following https://datatracker.ietf.org/doc/html/rfc8445.

The STUN protocol reads and writes from the UdpConn. All the received STUN messages are treated as such and answered directly, all the other received messages are queued to be read by the underlying DTLS protocol.

Stun protocol

As said above, this STUN implementation isn't complete. At the moment of writing this PR, we are only interested in implementing the server part of the webrtc-direct transport. We will reconsider and improve the implementation when vacp2p/nim-libp2p#409 is addressed or if the webrtc-direct spec changes, for example. But for now on, there are some part missing.

Non-exhaustive list of what is missing:

REALM and NONCE attributes (no authentification/security process described in the webrtc-direct spec)
Retransmissions (client side, we only implement the server)
Shared secret Request/Response/Error (again nothing described in the spec)

ICE Lite protocol

For the same reasons as to why the STUN protocol isn't fully implemented (and because it's in the webrtc-direct spec), we use the Lite implementation of ICE. And again, only the server (ICE-CONTROLLED) side. As the server in the webrtc-direct transport must be publicly available, the Lite version should be sufficient.

webrtc/stun/stun.nim

.github/workflows/ci.yml

tests/teststun.nim

webrtc/stun/stun_utils.nim

tests/teststun.nim

webrtc/stun/stun_transport.nim

tests/teststun.nim

webrtc/stun/stun_connection.nim

tests/teststun.nim

webrtc.nimble

webrtc/stun/stun_connection.nim

diegomrsantos · 2024-05-23T12:36:06Z

webrtc/stun/stun_connection.nim

+    laddr*: TransportAddress # Local address
+    raddr*: TransportAddress # Remote address
+    dataRecv*: AsyncQueue[seq[byte]] # data received which will be read by DTLS
+    stunMsgs*: AsyncQueue[seq[byte]] # stun messages received and to be


is this queue unbounded?

Yes, those two queues are unbounded.

should they be?

There's nothing in the RFC saying anything about this. And I'm not confident enough to say There should be a limit and this limit is this number. So I leave things as they are because of my uncertainty.

I'm not sure this is something always mentioned on specs, but I believe there should always be a limit to avoid memory DoS attacks. See more in https://github.com/libp2p/rust-libp2p/blob/master/docs/coding-guidelines.md#bound-everything.

webrtc/stun/stun_connection.nim

diegomrsantos · 2024-05-23T12:43:33Z

webrtc/stun/stun_connection.nim

+    try:
+      let decoded = StunMessage.decode(await self.stunMsgs.popFirst())
+      if not decoded.isFingerprintValid():
+        # Fingerprint is invalid, the StunMessage received might be a false positive.


What does "might be a false positive" mean? Not a Stun message? Why is it moved to the dataRecv queue?

Basically, there's a first check with the raw data (without decoding) where we check different things (the size of the message, the presence of the magic stun cookie etc...). When the incoming message is sorted, we can decode it. If something is wrong with the Fingerprint, it can means that it is, in fact, not a Stun Message but a message for another protocol. And the RFC specifies this by saying :

The FINGERPRINT attribute can aid in distinguishing STUN packets from packets of other protocols. See [Section 7](https://datatracker.ietf.org/doc/html/rfc8489#section-7).```

Can you briefly describe it in the comment and add the link? Why is it moved to the dataRecv queue?

Why is it moved to the dataRecv queue?

Because StunConn uses two queues for received messages:

one for Stun Messages stunMsgs which are decoded/answered/etc... in stunMessageHandler

one for the others protocols dataRecv (in the case of the WebRTC stack, it's DTLS messages), which are popped from the queue when read is called.

And, if the Fingerprint is wrong, it could be a false negative, which mean, it's not a Stun Message, but probably a DTLS message, thus it should be in dataRecv and not in stunMsgs

diegomrsantos · 2024-05-23T12:48:42Z

webrtc/stun/stun_connection.nim

+    except WebRtcError as exc:
+      trace "Failed to write the Stun response", error=exc.msg
+
+proc init*(


It should be new as StunConn is a ref.

diegomrsantos · 2024-05-23T12:54:34Z

webrtc/stun/stun_connection.nim

+  ##
+  await self.closeEvent.wait()
+
+proc close*(self: StunConn) =


Should we close the underlying UDP conn?

Udp conn is a really bad name, it should be Udp Transport, I might change this in another PR.
But no, UdpConn is closed only when we close the Stun transport

diegomrsantos · 2024-05-23T13:20:24Z

webrtc/stun/stun_connection.nim

+  if self.closed:
+    debug "Try to close an already closed StunConn"
+    return
+  self.closed = true


should it be the last line?

Or maybe having a closing and closed, but not sure.

I changed it, but as the proc is synchronous, it doesn't change anything

diegomrsantos · 2024-05-23T19:51:08Z

webrtc/stun/stun_utils.nim

+import sequtils, typetraits, std/sha1
+import bearssl
+
+proc generateRandomSeq*(rng: ref HmacDrbgContext, size: int): seq[byte] =


it is supposed to return a seq, but I believe it doesn't return anything, potentially nil.

What do you mean? I initialize result. It definitely returns a seq.

I find it confusing. There are 3 different ways of returning in Nim and it's used arbitrarily.

diegomrsantos · 2024-05-23T20:51:28Z

webrtc/stun/stun_utils.nim

+        rem = (rem shr 1) xor 0xedb88320'u32
+      else:
+        rem = rem shr 1
+    result[i] = rem


I'd recommend to avoid result and use explicit returns unless you have a very strong preference for that.

Hum.... I like result actually, it's a neat tool

I find it unusual and harder to reason. Another reason is that it is used arbitrarily in the procs. I believe it is better to follow the same pattern.

diegomrsantos

Looks great! Amazing job. Thanks for addressing the comments and delivering this.

feat: stun protocol & stun connection

a53c04f

lchenut mentioned this pull request Mar 8, 2024

feat: dtls connection using mbedtls #10

Open

lchenut added 8 commits March 15, 2024 13:43

rename getResponse into getPong and test it

6778672

add Username attribute

7be3d0a

genUfrag procedure

f4ca113

Add generateRandomSeq to generate a transaction id

81a2a51

First draft of getPing

52296d1

Merge remote-tracking branch 'origin/master' into stun-protocol

2b70ad7

Use UdpPacketInfo tuple

50fde87

Change closing debug message

82203f2

diegomrsantos reviewed Apr 2, 2024

View reviewed changes

webrtc/stun/stun.nim Outdated Show resolved Hide resolved

lchenut added 6 commits April 2, 2024 17:02

Add proper exception tracking

fa5d674

Change StunConn init behavior

3d070b5

Add a last UdpPacketInfo

762acd8

Add comments

5ce6796

refactor: change connection management

5e9a335

Add a lot of comments/Finish refactor

832a343

diegomrsantos reviewed Apr 15, 2024

View reviewed changes

.github/workflows/ci.yml Outdated Show resolved Hide resolved

diegomrsantos reviewed Apr 15, 2024

View reviewed changes

tests/teststun.nim Show resolved Hide resolved

diegomrsantos reviewed Apr 15, 2024

View reviewed changes

tests/teststun.nim Show resolved Hide resolved

diegomrsantos reviewed Apr 15, 2024

View reviewed changes

webrtc/stun/stun_utils.nim Outdated Show resolved Hide resolved

diegomrsantos reviewed Apr 15, 2024

View reviewed changes

tests/teststun.nim Outdated Show resolved Hide resolved

lchenut added 6 commits April 15, 2024 13:25

Add copyright headers on test files

18e302b

simplify newRng proc for testing

7eb2940

add exception tracking for stun transport asynchronous proc

f075c40

remove ping/pong example building in the ci

5100b40

rename getPong test

5c3afe1

remove maximum connections

4660ac0

diegomrsantos reviewed Apr 17, 2024

View reviewed changes

tests/teststun.nim Outdated Show resolved Hide resolved

diegomrsantos reviewed Apr 17, 2024

View reviewed changes

webrtc/stun/stun_transport.nim Outdated Show resolved Hide resolved

chore: make teststun more readable

d0c4013

diegomrsantos reviewed Apr 30, 2024

View reviewed changes

webrtc/stun/stun_transport.nim Outdated Show resolved Hide resolved

lchenut added 4 commits April 30, 2024 17:28

feat: use withValue instead of getOrDefault in Stun.connect()

559c857

feat: add check if Fingerprint is valid

735cde8

refactor: getAttribute and username/password provider

11b4d42

chore: removes genUfrag, should be in libp2p instead

3a5b206