Traffic filtering #2030

jamilbk · 2023-08-24T17:38:54Z

connlib: control protocol additions
connlib: gateway packet filter implementation
firezone/product#654

jamilbk · 2024-04-22T16:03:32Z

Time to make this happen :-)

Todo:

portal:
- part of Team plan
- Allow editing existing Resources' traffic filters on Starter (to handle downgrade edge cases)
- On Starter, when adding a Resource, show Traffic filters but disabled, with "upgrade to unlock" (most users won't read announcements or the feature matrix)
gateway: implement traffic filters received from Portal
docs: add traffic filter docs section either in Resources or separately
announce in newsletter/socials
website: add to pricing matrix

jamilbk · 2024-04-22T17:42:30Z

Edge case with DNS Resources:

If DNS resources map to the same IP and you want to allow access to port 80 but not port 8080 for a User, this runs into an issue where the user would be allowed both.

AndrewDryga · 2024-04-22T17:43:33Z

We should explain that traffic filters work at the IP level: they apply after the DNS name is resolved, and the rules are then merged per destination IP.

Resource A: a.mycorp.com:80
Resource B: b.mycorp.com:123

On Gateway:
RSLV a.mycorp.com -> 100.100.100.100
RSLV b.mycorp.com -> 100.100.100.100
Mapping 100.128.0.1 -> 100.100.100.100

On client:
You can access a.mycorp.com on both ports 80 and 123,
and you can access b.mycorp.com on both ports 80 and 123.
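
A minimal Rust sketch of the per-IP merging in this example, assuming filters are plain port lists keyed by the IP the Gateway resolved. The `Resource` struct and the `merge_by_ip`, `is_allowed` and `example_resources` functions are hypothetical names for illustration, not connlib's actual types.

```rust
use std::collections::{BTreeSet, HashMap};
use std::net::Ipv4Addr;

/// Hypothetical resource shape (not connlib's real type): a DNS name,
/// the IP it resolved to on the Gateway, and the ports its filter allows.
struct Resource {
    name: &'static str,
    resolved_ip: Ipv4Addr,
    allowed_ports: Vec<u16>,
}

/// Merge the port filters of all resources by resolved destination IP.
fn merge_by_ip(resources: &[Resource]) -> HashMap<Ipv4Addr, BTreeSet<u16>> {
    let mut merged: HashMap<Ipv4Addr, BTreeSet<u16>> = HashMap::new();
    for r in resources {
        merged
            .entry(r.resolved_ip)
            .or_default()
            .extend(r.allowed_ports.iter().copied());
    }
    merged
}

/// Check a port against the merged rules for the IP behind `name`.
/// The name only selects the IP; the rules themselves are per IP.
fn is_allowed(resources: &[Resource], name: &str, port: u16) -> bool {
    let merged = merge_by_ip(resources);
    resources
        .iter()
        .find(|r| r.name == name)
        .and_then(|r| merged.get(&r.resolved_ip))
        .map_or(false, |ports| ports.contains(&port))
}

/// The two resources from the example above.
fn example_resources() -> Vec<Resource> {
    let ip = Ipv4Addr::new(100, 100, 100, 100);
    vec![
        Resource { name: "a.mycorp.com", resolved_ip: ip, allowed_ports: vec![80] },
        Resource { name: "b.mycorp.com", resolved_ip: ip, allowed_ports: vec![123] },
    ]
}
```

With these inputs, `is_allowed(&example_resources(), "a.mycorp.com", 123)` is true even though Resource A only lists port 80, because both names share one IP.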

jamilbk · 2024-04-22T19:20:26Z

Possible solution? (involves lots of refactoring though):

github.com:80 -> 142.0.0.1
gitlab.com:443 -> 142.0.0.1

client queries github.com
receives 100.96.0.1
application sends traffic from 100.100.0.1:40000 to 100.96.0.1:80
client mangles packet 100.100.0.1:40000 -> 142.0.0.1:80: check that 80 is within 80, 443
resource responds 142.0.0.1:80 -> 100.100.0.1:40000
client mangles back 100.96.0.1:80 and replies to application
client queries gitlab.com
receives 100.96.0.2
application sends traffic from 100.100.0.1:40000 to 100.96.0.2:80
client mangles packet to 100.100.0.1:41000 -> 142.0.0.1:80
resource responds 142.0.0.1:80 -> 100.100.0.1:41000
client mangles back to 100.96.0.2:80 -> 100.100.0.1:40000

Gateway:

Resource: {
ports: [int]
}

Peer: {
CGNAT IP -> ResourceId1 {
mapped_src_ports: hash(): {orig_src_port, dst_ip} -> {dst_port, src_ip}
}

CGNAT IP -> ResourceId1 {
Name:
ruleset: (could be from multiple Resources -- list of ports to allow)
}
}

conectado · 2024-04-22T22:56:26Z

Current approach to DNS resources

Client makes DNS query for a resource.
DNS resolution is requested from the gateway, along with access.
Assuming that the portal allows it, gateway responds with the resolved DNS addresses and installs a permission for the resolved IPs for the given client
The client creates or re-uses a translation for the resolved IP(s) to CGNAT space, keeping the invariant that the "real" IP for a given gateway peer always maps to the same CGNAT IP, regardless of whether or not it's the same resource
Whenever the client sees an outgoing packet for the CGNAT IP, it translates it to the real IP
Packets incoming from the real IP are translated to CGNAT space
The gateway only accepts packets outgoing to the real IP

Drawback (regarding traffic filtering)

The problem is that when 2 different DNS resources resolve to the same real IP, we can't distinguish traffic between them, so the Gateway must apply the rules only based on IPs.

New approach (proposed by @jamilbk)

Summary

The idea is to have the gateway do the packet mangling; the client would only ever see the CGNAT IPs.

Then, on the gateway, the translation is made between CGNAT and real IPs. However, not only the IP would be translated: a new sport would also be picked.

Incoming traffic can then be uniquely distinguished by the tuple (dport, daddr, saddr), where daddr is used to see which client peer the traffic is meant for and the tuple (dport, saddr) is used to restore the original (dport, saddr), which was determined originally from the outgoing traffic.
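
A rough Rust sketch of the translation table this would need on the gateway, under stated assumptions: one table per client peer (so daddr has already selected the table), and a naive incrementing counter for picking the new sport, with no collision or expiry handling. `PeerNat` and its methods are hypothetical names, not existing connlib code.

```rust
use std::collections::HashMap;
use std::net::Ipv4Addr;

/// Hypothetical per-peer NAT state kept on the gateway.
struct PeerNat {
    /// Next source port to hand out (naive counter, assumption for the sketch).
    next_port: u16,
    /// (CGNAT IP the client used, client's sport) -> sport picked by the gateway.
    outbound: HashMap<(Ipv4Addr, u16), u16>,
    /// (dport of incoming packet, saddr = real IP) -> (client's sport, CGNAT IP).
    inbound: HashMap<(u16, Ipv4Addr), (u16, Ipv4Addr)>,
}

impl PeerNat {
    fn new() -> Self {
        PeerNat {
            next_port: 41_000,
            outbound: HashMap::new(),
            inbound: HashMap::new(),
        }
    }

    /// Translate an outgoing flow: pick (or re-use) a gateway-side sport and
    /// remember how to restore the original tuple for the replies.
    fn translate_outbound(&mut self, cgnat_ip: Ipv4Addr, client_sport: u16, real_ip: Ipv4Addr) -> u16 {
        if let Some(&port) = self.outbound.get(&(cgnat_ip, client_sport)) {
            return port;
        }
        let picked = self.next_port;
        self.next_port = self.next_port.wrapping_add(1);
        self.outbound.insert((cgnat_ip, client_sport), picked);
        self.inbound.insert((picked, real_ip), (client_sport, cgnat_ip));
        picked
    }

    /// Restore the original (dport, saddr) for a reply coming from the real IP.
    fn translate_inbound(&self, dport: u16, saddr: Ipv4Addr) -> Option<(u16, Ipv4Addr)> {
        self.inbound.get(&(dport, saddr)).copied()
    }
}
```

Because the picked sport differs per CGNAT IP, replies from the same real IP can be mapped back to the right resource even when the client re-uses one source port for both.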

Drawbacks

The problem still is that an attacker can use the daddr of the other resource to gain access to the dport for that resource.

And this requires a big refactor and a lot more state tracking on the gateway.

In general it seems impossible to apply rules meant for layer 5 to the traffic we see in layer 3

AndrewDryga · 2024-04-22T23:54:10Z

Would this solution work for two DNS resources with the same port filters for both of them? And what if the port filter rule for those is more than half of the port range, or the rules are partially overlapping large port ranges?

For example, active FTP port range is 20, 60000-65535.

conectado · 2024-04-23T00:36:38Z

> Would this solution work for two DNS resources with the same port filters for both of them? And what if the port filter rule for those is more than half of the port range, or the rules are partially overlapping large port ranges?
>
> For example, active FTP port range is 20, 60000-65535.

The approach is independent of the port filtering itself; it is just a way to distinguish between resources in the gateway.

AndrewDryga · 2024-04-23T00:38:30Z

But how do you distinguish when two port ranges overlap for the same IP but different resources?

conectado · 2024-04-23T01:03:21Z

> But how do you distinguish when two port ranges overlap for the same IP but different resources?

If they are different resources the daddr would be different, so we apply rules only meant for that ip.

Or you mean for incoming traffic? I think we do only egress filtering

conectado · 2024-04-23T01:04:06Z

> But how do you distinguish when two port ranges overlap for the same IP but different resources?

Also, I think we're not going to go with this approach for the reasons explained here:

> Drawbacks
>
> The problem still is that an attacker can use the daddr of the other resource to gain access to the dport for that resource.
>
> And this requires a big refactor and a lot more state tracking on the gateway.
>
> In general it seems impossible to apply rules meant for layer 5 to the traffic we see in layer 3

conectado · 2024-04-23T22:51:24Z

Seems like the portal is currently not sending ICMP filtering rules cc @AndrewDryga

conectado · 2024-04-24T20:14:46Z

Same for "Permit All": the filter field seems to come through empty.

We could assume that an empty filter means "allow all" but doesn't seem like the best way to go.

AndrewDryga · 2024-04-24T22:48:40Z

@conectado I understand that empty filters = allow all might be a bit confusing but this semantic was chosen because it's an opt-in feature, so the value can be empty by default. We can discuss it on standup tomorrow and change if needed.

I pushed a fix for ICMP filters.

AndrewDryga · 2024-04-24T22:50:32Z

> Or you mean for incoming traffic? I think we do only egress filtering

I was thinking that if you will use (dport, daddr, saddr) for routing packets then there are valid situations where they can be the same for multiple resources.

conectado · 2024-04-24T23:20:41Z

> > Or you mean for incoming traffic? I think we do only egress filtering
>
> I was thinking that if you will use (dport, daddr, saddr) for routing packets then there are valid situations where they can be the same for multiple resources.

(dport, daddr, saddr) would be used to distinguish incoming traffic, and since we would pick a different sport for outgoing traffic, the dport would always distinguish traffic for the same peer.

conectado · 2024-04-25T19:28:09Z

Continuing the discussion here #4779 (comment)

Let's say there are 2 overlapping CIDR resources on the client with different port filter rules
10.0.0.0/24 -> TCP/80
10.0.0.0/16 -> TCP/443

Right now the client has no information about port filtering, and the request_connection is sent based only on IP. This means that if the client tries to access 10.0.0.1:443, it picks the resource based only on IP, choosing the most specific one.

So the resource 10.0.0.0/24 would be sent to the gateway and traffic for port 443 wouldn't be allowed.

And without changes to the control protocol this can't be fixed.
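
To make the problem concrete, here is a small Rust sketch of longest-prefix resource selection on the client, followed by the port check the gateway would then apply to the single resource it received. `CidrResource`, `pick_resource`, `naive_allows` and `example_cidrs` are hypothetical names for illustration, and the sketch assumes IPv4 with `prefix_len <= 32`.

```rust
use std::net::Ipv4Addr;

/// Hypothetical CIDR resource with a TCP port allow-list.
struct CidrResource {
    network: Ipv4Addr,
    prefix_len: u8, // assumed to be in 0..=32
    allowed_tcp_ports: Vec<u16>,
}

impl CidrResource {
    /// True if `ip` falls inside this resource's CIDR block.
    fn contains(&self, ip: Ipv4Addr) -> bool {
        let mask = if self.prefix_len == 0 {
            0
        } else {
            u32::MAX << (32 - u32::from(self.prefix_len))
        };
        (u32::from(ip) & mask) == (u32::from(self.network) & mask)
    }
}

/// Client side: pick the most specific ("longest match") resource by IP only.
fn pick_resource(resources: &[CidrResource], dst: Ipv4Addr) -> Option<&CidrResource> {
    resources
        .iter()
        .filter(|r| r.contains(dst))
        .max_by_key(|r| r.prefix_len)
}

/// Naive outcome: only the picked resource's filter is ever consulted.
fn naive_allows(resources: &[CidrResource], dst: Ipv4Addr, port: u16) -> bool {
    pick_resource(resources, dst).map_or(false, |r| r.allowed_tcp_ports.contains(&port))
}

/// The two overlapping resources from the example above.
fn example_cidrs() -> Vec<CidrResource> {
    vec![
        CidrResource { network: Ipv4Addr::new(10, 0, 0, 0), prefix_len: 24, allowed_tcp_ports: vec![80] },
        CidrResource { network: Ipv4Addr::new(10, 0, 0, 0), prefix_len: 16, allowed_tcp_ports: vec![443] },
    ]
}
```

For 10.0.0.1 the /24 wins, so port 443 is rejected even though the /16 resource would permit it; an address such as 10.0.1.1 only matches the /16 and is allowed on 443.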

conectado · 2024-04-25T19:33:01Z

The first implementation will be naive and simply have this problem; it will be solved with #4789.

Ideally, we can add a warning when there are overlapping resources in the portal cc @AndrewDryga

jamilbk · 2024-04-26T16:26:00Z

@conectado Yeah actually, thinking more about it, I'm not sure I'm right about the user expectation on this one.

I think we should get it out there and get feedback on it before going down the difficult path of resolving filters across all overlapping CIDRs.

Just documenting well how this works is probably good enough for a good UX here.

AndrewDryga · 2024-04-26T16:47:07Z

@jamilbk yeah, that's why we decided not to do client filtering right away. It feels like a pretty rare edge case that we already know how to solve (add filtering on the client too). So all we have to do is to keep an issue open for it in case we need to come back and implement it later.

jamilbk · 2024-04-26T19:24:47Z

Adding note here after discussing with @conectado --

This would be required if we were to do the filtering on the Client as well, which we can save for later on, perhaps for #949. That would add another layer of security, since an attacker would need to compromise both the Gateway and Client in order to tamper with logged traffic.

This came up while working on #2030 and thinking about testing `Peer`. I'm not entirely convinced about taking both `Instant` and `DateTime<Utc>`, but unless we convert the expiration to an instant, which would bring a bunch of new problems, I don't see another way to do this.

conectado · 2024-05-03T01:57:22Z

@AndrewDryga I just realized that now we can get some filters with messages "all", does it still mean that empty filters means allow all or allow none?

jamilbk · 2024-05-03T02:10:31Z

> @AndrewDryga I just realized that now we can get some filters with messages "all", does it still mean that empty filters means allow all or allow none?

It should be permit all. Deny all is what happens when there's no policy.

conectado · 2024-05-03T14:51:56Z

> > @AndrewDryga I just realized that now we can get some filters with messages "all", does it still mean that empty filters means allow all or allow none?
>
> It should be permit all. Deny all is what happens when there's no policy.

If that's the case, I think we should remove the "all" value for filters. Having 2 ways to express the same thing only makes things more complex.

jamilbk · 2024-05-03T15:29:16Z

> > > @AndrewDryga I just realized that now we can get some filters with messages "all", does it still mean that empty filters means allow all or allow none?
> >
> > It should be permit all. Deny all is what happens when there's no policy.
>
> If that's the case, I think we should remove the "all" value for filters. Having 2 ways to express the same thing only makes things more complex.

Yeah, it might be confusing if Permit all doesn't do anything, or creates a Resource that can't be accessed if left unchecked.

Can discuss this issue at standup. I think this could be solved by making Permit all and the Filters sections a radio toggle.

This implements traffic filtering on the gateway.

Filters are set on the portal, per resource, in an allow-list manner. If no filters exist for a given resource, all packets are allowed; otherwise, only packets that match the port/protocol of the filters are allowed, and all others are dropped.

Filters can be TCP, UDP or ICMP. For the first 2, multiple ports can be given. Furthermore, multiple filters can exist for the same resource.

To be able to add and remove filters with the same IP/CIDR, we keep the whole list of filters for any given peer in an ID map. Each time something is added or removed, we simply recalculate the allowlist for each IP.

Furthermore, for any IP, all rules apply, meaning that if multiple resources cover the same IP, all of their port/protocol combinations apply to that IP.

This works well right now for DNS resources: since access is requested by DNS name, the resource for that DNS name will arrive at the gateway, and the port filtering will apply given that resource (and any other resource with the same IP).

However, since the client has no idea of the filters, it can't request resource access based on the port/protocol combination, and we are still using the most specific ("longest match") IP. This means that for overlapping CIDR resources, only the rules for the most specific one will be used, even though the gateway supports applying them all, since it will not have the other resources. This will be solved in #4789.

It can also lead to some weirdness. Let's say that you have 10.0.0.0/24 -> TCP/80 and 10.0.0.0/16 -> TCP/443 for your user. The user tries to access 10.0.0.1 and will then only be allowed port 80. At some point the user might access 10.1.0.1 and will be allowed port 443. But from that point on, the user will be allowed to access both 80 and 443 on 10.0.0.1, because the rules work correctly on the gateway; the problem is on the client side. Again, #4789 will fix this.

Left for next PRs (in tentative order!):
- #4792
- #4789

Depends on: #4773.
Resolves #2030.
Resolves #4791.

Co-authored-by: Jamil Bou Kheir <jamilbk@users.noreply.github.com>
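
The gateway behaviour described in the PR text above can be sketched roughly as follows. This is an illustration under stated assumptions, not the merged connlib code: resources are keyed by a plain `u64` ID and a single IPv4 address instead of a CIDR, and a resource with no filters makes its IP fully open even if another resource on the same IP has filters. `Filter`, `Packet`, `Allow` and `PeerFilters` are hypothetical names.

```rust
use std::collections::HashMap;
use std::net::Ipv4Addr;
use std::ops::RangeInclusive;

/// One allow-list entry as set on the portal (hypothetical shape).
#[derive(Debug, Clone, PartialEq)]
enum Filter {
    Tcp(RangeInclusive<u16>),
    Udp(RangeInclusive<u16>),
    Icmp,
}

/// The part of a packet the filter looks at.
#[derive(Debug, Clone, Copy)]
enum Packet {
    Tcp { dport: u16 },
    Udp { dport: u16 },
    Icmp,
}

/// Merged rules for one IP: everything, or only what the filters list.
#[derive(Debug, Clone)]
enum Allow {
    All,
    Only(Vec<Filter>),
}

/// Per-peer state: the full ID map plus the allowlist derived from it.
#[derive(Default)]
struct PeerFilters {
    by_resource: HashMap<u64, (Ipv4Addr, Vec<Filter>)>,
    by_ip: HashMap<Ipv4Addr, Allow>,
}

impl PeerFilters {
    fn add_resource(&mut self, id: u64, ip: Ipv4Addr, filters: Vec<Filter>) {
        self.by_resource.insert(id, (ip, filters));
        self.recalculate();
    }

    fn remove_resource(&mut self, id: u64) {
        self.by_resource.remove(&id);
        self.recalculate();
    }

    /// Rebuild the per-IP allowlist from scratch out of the ID map.
    fn recalculate(&mut self) {
        self.by_ip.clear();
        for (ip, filters) in self.by_resource.values() {
            let entry = self.by_ip.entry(*ip).or_insert_with(|| Allow::Only(Vec::new()));
            if filters.is_empty() {
                *entry = Allow::All; // empty filters = permit all
            } else if let Allow::Only(list) = entry {
                list.extend(filters.iter().cloned());
            }
        }
    }

    /// No resource for the IP means no access at all.
    fn is_allowed(&self, ip: Ipv4Addr, packet: Packet) -> bool {
        match self.by_ip.get(&ip) {
            None => false,
            Some(Allow::All) => true,
            Some(Allow::Only(list)) => list.iter().any(|f| match (f, packet) {
                (Filter::Tcp(r), Packet::Tcp { dport }) => r.contains(&dport),
                (Filter::Udp(r), Packet::Udp { dport }) => r.contains(&dport),
                (Filter::Icmp, Packet::Icmp) => true,
                _ => false,
            }),
        }
    }
}
```

Rebuilding the whole allowlist on every change is the simple option: removing a resource never has to work out which merged ports belonged to it.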

jamilbk changed the title ~~traffic filtering~~ epic: traffic filtering Aug 28, 2023

jamilbk changed the title ~~epic: traffic filtering~~ traffic filtering Sep 11, 2023

jamilbk transferred this issue from another repository Sep 12, 2023

jamilbk transferred this issue from firezone/private-to-public Sep 12, 2023

jamilbk added this to the 1.0 milestone Sep 12, 2023

jamilbk mentioned this issue Sep 25, 2023

1.0 Gateway #1769

Closed

6 tasks

jamilbk changed the title ~~traffic filtering~~ Traffic filtering Nov 10, 2023

jamilbk removed the business_value/critical Required by 100% of our customer base label Dec 26, 2023

jamilbk added the kind/feedback Issue created as a direct result of customer feedback label Feb 16, 2024

jamilbk removed this from the 1.0 GA milestone Mar 25, 2024

jamilbk assigned conectado Apr 22, 2024

jamilbk added the area/portal Portal, panel, web, control plane, you name it! label Apr 22, 2024

jamilbk assigned jamilbk and AndrewDryga Apr 22, 2024

conectado mentioned this issue Apr 24, 2024

chore(connlib): make peer pure by taking utc time from parameters #4773

Merged

AndrewDryga mentioned this issue Apr 24, 2024

fix(portal): Fix traffic filtering to send port-less rules #4778

Merged

conectado mentioned this issue Apr 24, 2024

feat(connlib): traffic filtering #4779

Merged

conectado mentioned this issue Apr 25, 2024

Send traffic filter down to clients as part of the resources #4789

Open

jamilbk added this to the 04/24 milestone Apr 29, 2024

conectado closed this as completed in #4779 May 7, 2024
