Analyze dropped packets

red-pepper-pesto · December 26, 2024, 1:48am

I am analysing why my WireGuard connection does not work. I suspect it has something to do with MTU.

ping 1.1.1.1 works but ping -s 1500 1.1.1.1 does not. I was able to narrow it down to a dropped packet in the sys-net VM.

My setup is sys-vpn → sys-firewall (vif38) → sys-net (ens6) → router (also has a VPN) → VPN server. The VPN server that sys-vpn connects to has the IP 89.46.223.58. The tcpdump output below is from sys-net.

The issue is that sys-net does not pass the packet with ID 57174, received from the VPN server, on to sys-firewall.

tcpdump from sys-net

11:02:34.299577 vif38.0 In  IP (tos 0x0, ttl 63, id 10078, offset 0, flags [none], proto UDP (17), length 1392)
    <sys-firewall ip>.33597 > 89.46.223.58.51820: UDP, length 1364
11:02:34.299622 ens6  Out IP (tos 0x0, ttl 62, id 10078, offset 0, flags [none], proto UDP (17), length 1392)
    <sys-net ip>.33597 > 89.46.223.58.51820: UDP, length 1364
11:02:34.299627 vif38.0 In  IP (tos 0x0, ttl 63, id 10079, offset 0, flags [none], proto UDP (17), length 284)
    <sys-firewall ip>.33597 > 89.46.223.58.51820: UDP, length 256
11:02:34.300004 ens6  Out IP (tos 0x0, ttl 62, id 10079, offset 0, flags [none], proto UDP (17), length 284)
    <sys-net ip>.33597 > 89.46.223.58.51820: UDP, length 256
11:02:34.402363 ens6  In  IP (tos 0x0, ttl 50, id 57174, offset 0, flags [+], proto UDP (17), length 1364)
    89.46.223.58.51820 > <sys-net ip>.33597: UDP, length 1408
11:02:34.402415 ens6  In  IP (tos 0x0, ttl 50, id 57174, offset 1344, flags [none], proto UDP (17), length 92)
    89.46.223.58 > <sys-net ip>: ip-proto-17
11:02:34.402478 ens6  In  IP (tos 0x0, ttl 50, id 57175, offset 0, flags [none], proto UDP (17), length 252)
    89.46.223.58.51820 > <sys-net ip>.33597: UDP, length 224
11:02:34.402501 vif38.0 Out IP (tos 0x0, ttl 49, id 57175, offset 0, flags [none], proto UDP (17), length 252)
    89.46.223.58.51820 > <sys-firewall ip>.33597: UDP, length 224
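
The two ens6 In lines for ID 57174 are fragments of a single datagram, and their sizes can be cross-checked with a little arithmetic. The sketch below is only a sanity check on the numbers in the capture. It assumes 20-byte IPv4 headers without options, and the helper name reassembled_payload is made up for illustration.

```python
# Fragment fields copied from the tcpdump lines for IP ID 57174.
IP_HEADER = 20   # bytes; assumes an IPv4 header with no options
UDP_HEADER = 8   # bytes

# (fragment offset in bytes, IP total length, more-fragments flag)
fragments = [
    (0, 1364, True),     # flags [+]
    (1344, 92, False),   # flags [none]
]


def reassembled_payload(frags):
    """Return the IP payload size after joining the fragments,
    checking that they are contiguous."""
    expected_offset = 0
    for offset, total_length, more in sorted(frags):
        if offset != expected_offset:
            raise ValueError(f"gap or overlap at offset {offset}")
        payload = total_length - IP_HEADER
        if more and payload % 8 != 0:
            raise ValueError("non-final fragment payload must be a multiple of 8")
        expected_offset += payload
    return expected_offset


ip_payload = reassembled_payload(fragments)   # UDP header + UDP data
udp_payload = ip_payload - UDP_HEADER         # matches "UDP, length 1408"
full_datagram = ip_payload + IP_HEADER        # size before fragmentation

print(ip_payload, udp_payload, full_datagram)
```

The fragments add up to the 1408-byte UDP payload that tcpdump reports, so the server sent a 1436-byte datagram that was split on the way back. If the fragmenting hop filled the first fragment as far as it could, its outgoing MTU would be somewhere between 1364 and 1371 bytes. That is an inference from the capture, not something confirmed in this thread.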

nft input chain rules

        chain input {
                type filter hook input priority filter; policy drop;
                jump custom-input
                ct state invalid counter packets 11 bytes 15796 drop
                iifgroup 2 udp dport 68 counter packets 0 bytes 0 drop
                ct state established,related accept
                iifgroup 2 meta l4proto icmp accept
                iif "lo" accept
                iifgroup 2 counter packets 0 bytes 0 reject with icmp host-prohibited
                counter packets 10 bytes 840
        }

Any idea what’s the problem here?

DVM · December 26, 2024, 9:33am

Since you are going through 2 VPN tunnels, you should try lowering the MTU in sys-vpn to accommodate the new encapsulation happening on the router.
If the router is also using WG, try lowering your current sys-vpn MTU value by 60 based on the value set on your router.
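
The figure of 60 is the per-packet overhead WireGuard adds over IPv4. A rough sketch of that arithmetic follows. ROUTER_WG_MTU is a hypothetical value (the router's real setting is not given in this thread), and the function names are illustrative only.

```python
# Per-packet overhead WireGuard adds when the outer transport is IPv4.
OUTER_IPV4 = 20       # outer IPv4 header, no options
OUTER_UDP = 8         # outer UDP header
WG_DATA_HEADER = 16   # type + reserved + receiver index + counter
WG_AUTH_TAG = 16      # Poly1305 authentication tag
WG_OVERHEAD_V4 = OUTER_IPV4 + OUTER_UDP + WG_DATA_HEADER + WG_AUTH_TAG

ICMP_ECHO_OVERHEAD = 20 + 8   # inner IPv4 header + ICMP echo header


def inner_mtu(outer_mtu, layers=1):
    """MTU left for the inner interface after `layers` WireGuard encapsulations."""
    return outer_mtu - layers * WG_OVERHEAD_V4


def max_unfragmented_ping(mtu):
    """Largest `ping -s` payload that fits in one packet of size `mtu`."""
    return mtu - ICMP_ECHO_OVERHEAD


def ping_packet_size(payload):
    """IP packet size produced by `ping -s payload` over IPv4."""
    return payload + ICMP_ECHO_OVERHEAD


ROUTER_WG_MTU = 1420  # hypothetical: substitute the value set on the router

sys_vpn_wg_mtu = inner_mtu(ROUTER_WG_MTU)
print("sys-vpn WireGuard MTU:", sys_vpn_wg_mtu)
print("largest ping -s without fragmentation:", max_unfragmented_ping(sys_vpn_wg_mtu))
print("IP size of ping -s 1500:", ping_packet_size(1500))
```

Note that ping -s 1500 produces a 1528-byte IP packet, which is larger than even a plain 1500-byte Ethernet MTU. That test therefore always exercises fragmentation, whatever MTU is configured.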

red-pepper-pesto · December 27, 2024, 10:23am

Changing MTU does not fix the issue. I went as low as setting the sys-vpn MTU to 1280 and the wireguard interface in sys-vpn to 1220. The issue seems to be that the packet is dropped in sys-net by the nft rule ct state invalid drop.

On further analysis, it looks like when the UDP packet is fragmented (like ID 57174 in my OG post), conntrack rejects it because of a bad checksum. I was able to enable conntrack logging and see something like sys-net kernel: nf_ct_proto_17: bad checksum IN=ens6....

Now I am trying to figure out why the UDP packet has a bad checksum.

red-pepper-pesto · December 27, 2024, 12:45pm

I am convinced that I should disable UDP checksum validation, at least for fragmented packets. It looks like the checksum is calculated using the source and destination addresses and is recalculated at every hop. For example, sys-net recalculates the UDP checksum of a packet it received on ens6 before passing it on to sys-firewall.
In my case, the packet got fragmented before the WireGuard encryption on the router. So the checksum was calculated using the source and destination addresses before fragmentation and was not updated until the packet was reassembled in sys-net.
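
For reference, here is a minimal sketch of the UDP checksum as RFC 768 defines it for IPv4. It illustrates two points. The checksum covers a pseudo-header containing the source and destination addresses, so a hop that rewrites addresses (sys-net does NAT, as the tcpdump shows) has to update it, while a hop that only forwards does not. It also covers the whole UDP datagram, so a receiver can only validate it once all fragments are reassembled. The addresses, ports and payload below are made-up documentation values, not taken from the capture.

```python
import ipaddress
import struct


def ones_complement_sum(data: bytes) -> int:
    """16-bit ones' complement sum with end-around carry."""
    if len(data) % 2:
        data += b"\x00"
    total = sum(word for (word,) in struct.iter_unpack("!H", data))
    while total >> 16:
        total = (total & 0xFFFF) + (total >> 16)
    return total


def udp_segment(src, dst, sport, dport, payload, checksum):
    """Pseudo-header + UDP header + payload, as fed to the checksum."""
    length = 8 + len(payload)
    pseudo = (ipaddress.IPv4Address(src).packed
              + ipaddress.IPv4Address(dst).packed
              + struct.pack("!BBH", 0, 17, length))
    header = struct.pack("!HHHH", sport, dport, length, checksum)
    return pseudo + header + payload


def udp_checksum(src, dst, sport, dport, payload: bytes) -> int:
    csum = ~ones_complement_sum(
        udp_segment(src, dst, sport, dport, payload, 0)) & 0xFFFF
    return csum or 0xFFFF  # 0 on the wire means "no checksum computed"


def udp_checksum_ok(src, dst, sport, dport, payload, checksum) -> bool:
    return ones_complement_sum(
        udp_segment(src, dst, sport, dport, payload, checksum)) == 0xFFFF


# Hypothetical values for illustration only.
payload = b"example wireguard ciphertext"
sent = udp_checksum("192.0.2.1", "198.51.100.7", 51820, 33597, payload)

# Valid for the addresses it was computed with ...
print(udp_checksum_ok("192.0.2.1", "198.51.100.7", 51820, 33597, payload, sent))
# ... but not after NAT changes the destination without updating it.
print(udp_checksum_ok("192.0.2.1", "10.137.0.5", 51820, 33597, payload, sent))
```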

Now I am trying to figure out whether I have to disable checksum validation for all UDP packets, or whether I can do it for a more specific subset such as wireguard packets, fragmented UDP packets, or both.

Edit: My assumption that the fragmentation of the UDP packet happened before the router's wireguard VPN might be wrong. It is quite possible that the fragmentation happened in the router. In the latter case, the checksum should be correct. But I’m still inclined towards disabling the UDP checksum verification.

Edit 2: Disabling checksum validation did not fix the issue. The packet is no longer dropped by the ct state invalid nft rule, but it is still not sent downstream to sys-firewall. I am still looking for answers on why the packet is not sent to sys-firewall.

red-pepper-pesto · December 30, 2024, 1:42pm

For anyone interested, I used the rule below in sys-net to disable checksum validation for UDP packets. A better iteration of the rule would disable checksum validation only for fragmented UDP packets.

nft add rule ip qubes prerouting iifgroup != 2 iifname != lo counter udp checksum set 0
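
Some background on the rule above: in IPv4, a UDP checksum field of 0 means the sender did not compute a checksum (RFC 768), which is why zeroing it bypasses validation. Narrowing the rule to fragments would mean looking at the flags/fragment-offset field in bytes 6-7 of the IPv4 header. The sketch below shows that bit test in Python. nftables exposes the same field as ip frag-off, but whether a given chain still sees individual fragments depends on where it runs relative to conntrack defragmentation, which has not been verified here. The helper names are illustrative.

```python
import struct

MF_FLAG = 0x2000       # "more fragments" bit
OFFSET_MASK = 0x1FFF   # fragment offset, in units of 8 bytes


def fragment_info(ipv4_header: bytes):
    """Return (is_fragment, more_fragments, offset_in_bytes) for an IPv4 header."""
    (field,) = struct.unpack_from("!H", ipv4_header, 6)
    more = bool(field & MF_FLAG)
    offset_bytes = (field & OFFSET_MASK) * 8
    return (more or offset_bytes > 0), more, offset_bytes


def header_with(flags_and_offset: int) -> bytes:
    """Build a dummy 20-byte IPv4 header with only bytes 6-7 populated."""
    return bytes(6) + struct.pack("!H", flags_and_offset) + bytes(12)


# The two fragments of ID 57174 from the first post, then an unfragmented packet.
first = fragment_info(header_with(MF_FLAG))      # flags [+], offset 0
second = fragment_info(header_with(1344 // 8))   # flags [none], offset 1344
whole = fragment_info(header_with(0x0000))       # flags [none], offset 0

print(first, second, whole)
```

A packet counts as a fragment if either the more-fragments bit is set or the offset is non-zero. Testing only one of the two would miss either the last or the first fragment.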

Now the fragmented wireguard packets are reaching sys-vpn. In the tcpdump output, I can see the fragmented packets on the eth0 interface. However, I do not see the packets decrypted on the wireguard interface. Either the fragmented packet is not routed to the wireguard interface, or the wireguard interface receives the packet and does not decrypt it. Below are two nft traces: the first is a non-fragmented packet that gets decrypted, and the second is the fragmented packet that does not get decrypted. Any input on how to debug routing decisions?

trace id e252ba78 ip qubes input packet: iif "eth0" ether saddr fe:ff:ff:ff:ff:ff ether daddr <mac> ip saddr 185.76.9.57 ip daddr <eth0 ip> ip dscp cs0 ip ecn not-ect ip ttl 48 ip id 56194 ip length 156 udp sport 51820 udp dport 39148 udp length 136 @th,64,96 0x4000000ce8eb3c500000000
trace id e252ba78 ip qubes input rule jump custom-input (verdict jump custom-input)
trace id e252ba78 ip qubes input rule ct state established,related accept (verdict accept)
trace id ca565c9f ip qubes trace_preraw packet: iif "wg-se-sto" ip saddr 1.1.1.1 ip daddr <wireguard interface ip> ip dscp cs0 ip ecn not-ect ip ttl 54 ip id 14331 ip length 84 icmp type echo-reply icmp code net-unreachable icmp id 795 icmp sequence 1 @th,64,96 0x32c5716700000000aef00700

trace id 23bab5c6 ip qubes input packet: iif "eth0" ether saddr fe:ff:ff:ff:ff:ff ether daddr <mac> ip saddr 185.76.9.57 ip daddr <eth0 ip> ip dscp cs0 ip ecn not-ect ip ttl 48 ip id 62230 ip length 1372 udp sport 51820 udp dport 39148 udp length 1352 @th,64,96 0x4000000ce8eb3c503000000
trace id 23bab5c6 ip qubes input rule jump custom-input (verdict jump custom-input)
trace id 23bab5c6 ip qubes input rule ct state established,related accept (verdict accept)
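
One thing that can be read out of the traces: @th,64,96 is the 96 bits that follow the 8-byte UDP header, i.e. the start of the WireGuard message. The sketch below decodes those bytes according to the WireGuard transport-data layout (1-byte type, 3 reserved bytes, 4-byte little-endian receiver index, 8-byte little-endian counter, of which only the low 4 bytes are visible here). The function name is made up for illustration, and nft prints the value without leading zeros, so it is padded back to 12 bytes.

```python
import struct


def decode_wg_prefix(th_hex: str):
    """Decode the first 12 bytes of a WireGuard message from nft's @th,64,96 value.

    Returns (message_type, receiver_index, low 32 bits of the counter).
    """
    raw = int(th_hex, 16).to_bytes(12, "big")  # restores dropped leading zeros
    msg_type, receiver, counter_low = struct.unpack("<B3xII", raw)
    return msg_type, receiver, counter_low


# Values copied from the two traces above.
decrypted = decode_wg_prefix("0x4000000ce8eb3c500000000")
not_decrypted = decode_wg_prefix("0x4000000ce8eb3c503000000")

for name, (msg_type, receiver, counter_low) in (
    ("decrypted", decrypted),
    ("not decrypted", not_decrypted),
):
    print(f"{name}: type={msg_type} receiver=0x{receiver:08x} counter_low={counter_low}")
```

Both packets decode as message type 4 (transport data) with the same receiver index, so both are addressed to the same WireGuard session and both are accepted by the input chain. This only shows that the header is well-formed when it reaches the input hook. It does not show whether the payload arrived intact or why decryption does not happen.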