Linux Kernel WireGuard tuning

This technical guide addresses requirements regarding WireGuard VPN latency, throughput, resilience, and scalability by tuning Linux Kernel parameters (as WireGuard is part of Linux Kernel).

Introduction

WireGuard is widely praised for its lean codebase and efficiency. However, the default Linux kernel settings are often tuned for general-purpose computing, not for acting as a high-speed router handling encrypted UDP traffic at scale.

Here are some tuning parameters to achieve maximum performance (low latency), stability across changing networks (roaming), and high concurrency, we must tune three distinct layers.

Kernel sysctl settings optimize how the Linux kernel schedules packets and manages memory buffers. Add the following to /etc/sysctl.d/99-wireguard-tuning.conf. or /etc/sysctl.conf

Kernel tuning

Congestion Control & Queuing (Latency & Throughput)

To reduce bufferbloat (latency spikes under load) and maximize throughput, we replace the default CUBIC algorithm with BBR (Bottleneck Bandwidth and Round-trip propagation time), which is less sensitive to packet loss and more aggressively seeks the optimal congestion window.

# Use BBR congestion control
net.core.default_qdisc = fq
net.ipv4.tcp_congestion_control = bbr

Memory & Buffers (Throughput)

WireGuard uses UDP for data transport. By default, Linux kernel UDP buffer sizes are often too small for high-speed transfers (1 Gbps+), causing packets to be dropped in the kernel before WireGuard can process them.

# Increase default and max receive/send window sizes (approx 16MB)
net.core.rmem_max = 16777216
net.core.wmem_max = 16777216
net.core.rmem_default = 262144
net.core.wmem_default = 262144
net.ipv4.udp_mem = 4096 87380 16777216

Packet Processing & Forwarding (Efficiency)

These settings allow the kernel to process packets faster and handle bursts of traffic without dropping them.

# Enable IP Forwarding (Required for VPN)
net.ipv4.ip_forward = 1

# Increase the maximum length of the processor input queue
# (Prevents drops during traffic bursts)
net.core.netdev_max_backlog = 5000

# Increase the maximum number of connections waiting for acceptance
net.core.somaxconn = 8192

Packet buffering

In Linux, network cards (NICs) use NAPI (New API) polling to handle incoming packets. When an interrupt fires, the kernel disables further interrupts and polls the NIC, processing packets in batches.

net.core.netdev_budget limits how many packets the kernel may process in a single SoftIRQ cycle before yielding the CPU and it's default is: 300 packets.

Too low value - Under heavy load (e.g., 100+ streaming users), the kernel yields too early, packets back up in the NIC buffer, and drops occur.
Too high value - The networking stack can monopolize a CPU core, starving userspace processes and increasing overall latency.

For high-performance VPN servers, we increase netdev_budget to favor network throughput and tune the companion setting netdev_budget_usecs to cap CPU time per polling cycle. Below you wil find some recommended values for multiple scenarios.

Home/Small Office

Meaning around ~20 users: The default 300 is fine and changing it won't be noticeable.

50 VPN users and above

# --- NAPI Polling Budget Tuning ---

# Increase packet budget (Default: 300).
# Allow the CPU to process up to 600 packets in one cycle.
# Beneficial for high PPS (packets per second) environments.
net.core.netdev_budget = 600

# Increase the time budget (Default: 2000us or 2ms).
# Allow the NAPI cycle to run for up to 4ms before yielding.
# Prevents the loop from aborting prematurely during heavy traffic bursts.
net.core.netdev_budget_usecs = 4000

High throughput ≥ 10Gbps

You may need values as high as netdev_budget = 1200, assuming you have a powerful CPU with Receive Packet Steering (RPS) enabled.

Multiple connection concurrency (egress via VPN)

WireGuard is stateless, but the Linux firewall tracking (used for Masquerade or DNAT when configuring connection tracking and egress through VPN) is stateful.

Here is a formula how to optimize netfliter parameters based on the following assumptions that one connected device is a UDP stream (VPN) and multiple TCP streams that the user/device "uses" for browsing/apps "exiting" via VPN:

Assumptation: 1 active user generates ~50-100 simultaneous connections

Parameter (Sysctl)

Description

10 Devices(Home/SOHO)

100 Devices(SMB/Office)

1,000 Devices(Enterprise/ISP)

10,000 Devices(Data Center)

net.netfilter.nf_conntrack_max

CRITICAL. Max concurrent connections tracked.

65536

131072

524288

5242880

net.core.somaxconn

Max pending connections in queue.

4096

16384

65535

net.core.netdev_max_backlog

Max packets queued if kernel is busy.

1000

5000

16384

65535

net.core.netdev_budget

Max packets processed in one CPU cycle.

300

600

1200

net.core.rmem_max (Bytes)

Max OS receive buffer size (UDP).

16 MB

32 MB

128 MB

net.core.wmem_max (Bytes)

Max OS send buffer size (UDP).

16 MB

32 MB

128 MB

fs.file-max

System-wide file descriptor limit.

Default

100000

1000000

5000000

Required System RAM

Minimum RAM needed for state tables.

512 MB

1 GB

4 GB

32 GB+

PreviousProduction deployment verification guide NextDeployment automation

Last updated 1 month ago

Was this helpful?

Good morning

hashtagIntroduction

hashtagKernel tuning

hashtagCongestion Control & Queuing (Latency & Throughput)

hashtagMemory & Buffers (Throughput)

hashtagPacket Processing & Forwarding (Efficiency)

hashtagPacket buffering

hashtagHome/Small Office

hashtag50 VPN users and above

hashtagHigh throughput ≥ 10Gbps

hashtagMultiple connection concurrency (egress via VPN)

Introduction

Kernel tuning

Congestion Control & Queuing (Latency & Throughput)

Memory & Buffers (Throughput)

Packet Processing & Forwarding (Efficiency)

Packet buffering

Home/Small Office

50 VPN users and above

High throughput ≥ 10Gbps

Multiple connection concurrency (egress via VPN)