Blog

Kubernetes Networking Demystified: CNI Plugins, Network Policies, and Pod-to-Pod Communication

Prev in KubernetesThe Resilience of Timbernetes: A Comprehensive Analysis of In-Place Pod Vertical Scaling in Kubernetes 1.35 Next in KubernetesQuenchWorks: A Zero-CVE, Built-From-Source Replacement for the Bitnami Catalog

Blog

Kubernetes Networking Cloud Native Infrastructure

Kubernetes Networking Demystified: CNI Plugins, Network Policies, and Pod-to-Pod Communication

Mohammad Abu MattarPublished: 13 Apr, 2026Updated: 22 Jul, 202607 Mins read12 Mins listen

If you’ve spent any time with Kubernetes, you know that networking is often the part that makes people’s heads spin. It feels like magic until something breaks, and then you’re staring at a maze of virtual interfaces and iptables rules. But once you peel back the layers, it’s actually built on some very logical, albeit different, principles. Let’s break down how it all works so you can feel confident managing your cluster.

What is Kubernetes Networking and Why Does it Feel So Complex?

At its heart, Kubernetes networking is about making sure every part of your application can talk to every other part, no matter where they’re running. In the old days of running one app on one server, this was easy. But in a cluster, you have hundreds of containers starting and stopping across dozens of machines.

Kubernetes simplifies this by giving every Pod its own unique IP address. Think of a Pod like a virtual machine or a physical host on a flat network. This “IP-per-Pod” model means you don’t have to worry about port conflicts or managing complex mappings. If two different Pods both want to listen on port 80, they can, because they have different IP addresses.

There are three big rules that every Kubernetes network must follow:

All Pods can talk to all other Pods without using Network Address Translation (NAT).
Nodes can talk to all Pods on that node (and vice-versa) without NAT.
A Pod sees its own IP as exactly the same IP that other Pods see for it.

These rules create a predictable environment where your apps can just use standard networking. The complexity usually comes from how we implement these rules, especially when we start moving packets across different physical servers.

How Does the Container Network Interface (CNI) Connect Your Cluster?

Kubernetes doesn’t actually handle the networking itself. Instead, it uses something called the Container Network Interface, or CNI. You can think of CNI as a standardized plug-in system. When the cluster starts a Pod, it calls the CNI plugin and says, “Hey, I need a network for this Pod. Can you set it up?”.

The CNI plugin does two main jobs:

Connectivity: It creates the virtual “wires” (interfaces) and connects the Pod to the rest of the network.
IP Management (IPAM): It picks a unique IP address from a pool and assigns it to the Pod.

When the kubelet schedules a pod, it calls the CNI plugin, which assigns the pod an IP address and wires up the virtual interfaces (a veth pair plus bridge and routes) on the node so the pod can reach the rest of the network.

You can see which CNI you’re using by checking the configuration files on your nodes or looking at the pods running in your kube-system namespace.

# A quick way to see which CNI pods are running in your cluster
kubectl get pods -n kube-system | grep -E "calico|cilium|flannel"

How Do You Choose Between Calico, Cilium, and Flannel?

Choosing a CNI is a big decision because it affects your cluster’s speed, security, and how much “work” your nodes have to do. Here’s a quick look at the big three:

Flannel is the simple overlay for small clusters but has no network policy. Calico is the routed, policy-rich industry standard. Cilium uses eBPF for high performance, L7 policy, and deep observability, at the cost of needing a recent kernel.

Flannel: The Simple Option

Flannel is the “old reliable” of the group. It’s very easy to set up and uses an overlay network (usually VXLAN) to wrap your traffic and send it between nodes. It’s great for small clusters or development environments where you don’t need fancy security rules. The downside? It doesn’t support Network Policies natively, so you can’t easily block traffic between Pods.

Calico: The Industry Standard

Calico is incredibly versatile. It can run as an overlay, or it can use BGP to route traffic natively without the extra “wrapping” (encapsulation), which makes it very fast. It’s famous for its powerful security engine that handles Network Policies perfectly.

Cilium: The Performance Powerhouse

Cilium is the new favorite for high-performance clusters. It uses a Linux kernel technology called eBPF to handle packets. By working directly in the kernel, it skips a lot of the traditional networking overhead. It also gives you amazing visibility into what’s happening on your network through a tool called Hubble.

Feature	Flannel	Calico	Cilium
Ease of Use	Very High	Moderate	Moderate
Security Policies	None (Native)	Excellent	Advanced (L7)
Performance	Good	Very High	Elite
Resource Cost	Very Low	Moderate	Higher

How Does Pod-to-Pod Communication Actually Work?

Let’s look at how a packet gets from Pod A to Pod B. Inside the Pod, there’s a virtual interface called eth0. This is connected to a “virtual cable” (a veth pair) that leads out to a bridge on the host node.

If Pod B is on the same node, the bridge just passes the packet over. But if Pod B is on a different node, things get interesting. Depending on your CNI, the packet might be:

The packet leaves Pod A through its veth pair to the node bridge, out the node NIC, across the network (VXLAN overlay or plain routing depending on the CNI), and back up the same path on Node B. Crucially, no NAT happens, so pods see each other's real IPs.

Encapsulated: Wrapped in a new packet (like an envelope) to be sent over the physical network.
Routed: Sent directly if the physical network knows where the Pod IPs are.

You can check your Pod’s internal networking setup with a simple command:

# Look at the IP and routes inside a running pod
kubectl exec <pod-name> -- ip addr show eth0
kubectl exec <pod-name> -- ip route show

How Can You Use Network Policies for Security?

By default, every Pod in a cluster can talk to every other Pod. In a production environment, that’s a bit like leaving all your office doors unlocked. NetworkPolicy resources are how you lock those doors.

A Network Policy is basically a set of rules that says “only Pod A can talk to Pod B on port 80”. A key thing to remember is that these policies are “allow-lists.” Once you apply a policy to a Pod, everything else is blocked by default.

With a default-deny policy in place, only the connections you explicitly allow get through: frontend to api, and api to db. Everything else, including other pods reaching the api or the frontend talking straight to the db, is blocked.

I always recommend starting with a “default-deny” policy. This ensures that no traffic moves unless you specifically say it’s okay.

1
# A "Default Deny" policy that blocks all incoming and outgoing traffic in a namespace
2
apiVersion: networking.k8s.io/v1
3
kind: NetworkPolicy
4
metadata:
5
  name: default-deny
6
  namespace: my-app
7
spec:
8
  podSelector: {} # This empty selector matches ALL pods in the namespace
9
  policyTypes:
10
    - Ingress
11
    - Egress

Once you’ve locked everything down, you can add specific “allow” rules for your services.

1
# This policy allows the 'frontend' to talk to the 'backend' on port 5432
2
apiVersion: networking.k8s.io/v1
3
kind: NetworkPolicy
4
metadata:
5
  name: allow-frontend-to-backend
6
  namespace: my-app
7
spec:
8
  podSelector:
9
    matchLabels:
10
      app: backend # The pods we are protecting
11
  ingress:
12
    - from:
13
        - podSelector:
14
            matchLabels:
15
              app: frontend # The only pods allowed to talk to them
16
      ports:
17
        - protocol: TCP
18
          port: 5432

What’s the Best Way to Handle Multi-Tenancy?

If you’re sharing a cluster between different teams (multi-tenancy), you need to make sure they don’t step on each other’s toes.

Namespaces are your first line of defense. They give you a way to logically group resources. You should combine these with Resource Quotas so one team doesn’t accidentally use up all the CPU or memory in the cluster.

For “hard” isolation like if you’re running code for different customers you might want to use node isolation. This ensures that sensitive workloads don’t even share the same physical server as other tenants.

How Do You Debug Connectivity Issues When Things Go Wrong?

When a connection fails, don’t just guess. Use a systematic approach. Start with the Pod and work your way out.

Check the Pods: Are they running? Do they have IPs?

kubectl get pods -o wide

Check Endpoints: If you’re using a Service, are there actually Pods behind it?

kubectl get endpoints <service-name>

Use a Debug Pod: I love using netshoot. It’s a Pod packed with every networking tool you’ll ever need.

# Spin up a temporary debug pod to test connections
kubectl run debug-pod --rm -it --image=nicolaka/netshoot -- /bin/bash

# Inside the pod, you can test DNS and connectivity
nslookup my-service
curl -v http://my-service:8080

One common “gotcha” is the MTU (Maximum Transmission Unit). If your network uses an overlay (like VXLAN), it adds some extra data to every packet. If the total size goes over 1500 bytes, the packet might get dropped. If your small requests work but large file uploads fail, you’re probably hitting an MTU issue.

FAQ: Frequently Asked Questions on Kubernetes Networking

Why can't my Pod resolve a service name?

This is usually a DNS issue. First, check if your CoreDNS pods are actually running: kubectl get pods -n kube-system -l k8s-app=kube-dns. If they are, check their logs for errors.

What is the difference between Pod-to-Pod and Pod-to-Service communication?

Pod-to-Pod is direct. But Pods are ephemeral and their IPs change. Pod-to-Service uses a stable “ClusterIP” that stays the same, even if the underlying Pods are replaced.

Do I really need a Service Mesh?

A Service Mesh like Istio or Linkerd is great for things like automatic encryption (mTLS) and advanced routing (like canary deployments). But they add complexity. If you just need basic connectivity, a good CNI and some Network Policies are often enough .

Can I run a cluster without a CNI?

Nope. Kubernetes provides the model, but it needs a CNI plugin to actually do the work of connecting things.

How do I block Pods from reaching the internet?

You can use an Egress Network Policy. By specifying which internal CIDR ranges are allowed and leaving out everything else, you effectively block external access.

Pro-Tips for Production Networking

To wrap things up, here are a few best practices for your production clusters:

Plan your IP space: Make sure your Pod and Service CIDR ranges are big enough for future growth.
Use Default-Deny: It’s much safer to explicitly allow traffic than to try and block everything bad.
Monitor Latency: Tools like Cilium’s Hubble can show you exactly where packets are slowing down, which is a lifesaver when debugging performance issues.

Networking in Kubernetes is a deep topic, but if you focus on the basics of Pod IPs, CNI plugins, and Network Policies, you’ll be well on your way to mastering it.