0%·4 min left
Engineering

Inside KubeCon EU 2025: Highlights and Key Trends

Sandipan Panda

Sandipan Panda

Technical Staff

May 2, 20254 min read
Inside KubeCon EU 2025: Highlights and Key Trends

April in London has never felt so electric. From the first step into the ExCeL halls to the hallway conversations, KubeCon + CloudNativeCon Europe 2025 was a whirlwind of new ideas, familiar faces, and those “aha” moments we all live for.

As a first-time in-person speaker—and at my very first conference—I arrived with equal parts excitement and nerves. Below, I unpack these experiences, the talks I attended, and the broader trends shaping our ecosystem.

My Talk: “Empowering ML Workloads with Kubeflow”#

Setting the Stage#

I co-presented with Hezhi Xie at the Kubeflow Summit on Empowering ML Workloads with Kubeflow: JAX Distributed Training and LLM Hyperparameter Optimization.

We doved into two major extensions to Kubeflow’s capabilities: Distributed JAX Training We showcased how the Kubeflow Training Operator now supports distributed training workloads using JAX on Kubernetes, enabling seamless scaling of high-performance computations. Automated LLM Hyperparameter Optimization We demoed a high-level API for tuning hyperparameters of LLMs that automates the optimization process in Kubernetes.

Talks I Attended#

Throughout the event, I curated my schedule around sessions that pushed Kubernetes, MLOps, and infrastructure efficiency forward. Here are a few highlights:

1. Techniques and Insights to Test Kubernetes Limits with Kind#

Speakers: Antonio Ojea and Katarzyna Lach (Google)

They showed how Kind can emulate large-scale clusters to root cause DNS and kube-proxy performance issues locally. Their emphasis on breaking down problems into narrow API interactions and scripting reproducible tests was eye-opening—NF tables outperformed IP tables by orders of magnitude in their benchmarks.

2. Scale Smarter, Not Harder: How Extending Cluster Autoscaler Saves Millions#

Speakers: Rahul Rangith & Ben Hinthorne (Datadog)

Datadog detailed how they extended the Cluster Autoscaler with a gRPC expander to evaluate cost, performance, and reliability when choosing instance types. The Node Group Set and Instance Score Controller patterns they showcased enabled them to save millions by optimizing bin-packing across dozens of clusters.

3. The Next Generation of DaemonSet Autoscaling#

Speakers: Adam Bernot (Google Cloud) and Bryan Boreham (Grafana Labs)

They proposed a Vertical Pod Autoscaler enhancement to tune resource requests on a per-node basis for DaemonSets. Their demo of scoped VPAs adapting CPU allocations without manual tuning underscored how Kubernetes is evolving to handle heterogeneous clusters more efficiently.

4. Hot Takes: Kubernetes “Paintainers” Bring the Heat#

Speakers: Ian Coldwater, Marly Salazar, Jeffrey Sica, Kat Cosgrove, Xander Grzywinski

This Hot Ones–style panel delivered candid insights on governance, overrated buzzwords (AI/LLMs and GitOps UX topped the list), and burnout prevention. Their honest advice on setting boundaries and advocating for well-being was a powerful reminder of the human side of open source.

5. Learning Lounge: CNCF Kubernetes Certifications#

Speaker: Chad M. Crowell (KubeSkills)

A rapid walk-through of CKA/CKS exam tips—perfect for newcomers looking to validate their Kubernetes knowledge.

Cloud-Native MLOps Maturity#

Kubeflow sessions clustered around production readiness: multi-tenant isolation, cost-aware scheduling, and full-lifecycle metadata tracking. ML in the cloud-native world is no longer experimental—it’s becoming enterprise standard.

Infrastructure Observability & Testing#

From Kind-driven chaos tests to autoscaler expanders, the community is doubling down on built-in observability and pre-production validation. “Test early, monitor everywhere” was the mantra I heard most.

Dynamic Resource Allocation (DRA)#

DRA has become a major theme this year, featured in at least ten sessions either as the primary focus or a key reference point.

Community & Culture Focus#

Panels and social events like KubeClash reinforced that human connection remains the heartbeat of open source—even in hybrid work times.

DevZero Booth & Community Vibes#

Between sessions, I spent time at the DevZero booth (S783), where our live demos on cloud cost optimization drew steady crowds. Highlights included:

  • Custom LEGO minifigs illustrated live
  • Raffles for retro computers and LEGO sets
  • Morning breakfast roundtables with Platform and DevOps leaders

Conversations at the booth made it clear: teams are actively looking for better ways to reduce cloud waste without compromising performance. That’s exactly what we’re building at DevZero—a Kubernetes cost optimization platform that automatically rightsizes CPU and memory based on real workload behavior.

From hallway chats to after-parties (KubeClash was epic!), it was a powerful reminder to always make time for meaningful connections.

Final Thoughts#

KubeCon EU 2025 was more than a conference—it was a reminder of how vibrant and fast-moving the cloud-native ecosystem is, especially at the intersection with ML. I leave London energized, and with a deeper appreciation for the community that powers open source.

Share:
Sandipan Panda

Sandipan Panda

Technical Staff

Cut Kubernetes Cost

Before You Pay a Cent.

Every feature unlocked. No hidden fees.

Start for free

Start Free

$0/ month
Kubernetes resource and cost monitoring
Up to 2 active clusters
Platform access for 45 days
Cost attribution for departments
Data export for chargeback
Audit logging