/plushcap/analysis/datadog/engineering-moving-a-jobsystem-to-kubernetes

How We Minimized the Overhead of Kubernetes in our Job System

What's this blog post about?

The text discusses how to improve performance when transitioning from running machines on raw VMs to Kubernetes. It outlines the steps taken by Datadog to solve a 40-50% slower job completion rate after moving their existing job system to Kubernetes. These steps included designing performance experiments, light performance tuning, and timing analysis. The text also answers key deployment questions such as what the per-pod overhead looks like. It concludes by stating that with proper configuration and optimization, transitioning from raw VMs to Kubernetes can provide significant management and scalability benefits without sacrificing performance.

Company
Datadog

Date published
Feb. 22, 2021

Author(s)
Lally Singh, Ashwin Venkatesan

Word count
2334

Hacker News points
3

Language
English


By Matt Makai. 2021-2024.