How We Minimized the Overhead of Kubernetes in our Job System
The text discusses how to improve performance when transitioning from running machines on raw VMs to Kubernetes. It outlines the steps taken by Datadog to solve a 40-50% slower job completion rate after moving their existing job system to Kubernetes. These steps included designing performance experiments, light performance tuning, and timing analysis. The text also answers key deployment questions such as what the per-pod overhead looks like. It concludes by stating that with proper configuration and optimization, transitioning from raw VMs to Kubernetes can provide significant management and scalability benefits without sacrificing performance.
Company
Datadog
Date published
Feb. 22, 2021
Author(s)
Lally Singh, Ashwin Venkatesan
Word count
2334
Hacker News points
3
Language
English