/plushcap/analysis/workos/from-four-to-five-9s-of-uptime-by-migrating-to-kubernetes

From four to five 9s of uptime by migrating to Kubernetes

What's this blog post about?

The text discusses the migration of User Management from Heroku to Kubernetes on EKS, along with the development of a custom platform called Terrace. The challenges faced while using Heroku included lack of an SLA, limited rollout functionality, and inadequate data locality options. To address these issues, the team chose Kubernetes on EKS and built Terrace to streamline deployment, secret management, and automated load balancing. The migration resulted in significant improvements in uptime, from four nines to consistently achieving five nines over 7 and 30-day periods across all services. Future plans for Terrace include implementing deployments to different data localities and experimenting with blue/green and canary deployment strategies.

Company
WorkOS

Date published
June 27, 2024

Author(s)
Matheus Lichtnow

Word count
1738

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.