Chaos Engineering at Datadog
In August, Datadog hosted a Test in Production Meetup at the Meetup headquarters in NYC where Corey Bertram discussed how Datadog does chaos engineering. He shared his experiences from when he led the SRE team at Netflix and how that has influenced the way Datadog put process around chaos engineering experiments. The company is moving from a world of game days to experiments, being explicit about what these experiments are, and focusing on safety. They have a hypothesis, method, and rollback for their tests and use tools like Spinnaker and Chaos Toolkit. Datadog's SRE team consists of 15 members globally who focus on building tools that empower other teams to do their job better rather than doing it for them.
Company
LaunchDarkly
Date published
Sept. 20, 2019
Author(s)
Kim Harrison
Word count
4739
Language
English
Hacker News points
None found.