/plushcap/analysis/launchdarkly/two-tales

Two Tales

What's this blog post about?

In October, a special Test in Production session was held with Honeycomb.io where speakers shared their scariest stories of operational outages and other horrors. Eric Pollman, CTO and co-founder of ClearBrain, discussed two such experiences from his time at Google. The first incident involved Google Ads going fully offline for 90+ minutes due to a bad data push that caused servers to crash. The second story was about a "zombie haunted" pipeline that kept developers awake late into the night while processing large amounts of data. These experiences taught valuable lessons, such as checking multiple metrics and automating processes to prevent similar issues in the future.

Company
LaunchDarkly

Date published
Nov. 16, 2018

Author(s)
Kim Harrison

Word count
3066

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.