/plushcap/analysis/cloudflare/major-data-center-power-failure-again-cloudflare-code-orange-tested

Major data center power failure (again): Cloudflare Code Orange tested

What's this blog post about?

In November 2023, a major data center at Cloudflare experienced a power outage due to maintenance by the electrical grid provider, affecting its control plane and causing significant downtime for several services. The company declared Code Orange, shifting all non-critical engineering functions to focus on ensuring high reliability of their control plane. Over the next five months, teams across various departments worked to improve resilience in case of a similar failure in the future. On March 26, 2024, the same data center faced another power outage, but this time, most services were up and running within minutes due to the implemented improvements. The company continues to work on completing the resilience program for its Analytics platform and will stay focused on enhancing overall reliability.

Company
Cloudflare

Date published
April 8, 2024

Author(s)
Matthew Prince, John Graham-Cumming, Jeremy Hartman

Word count
2243

Language
English

Hacker News points
214


By Matt Makai. 2021-2024.