Company
Date Published
Sept. 6, 2024
Author
Alexandre Bouchard
Word count
595
Language
English
Hacker News points
None

Summary

Hookdeck experienced a significant event delivery delay on August 28th and 29th due to their database vendor's storage auto-scaling feature. This incident led to severe consequences for many customers, resulting in the need for significant changes to maintain trust. The root cause was identified as the provisioning of slower disks instead of high-performance ones. No data was lost during the incident. Hookdeck is revamping its infrastructure and has decided to credit all customers based on their SLA policy. They will also make several changes, including updating their SLA to offer higher credit compensation, making real-time p99 latency public, working with the database vendor for improvements, and improving communication during incidents.