How Medium monitors DynamoDB performance

What's this blog post about?

Medium, a popular blogging platform, uses Amazon DynamoDB as one of its primary data stores because it scales quickly with demand. Monitoring DynamoDB performance is crucial for maintaining a good user experience. Medium's main challenge is throttling, which can cause high latency and user-facing errors. To tackle it, Medium uses Datadog to track capacity usage across each whole table and compare it with that table's provisioned capacity. They also calculate the number of partitions per table and estimate the throughput limit of each partition, since a single partition can be throttled even when the table as a whole is under its limit. By tracking hot keys and throttling events, Medium can increase capacity when needed. They also use Redis as a cache in front of DynamoDB, which reduces how often throttling occurs and lowers costs. Proper alerting is essential for resolving issues quickly, so Medium uses Datadog's alerting features to notify the right people through the appropriate channels with relevant messages. Monitoring also shows Medium when they can scale down provisioned throughput, which reduces infrastructure expenses.
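The partition estimate mentioned above can be illustrated with a minimal sketch. It assumes the rule of thumb from AWS's historical DynamoDB guidance (roughly 3,000 read capacity units, 1,000 write capacity units, and 10 GB per partition); these constants, and the function names `estimate_partitions` and `per_partition_limits`, are illustrative assumptions rather than details taken from the post, and current DynamoDB behavior may differ.

```python
import math

# Assumed per-partition ceilings from AWS's historical guidance;
# they are not stated in the summarized post.
MAX_RCU_PER_PARTITION = 3000
MAX_WCU_PER_PARTITION = 1000
MAX_GB_PER_PARTITION = 10


def estimate_partitions(read_capacity, write_capacity, table_size_gb):
    """Estimate how many partitions a table has (hypothetical helper)."""
    # Partitions needed to serve the provisioned throughput.
    by_throughput = math.ceil(
        read_capacity / MAX_RCU_PER_PARTITION
        + write_capacity / MAX_WCU_PER_PARTITION
    )
    # Partitions needed to hold the stored data.
    by_size = math.ceil(table_size_gb / MAX_GB_PER_PARTITION)
    # A table always has at least one partition.
    return max(by_throughput, by_size, 1)


def per_partition_limits(read_capacity, write_capacity, table_size_gb):
    """Split provisioned capacity evenly across the estimated partitions."""
    partitions = estimate_partitions(read_capacity, write_capacity, table_size_gb)
    return read_capacity / partitions, write_capacity / partitions


if __name__ == "__main__":
    # Example figures are made up for illustration.
    reads, writes = per_partition_limits(6000, 1000, 50)
    print(f"Estimated per-partition limit: {reads} RCU, {writes} WCU")
```

Under these assumptions, a table provisioned for 6,000 reads per second but spread over five partitions can serve only about 1,200 reads per second for keys in any one partition, which is why a hot key can trigger throttling well below the table-level limit.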

Company
Datadog

Date published
Sept. 15, 2015

Author(s)
Jean-Mathieu Saponaro

Word count
1201

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.