Company
Date Published
Author
Kunju Perath
Word count
1552
Language
English
Hacker News points
None

Summary

DigitalOcean, a cloud infrastructure company, leverages Prometheus and Alertmanager for whitebox monitoring of its services and container clusters. Observability is crucial, comprising logging, metrics, and tracing. The company uses four golden signals (latency, saturation, traffic, and error) to monitor request-based microservices, while also utilizing the USE method (utilization, saturation, and errors) to monitor Kubernetes clusters. This monitoring setup enables the identification of long-term trends, analysis of performance issues, and setting up visualizations, ultimately improving observability and server reliability.