Company
Date Published
Author
Jason Myers
Word count
595
Language
English
Hacker News points
None

Summary

Red Hat, a global leader in open source enterprise IT solutions, faced the challenge of managing its massive enterprise IT infrastructure, which involves monitoring over 40,000 employees across forty different countries. The company's internal network monitoring team aimed to build a single source of truth for network performance and observability by collecting data from thousands of devices and interfaces worldwide. To achieve this, they adopted InfluxDB as their critical piece in the network monitoring architecture, utilizing Telegraf and gNMI plugins to collect data directly from network devices whenever possible. The system enriches data, detects issues, sends alerts, and stores analyzed data in InfluxDB for further analysis and visualization. This solution relies on Ansible for automation and requires relatively little manual intervention, allowing support engineers to focus on critical issues rather than managing individual devices.