The Trouble with Mounting

What's this blog post about?

The Datadog agent showed long gaps in metrics on some systems because the os.statvfs call made during disk checks could take a very long time to return. The problem occurred on NFS (Network File System) hard mounts, where, by design, a program waiting on an unresponsive server can hang in an unkillable state. To address this, the Datadog agent now runs the statvfs call on a separate thread and times out if the call hangs, allowing the main thread to continue. This solution slightly increases memory usage but is necessary for the agent to operate across environments with different NFS mount settings.
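
The thread-plus-timeout approach described above can be sketched roughly as follows. This is a minimal illustration, not the agent's actual code: the function name statvfs_with_timeout and the 5-second default are assumptions made for the example.

```python
import os
import threading


def statvfs_with_timeout(path, timeout=5.0):
    """Return os.statvfs(path), or None if the call does not finish in time.

    Hypothetical helper for illustration; the name and default timeout
    are assumptions, not taken from the Datadog agent.
    """
    result = {}

    def worker():
        # Runs in the helper thread; may block indefinitely on a hard NFS mount.
        try:
            result["value"] = os.statvfs(path)
        except OSError as exc:
            result["error"] = exc

    # Daemon thread, so a stuck call does not prevent interpreter shutdown.
    thread = threading.Thread(target=worker, daemon=True)
    thread.start()
    thread.join(timeout)  # wait at most `timeout` seconds

    if thread.is_alive():
        # Timed out: the caller moves on, the blocked thread is left behind.
        return None
    if "error" in result:
        raise result["error"]
    return result["value"]


if __name__ == "__main__":
    stats = statvfs_with_timeout("/")
    if stats is None:
        print("statvfs timed out")
    else:
        print("block size:", stats.f_bsize)
```

In a sketch like this, a thread that never returns cannot be killed from Python, so it simply stays blocked until the mount responds. Each such leftover thread holds some memory, which is one way to read the small memory cost mentioned above.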

Company
Datadog

Date published
Jan. 23, 2017

Author(s)
Greg Meyer, Yann Mahe

Word count
653

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.