The Trouble with Mounting
The Datadog agent was experiencing long gaps in metrics on some systems due to the os.statvfs function call taking a very long time during disk checks. This issue occurred when using NFS (Network File Systems) with hard mounts, which can cause programs to hang in an unkillable state by design. To address this problem, the Datadog agent now runs the statvfs call on a separate thread and times out if it's hanging, allowing the main thread to continue. This solution slightly increases memory usage but is necessary for the agent to operate in various environments with different NFS mount settings.
Company
Datadog
Date published
Jan. 23, 2017
Author(s)
Greg Meyer, Yann Mahe
Word count
653
Language
English
Hacker News points
None found.