/plushcap/analysis/whylabs/whylabs-posts-whylogs-embrace-data-logging

whylogs: Embrace Data Logging Across Your ML Systems

What's this blog post about?

whylogs is an open-source data logging library that enables data logging for any ML/AI pipeline in a few lines of code. It provides essential insights into the health and performance of the pipeline, helping developers and data scientists to be aware of changes in the quality and consistency of their data. The library supports a wide range of ML data types, including complex inputs like images and video, and is designed for both interactive analysis and production settings. whylogs is massively scalable due to its static memory footprint and lightweight output, which minimizes storage costs. It also prevents overcollection of data to reduce costs, security vulnerabilities, and operational complexity. The library currently supports columnar data in Python and Java, with plans to add support for time series, text, and image data types, as well as more languages.

Company
WhyLabs

Date published
Sept. 23, 2020

Author(s)
Andy Dang,, Bernease Herman

Word count
1904

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.