Gretel Synthetics: Introducing v0.10.0
Gretel Synthetics has released new features for its latest version, making it easier to create synthetic data with a batch interface that works directly with Pandas DataFrames. The Batch interface automates manual steps and supports high dimensionality datasets by clustering like columns and training models on subsets of the entire dataset at once. It maintains correlations and statistical relationships between columns, allowing for scaling to highly dimensional datasets with minimal loss in accuracy. Users can create a synthetic dataset interactively using Google Colaboratory's batch training notebook. The Batch module allows validators to be set for each batch, ensuring that the output DataFrame has the same shape as the input DataFrame. Gretel Synthetics plans to release its custom validation package soon, which will automatically learn constraints in data and enforce them during generation.
Company
Gretel.ai
Date published
Aug. 23, 2020
Author(s)
John Myers
Word count
825
Language
English
Hacker News points
None found.