Bring Your Own Cloud (BYOC): Transforming & Synthesizing Data with Gretel Hybrid
The post describes how 7 terabytes of data were anonymized using Gretel Transform in a hybrid cloud environment. The main challenge was keeping the data and models within the customer's cloud throughout the process, because some customers work in heavily regulated industries with stringent data residency requirements. To address this, Gretel developed a Kubernetes deployment strategy for its hybrid workflows. With this approach, the workflows scale with the number of nodes that can be run in the customer's environment, and all data stays within the customer's cloud. The post also covers preparing the data, running the training and transform jobs, and loading the transformed data into a lower environment for testing.
Company
Gretel.ai
Date published
July 13, 2023
Author(s)
Matt Kornfield
Word count
1428
Language
English
Hacker News points
2