Company
Date Published
April 7, 2022
Author
Amy Steier
Word count
1445
Language
English
Hacker News points
2

Summary

This article is a step-by-step guide to de-identifying a relational database for demo or pre-production testing environments while preserving the referential integrity of its primary and foreign keys. Using an ecommerce relational database as the example, it walks through gathering data directly from the database, defining transform policies, training models, generating data, transforming primary/foreign key relationships so that every foreign key still points at a valid primary key, and loading the final data back into the database.
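The summary does not say how the guide transforms keys, so the following is only a minimal sketch of one common approach: build a single old-to-new mapping for each primary key and apply that same mapping to every foreign key column that references it. The table contents, the column names (`customer_id`, `order_id`), and the helpers `build_key_map`, `remap`, and `orphans` are hypothetical and chosen for illustration; they are not taken from the article.

```python
import random


def build_key_map(keys, seed=0):
    """Map each distinct original key to a new surrogate key.

    Assumption: surrogates are the integers 1..n in a seeded random order,
    so the new keys reveal nothing about the original values.
    """
    unique = sorted(set(keys))
    surrogates = list(range(1, len(unique) + 1))
    random.Random(seed).shuffle(surrogates)
    return dict(zip(unique, surrogates))


def remap(rows, column, key_map):
    """Return copies of rows with `column` replaced through key_map."""
    return [{**row, column: key_map[row[column]]} for row in rows]


def orphans(child_rows, fk, parent_rows, pk):
    """Return child rows whose foreign key matches no parent primary key."""
    parent_keys = {row[pk] for row in parent_rows}
    return [row for row in child_rows if row[fk] not in parent_keys]


# Hypothetical ecommerce tables: orders.customer_id references customers.customer_id.
customers = [
    {"customer_id": 101, "segment": "retail"},
    {"customer_id": 205, "segment": "wholesale"},
    {"customer_id": 309, "segment": "retail"},
]
orders = [
    {"order_id": 1, "customer_id": 101},
    {"order_id": 2, "customer_id": 309},
    {"order_id": 3, "customer_id": 101},
]

# One mapping per primary key, reused for every column that references it.
key_map = build_key_map(row["customer_id"] for row in customers)
new_customers = remap(customers, "customer_id", key_map)
new_orders = remap(orders, "customer_id", key_map)

# Referential integrity check: no order may point at a missing customer.
dangling = orphans(new_orders, "customer_id", new_customers, "customer_id")
```

Because both tables pass through the same `key_map`, each order still belongs to the same (now renumbered) customer, and `dangling` is empty. Running a check like `orphans` before loading the data back is a cheap way to confirm that no relationship was broken along the way.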