Gretel.ai

Founded in 2019. Privately Held.

External links: homepage | docs | blog | jobs | youtube | twitter | github | linkedin

Synthetic data creation.

Blog posts published by month since the start of

167 total blog posts published.

Switch to word count

Blog content

post title author published words HN
Red Teaming Synthetic Data Models Marjan Emadi Jun. 02, 2022 1252 5
Conditional Text Generation by Fine Tuning Gretel GPT Alex Watson May. 26, 2022 792 3
Diffusion models for document synthesis Andrew Carr May. 19, 2022 735 -
What is Model Soup? Andrew Carr May. 11, 2022 639 -
Transforms and Synthetics on Relational Databases Amy Steier May. 06, 2022 1312 -
ML Models: Understanding the Fundamentals Will Jennings Apr. 28, 2022 3327 -
Transforms and Multi-Table Relational Databases Amy Steier Apr. 07, 2022 1445 2
What is Data Anonymization? John Myers Apr. 01, 2022 2755 -
Simplifying Our APIs Alex Watson Apr. 06, 2022 510 -
How to Generate Synthetic Data: Tools and Techniques to Create Interchangeable Datasets Alex Watson Mar. 24, 2022 4312 3
What is Synthetic Data? Alex Watson Apr. 29, 2022 2800 -
Q&A Series: Solving Privacy Problems with Synthetic Data Lipika Ramaswamy Mar. 11, 2022 922 -
Create a Location Generator GAN Alex Watson Mar. 02, 2022 1204 -
How to use Weights & Biases with Gretel.ai Alex Watson Feb. 17, 2022 1066 -
Data Is More Valuable When It Can Be Shared Alex Watson Feb. 01, 2022 432 10
What We’re Reading: Trends & Takeaways from the NeurIPS 2021 Conference Gretel Research Jan. 26, 2022 622 1
Creating Synthetic Time Series Data for Global Financial Institutions – a POC Deep Dive Alex Watson Jan. 30, 2022 1351 3
Advanced Data Privacy: Gretel Privacy Filters and ML Accuracy Amy Steier Jan. 05, 2022 1373 1
Why Nonprofits Should Care About Synthetic Data Daniel Nissani Dec. 17, 2021 1069 -
Gretel.ai + Illumina - Using AI to create safe, synthetic datasets for genomics Alex Watson Mar. 31, 2022 451 -
Optuna Your Model Hyperparameters Amy Steier Dec. 09, 2021 1294 -
Common misconceptions about differential privacy Lipika Ramaswamy Dec. 08, 2021 1326 -
Veterans Day Reflections: Open source software and evacuation operations, a remarkable combination. John Myers Nov. 28, 2021 1395 -
Got text? Use Named Entity Recognition (NER) to label PII in your data Yamini Kagal Nov. 09, 2021 685 -
Workshop: Generating Synthetic Data for Healthcare & Life Sciences Alex Watson Mar. 21, 2022 2768 -
Why privacy by design matters more than ever Ali Golshan Nov. 02, 2021 582 -
Exploring NLP Part 2: A New Way to Measure the Quality of Synthetic Text Daniel Nissani Oct. 05, 2021 2243 -
Exploring NLP Part 1: Why Should a Privacy Engineering Company Care About NLP? Daniel Nissani Sep. 21, 2021 845 -
Introducing Gretel's Privacy Filters Amy Steier Sep. 01, 2021 936 1
Instrumenting Kubernetes in AWS with Terraform and FluentBit Rob Stark Aug. 30, 2021 2465 -
Build a synthetic data pipeline using Gretel and Apache Airflow Drew Newberry Aug. 24, 2021 1803 1
Gretel releases Beta 2 John Myers Jul. 13, 2021 380 -
What's new in Beta2 Alex Watson Jun. 15, 2021 618 -
Why I Joined Gretel Ali Golshan Jun. 08, 2021 635 -
What is Privacy Engineering? Alex Watson Jun. 02, 2021 657 -
A guide to load (almost) anything into a DataFrame Piotr Mlocek May. 13, 2021 1487 2
Synthetic Data Configuration Templates John Myers May. 03, 2021 608 -
Practical Privacy with Synthetic Data Alex Watson Apr. 27, 2021 1003 -
Introducing the Gretel Bartender Arron Hunt Apr. 05, 2021 757 -
Anonymize Data with S3 Object Lambda John Myers Mar. 30, 2021 1762 -
How accurate is my synthetic data? Amy Steier Mar. 29, 2021 1135 -
Gretel Smart-Seeding is auto-complete for your data John Myers Mar. 02, 2022 524 -
Machine Learning Accuracy Using Synthetic Data Amy Steier Mar. 03, 2021 660 -
CHANGELOG: Beta2 John Myers Mar. 01, 2021 804 -
Creating synthetic time series data Alex Watson Feb. 22, 2021 772 -
Walkthrough: Create Synthetic Data from any DataFrame or CSV Alex Watson Aug. 05, 2021 1200 -
Recognizing Data Privacy Day by Protecting Your Privacy Laszlo Bock Jan. 28, 2021 707 -
Install TensorFlow with CUDA, cDNN, and GPU Support in 4 Easy Steps Alex Watson May. 07, 2021 262 2
Automate Detecting Sensitive Personally Identifiable Information (PII) Alex Watson Nov. 09, 2021 567 1
Automatically Reducing AI Bias With Synthetic Data Amy Steier Jan. 09, 2021 679 1
How To Create Differentially Private Synthetic Data Alex Watson Jan. 09, 2021 1073 1
Gretel.ai Raises $12 Million in Series A to Safely Share, Build with Data Alex Watson Nov. 16, 2020 289 -
Load NER data into Elasticsearch Tyler Bray Nov. 17, 2020 1758 1
November 2020 - What’s new in Gretel Arron Hunt Nov. 10, 2020 494 -
Auto-anonymize production datasets for development Drew Newberry Jan. 09, 2021 822 -
Introducing Gretel Blueprints John Myers Oct. 27, 2020 417 8
Gretel's New Synthetic Performance Report Amy Steier Oct. 07, 2020 1145 2
Create high quality synthetic data in your cloud with Gretel.ai and Python Alex Watson Sep. 18, 2020 616 2
How to use Gretel’s new entity stream Arron Hunt Sep. 08, 2020 399 -
Gretel Synthetics Frequently Asked Questions (FAQs) Alex Watson Jan. 31, 2022 1743 -
NEW: Integrating with Gretel SDKs just got easier! Arron Hunt Sep. 10, 2020 515 -
README.V2 Alex Watson Sep. 01, 2020 423 11
Improving massively imbalanced datasets in machine learning with synthetic data Alex Watson Mar. 26, 2022 1220 2
Reducing AI bias with Synthetic data Alex Watson Jan. 11, 2021 870 -
Gretel Synthetics: Introducing v0.10.0 John Myers Aug. 23, 2020 825 -
Automated Data Exposure Detection with Gretel Outpost John Myers Sep. 01, 2020 734 1
Contact Tracing: Deep Dive & Simulation John Myers Aug. 23, 2020 1469 -
Create artificial data with Gretel Synthetics and Google Colaboratory Alex Watson Sep. 02, 2020 148 -
Fast data cataloging of streaming data for fun and privacy John Myers Sep. 01, 2020 926 -
Using generative, differentially-private models to build privacy-enhancing, synthetic datasets from real data. Alex Watson Sep. 14, 2020 2683 1
Gretel.README Alex Watson Aug. 23, 2020 479 -
Deep dive on generating synthetic data for Healthcare Alex Watson Sep. 01, 2020 952 -
Innovating With FastText and Table Headers Amy Steier Aug. 20, 2020 974 -
How we accidentally discovered personal data in a popular Kaggle dataset John Myers Aug. 24, 2020 923 1
Introducing Gretel Benchmark Nicole Pang Oct. 05, 2022 984 1
Conditional data generation in 4 lines of code Alex Watson Sep. 29, 2022 536 -
Announcing the Synthetic Data Community Discord Mason Egger Sep. 21, 2022 296 -
Generate time-series data with Gretel’s new DGAN model Kendrick Boyd Sep. 15, 2022 1364 -
Community Insights: Overcoming Medical Class Imbalance with Synthetic Data Murtaza Khomusi Sep. 14, 2022 1371 -
Introducing Gretel Amplify Grace King Sep. 07, 2022 848 -
An update to Gretel’s license to support continuous community growth and innovation John Myers Aug. 30, 2022 722 -
Generate synthetic data in 3 lines of code Alex Watson Aug. 24, 2022 495 49
How to safely work with another company's data Andrew Carr Aug. 16, 2022 1127 1
Progress and Innovation - Women in AI Jennifer Yonemitsu Aug. 12, 2022 748 -
The Evolution of Gretel's Developer Stack for Synthetic Data John Myers Jul. 26, 2022 1266 -
Measure the Quality of any Synthetic Dataset with Gretel Evaluate Grace King Jul. 20, 2022 774 1
Evaluating Data Sampling Methods with a Synthetic Quality Score Andrew Carr Jul. 13, 2022 720 -
Data Simulation: Tools, Benefits, and Use Cases Will Jennings Jul. 13, 2022 2409 -
Test Data Generation: Uses, Benefits, and Tips Will Jennings Jun. 30, 2022 2278 -
Create Synthetic Time-series Data with DoppelGANger and PyTorch Kendrick Boyd Jun. 21, 2022 1834 -
Synthetic Data and the Data-centric Machine Learning Life Cycle Alex Watson Oct. 26, 2022 1013 1
Generate synthetic Taylor Swift-like lyrics using Gretel GPT Grace King Nov. 15, 2022 1095 -
Downstream ML classification with Gretel ACTGAN and PyCaret Andrew Carr Dec. 02, 2022 952 -
Synthetic Image Models for Smart Agriculture Andrew Carr Dec. 08, 2022 1246 -
Anonymize tabular data to meet GDPR privacy requirements Alex Watson Jan. 25, 2023 1223 -
Bringing AI-generated images to enterprise use cases Andrew Carr Feb. 08, 2023 799 12
Gretel and Google Cloud partner on synthetic data Ali Golshan Mar. 14, 2023 718 -
Teaching large language models to zip their lips Andrew Carr Mar. 15, 2023 1195 1
Augmenting ML Datasets with Gretel and Vertex AI John Myers Mar. 22, 2023 1781 4
Generate Synthetic Databases with Gretel Relational Grace King Mar. 23, 2023 2083 1
Compare Synthetic and Real Data on ML Models with the new Gretel Synthetic Data Utility Report Nicole Pang Apr. 06, 2023 1352 1
Introducing Gretel Tabular DP: A fast, graph-based synthetic data model with strong differential privacy guarantees Lipika Ramaswamy Apr. 24, 2023 1796 -
Introducing Gretel Tabular DP: A fast, graph-based synthetic data model with strong differential privacy guarantees Lipika Ramaswamy Apr. 24, 2023 1796 -
Scale Synthetic Data to Millions of Rows with ACTGAN Alex Watson May. 03, 2023 844 -
Helping Organizations Build Resilient and Trustworthy Information Technology Bryan Zimmer Jun. 08, 2023 336 -
Unlocking Adapted LLMs on Enterprise Data Alex Watson Jun. 08, 2023 766 2
Gretel Tweets Alex Watson Jun. 15, 2023 1331 -
Gretel is now available in the AWS Marketplace Ali Golshan Jun. 12, 2023 400 -
Gretel is live on Google Cloud Marketplace 🎉 Ali Golshan Jun. 27, 2023 409 -
Measure the utility and quality of GPT-generated text using Gretel’s new text report Marjan Emadi & Nicole Pang Jun. 28, 2023 806 -
Bring Your Own Cloud (BYOC): Transforming & Synthesizing Data with Gretel Hybrid Matt Kornfield Jul. 13, 2023 1428 2
Predicting Patient Stay Durations in the ER with Safe Synthetic Data Alex Watson Jul. 19, 2023 1396 -
Comprehensive Data Cleaning for AI and ML Amy Steier Jul. 24, 2023 2119 -
Gretel GPT Sentiment Swap Johnny Greco Aug. 24, 2023 2610 -
Synthetic Data, Real Privacy: Automating Secure Workflows with Gretel and Amazon SageMaker Maarten Van Segbroeck, Rumi Olsen, Qiong Zhang Aug. 08, 2023 1696 -
Prompting Llama-2 at Scale with Gretel Alex Watson Oct. 03, 2023 647 2
Synthesizing dialogs for better conversational AI Maarten Van Segbroeck Sep. 28, 2023 623 -
Automate Synthetic Data Pipelines with Gretel Workflows Yamini Kagal Sep. 13, 2023 795 -
How to Safely Query Enterprise Data with Langchain Agents + SQL + OpenAI + Gretel Alex Watson Sep. 11, 2023 960 -
We just streamlined Gretel’s Python SDK Johnny Greco Oct. 17, 2023 505 -
Optimize the Llama-2 Model with Gretel’s Text SQS Marjan Emadi Nov. 06, 2023 747 -
AWS + Gretel Synthetic Data Accelerator Program for Generative AI Will Jennings Nov. 07, 2023 564 2
AWS + Gretel Synthetic Data Accelerator Program for Generative AI Will Jennings Nov. 07, 2023 564 -
Gretel Demo Day: Exploring the Future of Synthetic Data Will Jennings Nov. 17, 2023 1445 -
Generate Synthetic Data Using Gretel Hybrid Ben McCown Nov. 21, 2023 1963 -
Training Better LLMs & SLMs with Diverse, High-Quality Synthetic Data Alex Watson Dec. 05, 2023 403 -
Gretel announces partnership with Microsoft Azure and joins Microsoft for Startups Pegasus Program Will Jennings Dec. 06, 2023 515 -
Nail Synthetic Data Generation Every Time with Gretel Tuner Johnny Greco Dec. 13, 2023 417 -
Filling in sparse tables with Gretel’s Tabular LLM Nick Keune Dec. 18, 2023 775 -
Differentially Private Synthetic Text Generation with Gretel: Making Data Available at Scale (Part 1) Alex Watson Jan. 16, 2024 1991 1
Introducing Gretel's Transform v2 Sami Torbey Jan. 31, 2024 858 -
How to Improve RAG Model Performance with Synthetic Data Murtaza Khomusi Feb. 02, 2024 1023 -
How to Generate Best-in-Class Synthetic Time Series Data Maarten Van Segbroeck Feb. 29, 2024 820 -
Gretel awarded ISO 27001 certification Bryan Zimmer Mar. 11, 2024 285 -
RAG Model Evaluation with Azure AI and Gretel Navigator Maarten Van Segbroeck Mar. 18, 2024 485 -
What is Retrieval Augmented Generation? Gretel Team Mar. 26, 2024 3068 -
What is Tabular Data? Gretel Team Mar. 22, 2024 3076 -
Introducing world's largest synthetic open-source Text-to-SQL dataset Yev Meyer Apr. 04, 2024 1602 -
Gretel partners with Google Cloud to develop native synthetic data integration, achieves BigQuery designation Will Jennings Apr. 09, 2024 539 -
Fine-Tuning CodeLlama on Gretel & AWS SageMaker JumpStart Maarten Van Segbroeck, Qiong (Jo) Zhang, Shashi Raina May. 02, 2024 676 -
Generate Differentially Private Synthetic Text with Gretel GPT Lipika Ramaswamy, Andre Manoel May. 24, 2024 2061 3
Detect and redact PII in free text with NER in Transform v2 Sami Torbey May. 30, 2024 332 -
What is Data Acquisition? Gretel Team Jun. 06, 2024 2799 -
What is Synthetic Data Generation? Gretel Team Jun. 06, 2024 2068 -
Gretel announces partnership with Databricks Brooke Gleason Jun. 10, 2024 465 -
Gretel Unlocks PII Detection with Synthetic Financial Document Dataset Alex Watson Jun. 12, 2024 958 1
Gretel Navigator is Now Generally Available Nicole Pang, Murtaza Khomusi Jun. 13, 2024 816 -
Fine-Tuning Gretel Navigator To Generate Highest Quality Domain-Specific Synthetic Data Sami Torbey Jun. 18, 2024 663 -
Generate Question-Truth Pairs from Documents with Gretel Navigator Maarten Van Segbroeck Jun. 17, 2024 428 -
Introducing Gretel MLOps Maarten Van Segbroeck Jun. 20, 2024 1757 -
An Awesome Synthetic Multilingual Prompts Dataset Maarten Van Segbroeck Jul. 03, 2024 652 -
How to Create High Quality Synthetic Data for Fine-Tuning LLMs Alex Watson Jul. 12, 2024 1975 2
The explosion of SLMs and license confusion Yev Meyer, Alex Watson Jul. 25, 2024 2023 1
Gretel’s New Data Privacy Score Amy Steier Aug. 06, 2024 2015 -
Synthesizing Private Patient Data with Gretel: A Step-by-Step Guide Alex Watson Aug. 13, 2024 1160 -
Addressing Concerns of Model Collapse from Synthetic Data in AI Alex Watson Aug. 23, 2024 1688 -
Navigator Fine Tuning is Now Generally Available Johnny Greco, Kristen Ennis Aug. 28, 2024 971 -
Gretel's Workflow Builder Streamlines Multi-Step Synthetic Data Generation for Financial Services Grace King, Murtaza Khomusi Sep. 05, 2024 1383 -
Privacy-First Chatbot Enhancement in Finance with Databricks and Gretel Kirit Thadaka, Manjesh Mogallapalli, Prasad Kona Sep. 04, 2024 1273 -
Gretel's Workflow Builder Streamlines Multi-Step Synthetic Data Generation for Financial Services Grace King, Murtaza Khomusi Sep. 05, 2024 1383 -
Teaching AI to Think: A New Approach with Synthetic Data and Reflection Alex Watson Sep. 12, 2024 1513 -
Gretel's Workflow Builder Streamlines Multi-Step Synthetic Data Generation for Financial Services Grace King, Murtaza Khomusi Sep. 05, 2024 1383 -
Gretel Featured in LinkedIn's Top 50 U.S. Startups of 2024 Will Jennings Sep. 25, 2024 305 -
GSM-Symbolic: Analyzing LLM Limitations in Mathematical Reasoning and Potential Solutions Alex Watson, Yev Meyer, Dane Corneil, Maarten Van Segbroeck Oct. 17, 2024 2022 -
Fine-tuning Models for Healthcare via Differentially-Private Synthetic Text Andre Manoel, Lipika Ramaswamy, Maarten Van Segbroeck, Qiong Zhang (AWS), Shashi Raina (AWS) Oct. 29, 2024 2238 -
GLiNER Models for PII Detection through Fine-Tuning on Gretel-Generated Synthetic Documents Maarten Van Segbroeck Oct. 31, 2024 991 -
Build high-quality datasets for AI using Gretel Navigator Data Designer Kirit Thadaka Nov. 12, 2024 1424 -

By Matt Makai. 2021-2024.