/plushcap/analysis/fivetran/star-schema-vs-obt

Star Schema vs. OBT for Data Warehouse Performance

What's this blog post about?

Michael Kaminsky from Gradient Metrics conducted a benchmark comparison of data architectures, specifically focusing on the performance implications of different warehouse distribution patterns under normal BI-style workloads within Redshift, Snowflake, and BigQuery. The results showed that denormalized tables resulted in faster query response times compared to star schemas for all three warehouses. The speed improvement ranged from 25% to 50%, depending on the warehouse used. This analysis was conducted using a subset of TPC-DS benchmark data and aimed to understand how different data architecture patterns perform once a warehouse has been chosen, rather than comparing warehouses themselves.

Company
Fivetran

Date published
Nov. 23, 2020

Author(s)
Michael Kaminsky

Word count
1348

Language
English

Hacker News points
None found.


By Matt Makai. 2021-2024.