/plushcap/analysis/census/census-how-to-query-iceberg-locally-using-spark-pyiceberg-or-duckdb

How to query Iceberg locally using Spark, PyIceberg, or duckdb

What's this blog post about?

The text discusses how to get started with Apache Iceberg data in the cloud and behind catalogs by running an Iceberg client locally. It covers three main aspects: catalog type, underlying data storage location, and read/write capabilities. Various tools like Spark, PyIceberg, duckdb, and their respective configurations are explained for setting up an Iceberg client. The text also highlights the benefits of using these tools in combination with each other to leverage their strengths effectively.

Company
Census

Date published
Oct. 8, 2024

Author(s)
Sean Lynch

Word count
1646

Language
English

Hacker News points
None found.