Tabular: Turning Your Data Swamp into a Data Lakehouse with Apache Iceberg

Clouded Judgement

If you want features in your lakehouse (on top of open source Iceberg) for ingestion, CDC, streaming (file loading, Kafka Connect, etc.), schema evolution, compaction, optimization, time travel, snapshots, auto-scaling, and maintenance (no more writing Spark jobs to delete snapshots!) [...] A natural question is “why do we have two tiers?”