Delta Lake has been around for several years now, and moving to it from a traditional SQL data warehouse requires some fundamental changes in approach.
We will review the major points of two core concepts used in lakehouses: Parquet files and Delta.
For Parquet files, we will review how the files are structured, compressed, and queried. Comparing them to the 8 KB pages of SQL Server has many implications for how we structure data for analytical queries.
For Delta, we will dive into how it enables ACID, time travel, upserts, streaming, and more!
With the addition of Direct Lake mode in Microsoft Fabric, understanding these concepts is becoming increasingly important for any data team.