Empowering Fabric Lakehouse Users Microsoft Fabric allows developers to create delta tables in the Lakehouse. However, automating the copying, updating, and deleting of data files in the Lakehouse might not be possible. How can we empower the end users to make these changes? Business Problem Today, we are going to cover two new tools from Microsoft that are in preview. First, the OneLake file explorer allows users to access files in the Lakehouse as if they were in Windows Explorer. Second, the Data Wrangler extension for Visual Studio Code…
Category: Fabric
Articles related to Microsoft’s newest service.
Thread 05 – Data Engineering with Fabric
Data Presentation Layer Microsoft Fabric allows the developer to create delta tables in the lakehouse. The bronze tables contain multiple versions of the truth, and the silver tables contain a cleaned-up, single version of the truth. How can we combine the silver tables into a relational model for consumption from the gold layer? Business Problem Our manager at Adventure Works has asked us to use a metadata-driven solution to ingest CSV files from external storage into Microsoft Fabric. A typical medallion architecture will be used in the…
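The excerpt above describes rolling cleaned silver tables up into a relational gold model. A minimal sketch of that step in a Fabric Spark notebook might look like the following; the table and column names (silver_customer, silver_sales, gold_fact_sales) are hypothetical and not taken from the article.

```python
# Minimal sketch: build a gold fact table by joining hypothetical silver
# tables in a Fabric Spark notebook (the `spark` session is pre-defined there).
# Table and column names are illustrative, not taken from the article.
from pyspark.sql import functions as F

customers = spark.table("silver_customer")   # cleaned dimension data
sales = spark.table("silver_sales")          # cleaned transactional data

# Denormalize the silver tables into a single gold fact table for reporting.
gold_fact_sales = (
    sales.join(customers, on="customer_id", how="inner")
         .select(
             "sales_order_id",
             "customer_id",
             "customer_name",
             "order_date",
             (F.col("quantity") * F.col("unit_price")).alias("extended_amount"),
         )
)

# Overwrite the gold delta table so downstream reports see the latest model.
gold_fact_sales.write.mode("overwrite").format("delta").saveAsTable("gold_fact_sales")
```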
Thread 04 – Data Engineering with Fabric
Metadata Driven Pipelines What is a metadata driven pipeline? Wikipedia defines metadata as “data that provides information about other data”. As a developer, we can create a non-parameterized pipeline and/or notebook to solve a business problem. However, if we have to solve the same problem a hundred times, the amount of code can get unwieldy. A better way to solve this problem is to store metadata in the delta lake. This data will drive how the Azure Data Factory pipelines and Spark notebooks execute. Business Problem Our manager has asked…
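To make the idea concrete, here is a minimal sketch of a metadata-driven load in a Spark notebook; the control table name and its columns (source_path, target_table, load_type) are assumptions for illustration, not the article's actual schema.

```python
# Minimal sketch of a metadata-driven load in a Fabric Spark notebook
# (the `spark` session is pre-defined there). The control table name and
# its columns are hypothetical.

# Read the control table that describes every source file to ingest.
meta_rows = spark.table("meta_control").collect()

# Loop over the metadata and run the same parameterized load logic each time.
for row in meta_rows:
    df = (
        spark.read.format("csv")
             .option("header", "true")
             .load(row["source_path"])
    )
    write_mode = "overwrite" if row["load_type"] == "full" else "append"
    df.write.mode(write_mode).format("delta").saveAsTable(row["target_table"])
```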
Thread 03 – Data Engineering with Fabric
Full versus Incremental Loads The loading of data from a source system to a target system has been well documented over the years. My first introduction to an Extract, Transform and Load program was DTS for SQL Server 7.0 in 1998. In a data lake, we have a bronze quality zone that is supposed to represent the raw data in a delta file format. This might include versions of the files for auditing. In the silver quality zone, we have a single version of the truth. The data is de-duplicated and cleaned up.…
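A sketch of the silver-zone step described above, de-duplicating versioned bronze rows down to a single version of the truth, might look like this; the table and column names (bronze_weather, natural_key, load_date) are assumptions for illustration.

```python
# Minimal sketch: promote raw, possibly versioned bronze rows to a silver
# table holding a single version of the truth. Runs in a Fabric Spark
# notebook; table and column names are illustrative only.
from pyspark.sql import functions as F
from pyspark.sql.window import Window

bronze = spark.table("bronze_weather")

# Keep only the latest record per natural key, then drop exact duplicates.
latest_first = Window.partitionBy("natural_key").orderBy(F.col("load_date").desc())
silver = (
    bronze.withColumn("row_num", F.row_number().over(latest_first))
          .filter("row_num = 1")
          .drop("row_num")
          .dropDuplicates()
)

silver.write.mode("overwrite").format("delta").saveAsTable("silver_weather")
```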
Thread 02 – Data Engineering with Fabric
Managing Files and Folders What is a data lake? It is just a bunch of files organized by folders. Keeping these files organized prevents your data lake from becoming a data swamp. Today, we are going to learn about a Python library that can help you. Business Problem Our manager has given us weather data to load into Microsoft Fabric. We need to create folders in the landing zone to organize these files by both full and incremental loads. How can we accomplish this task? Technical Solution This use case…
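The excerpt does not name the library, so the sketch below assumes mssparkutils, the file-system helper available in Fabric notebooks; the folder layout is only an illustration of organizing full and incremental landing zones.

```python
# Minimal sketch of organizing a landing zone with mssparkutils in a Fabric
# notebook. The library choice and folder layout are assumptions, not taken
# from the article. Relative "Files/..." paths assume a default lakehouse
# is attached to the notebook.
from notebookutils import mssparkutils

# Create separate folders for full and incremental weather loads.
for folder in ["Files/landing/weather/full", "Files/landing/weather/incremental"]:
    mssparkutils.fs.mkdirs(folder)

# List the landing zone to confirm the new structure.
for item in mssparkutils.fs.ls("Files/landing/weather"):
    print(item.path, item.isDir)
```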
Thread 01 – Data Engineering with Fabric
Managed Vs Unmanaged Tables Microsoft Fabric was released to general availability on November 15th, 2023. I will be writing a quick post periodically in 2024 to get you up to speed on how to manipulate data in the lakehouse using Spark. I really like the speed of the starter pools in Microsoft Fabric. A one-to-ten node pool will be available for consumption in less than 10 seconds. Read all about this new compute on this Learn page. Business Problem Our manager has given us weather data…
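As a quick illustration of the managed versus unmanaged distinction in a Spark notebook, the sketch below writes the same hypothetical data both ways; the table names, sample rows, and file path are assumptions, and the relative path presumes a default lakehouse is attached.

```python
# Minimal sketch contrasting managed and unmanaged delta tables in a Fabric
# Spark notebook (the `spark` session is pre-defined there). Sample data,
# table names, and the path are illustrative only.
df = spark.createDataFrame(
    [("2024-01-01", 42.0), ("2024-01-02", 38.5)],
    ["observation_date", "temperature_f"],
)

# Managed table: Spark owns both the metadata and the files, so dropping
# the table also deletes the data.
df.write.mode("overwrite").format("delta").saveAsTable("weather_managed")

# Unmanaged (external) table: the data lives at a path we control, so
# dropping the table removes only the metadata and leaves the files behind.
# A full abfss:// URI may be required if no default lakehouse is attached.
(
    df.write.mode("overwrite")
      .format("delta")
      .option("path", "Files/raw/weather_unmanaged")
      .saveAsTable("weather_unmanaged")
)
```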