Mastering Data Build Tool (DBT) for Snowflake
- Created By ebrahim khaja
- Posted on September 7th, 2023
- Overview
- Prerequisites
- Audience
- Audience
- Curriculum
Description:
In the data-driven landscape, the ability to efficiently manage, transform, and materialize data is crucial. In this comprehensive hands-on course, students will dive deep into the world of DBT and learn how to leverage its power to build robust data transformation pipelines. From foundational concepts to advanced techniques, students will gain hands-on experience working with key components: models, materialization, seeds, snapshots, source freshness, macro, and hooks. By the end of this course, students will be equipped with skills to build efficient data pipelines using modern data build tools.
Duration: 2 days
Course Code: BDT301
Learning Objectives:
After this course, you will be able to:
- Introduction Data Build Tool (DBT)
- Understanding DBT models
- Using DBT tests to ensure quality of DBT models
- Explore DBT materializations to optimize performance & scalability of DBT models
- Integrating seeds and sources in DBT project during data ingestion
- Enhance data loading capabilities and create custom macros
- Using DBT snapshots learn how to capture historical versions of data for auditing and analysis process
- Dive into DBT Hooks to integrate external scripts and actions into your DBT workflows
- Basic understanding of Snowflake and SQL
- This course is designed for Analytics Engineers, Data Analysts, BI Professionals, Data Scientists, Data Engineers, DevOps Engineers, and Architects
- This course is designed for Analytics Engineers, Data Analysts, BI Professionals, Data Scientists, Data Engineers, DevOps Engineers, and Architects
1. Introduction to Data Build Tool (DBT)
- Introduction to Data Warehouse (Snowflake)
- ETL v/s ELT
- DBT introduction
- DBT installation
- DBT cloud introduction
- Lab: Getting started with DBT
2. Understanding DBT Models
- What are DBT models?
- Creating DBT table
- Using DBT schema
- DBT project organization
- Lab: Project organization
3. Using DBT Tests
- What is DBT schema?
- What is a DBT Macro?
- Understanding DBT test types
- Lab: Generic and Singular Tests
4. Exploring DBT Materialization
- What are materializations in DBT?
- Default materialization in DBT
- Using Config Block for materialization
- Lab: Setting materialization
5. Integrating DBT Seeds and Sources
- Seeds and Sources overview
- Adding sources in DBT
- What is source freshness?
- Labs: Adding source freshness check in DBT
6. DBT Custom Macros
- Implementing Table, View and Ephemeral Model
- Create custom macro
- Understanding DBT package
- Labs: Building incremental load
7. Working with DBT Snapshots
- Snapshots overview
- Creating a snapshot
- Labs: create snapshot
8. DBT Hooks
- What are DBT Hooks?
- Understanding pre-hook, post-hook, on-run-start, on-run-end hooks
- Implementing DBT Hook
- Labs: implementing DBT hook
Training material provided: Yes (Digital format)
Hands-on Lab: Instructions will be provided to set up a free tier snowflake account. Also, students will be provided on how to install DBT tool on Windows/Mac