What You Need to Know Before
You Start

Starts 5 June 2025 19:35

Ends 5 June 2025

00 days
00 hours
00 minutes
00 seconds
course image

dbt on Databricks

Building Scalable, Modular, Testable, and Version-Controlled Data Transformation Pipelines with dbt on Databricks
via Udemy

4052 Courses


7 hours 47 minutes

Optional upgrade avallable

Not Specified

Progress at your own speed

Paid Course

Optional upgrade avallable

Overview

Are you ready to unlock the full potential of your data analytics pipelines? dbt on Databricks is a comprehensive course tailored for data professionals aiming to master data transformation using dbt (data build tool) on the Databricks platform, harnessing the power of Apache Spark for scalable and efficient workflows.

Syllabus

  • Introduction to dbt and Databricks
  • Overview of dbt and its role in data transformation
    Introduction to Databricks and Apache Spark
  • Setting Up Your Environment
  • Installing dbt on Databricks
    Configuring dbt profiles for Databricks
  • Fundamentals of dbt
  • Understanding the dbt workflow
    Writing basic dbt models
    Utilizing macros and variables
  • Advanced dbt Techniques
  • Implementing tests and documentation
    Strategies for model optimization
    Using hooks and operations
  • Leveraging Apache Spark with dbt
  • Overview of Apache Spark architecture
    Integrating Spark SQL with dbt models
    Managing large datasets with Spark
  • Implementing dbt in Databricks
  • Running dbt jobs in Databricks notebooks
    Scheduling and orchestrating dbt runs in Databricks
  • Data Quality and Testing
  • Best practices for data testing in dbt
    Automating tests on Databricks
  • Debugging and Optimization
  • Identifying and resolving performance bottlenecks
    Profiling and optimizing queries with dbt and Spark
  • Use Cases and Real-world Applications
  • Case studies of dbt on Databricks implementations
    Success stories from industry
  • Course Project
  • Designing and implementing a data transformation pipeline using dbt on Databricks
    Presentation and peer review of projects
  • Conclusion and Next Steps
  • Recap of key concepts
    Resources for further learning and development

Taught by

Malvik Vaghadia


Subjects

Business