This repository contains a demo of dbt Python models on Databricks for the Coalesce 2022 conference. It will not be actively maintained. See the repository it was forked from for a current version -- we will work to merge the Python models into the main
branch there after Coalesce.
What a cool DAG! Python and SQL side-by-side in dbt!
Python models in dbt Cloud!
You should be setup as part of the workshop. You are expected to develop solutions to the challenges in a Databricks notebook, then transfer the code over to the dbt Cloud IDE for deployment in the dbt DAG.
See workshop.ipynb, intended for use in Databricks notebooks, for more details on the challenges.
The dbt models in models/challenges/
in the main
branch on this repo are intended to fail. Get them working!
Use the describe_py.py file.
Use the pivot_py.py file.
Use the cluster_py.py file.
Use the forecast_score_py.py file to train and persist the Prophet models, then forecast_score_py.py to load them back in and get the predictions.
We'd welcome contributions to this demo project. However, we will likely archive this repository sometime after Coalesce 2022. Consider contributing to the repository this one is forked from instead!