trusted-data-pipeline
Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb
This project is the demo for my talk, Building 3D Trusted Data Pipelines With Dagster, Dbt, and Duckdb, at PyCon Australia 2023.
The slides for the workshop are available at this link.
Data pipelines and architecture

Dagster global assets

Set up the Python environment
git clone git@github.com:danhphan/trusted-data-pipeline.git
cd trusted-data-pipeline
python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
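Before moving on, it can help to confirm that the command-line tools used in the later steps are visible inside the activated virtual environment. The following is a minimal sketch, not part of the project: `check_tools` is a hypothetical helper name, and it assumes that `requirements.txt` installs the `dagster` and `dbt` commands.

```shell
#!/bin/sh
# check_tools (hypothetical helper): print whether each named command is on PATH.
# Returns non-zero if at least one command is missing.
check_tools() {
  status=0
  for tool in "$@"; do
    if command -v "$tool" >/dev/null 2>&1; then
      echo "$tool: found"
    else
      echo "$tool: missing"
      status=1
    fi
  done
  return "$status"
}

# Assumption: requirements.txt provides the dagster and dbt CLIs.
check_tools python3 dagster dbt \
  || echo "Some tools are missing: activate venv and re-run pip install -r requirements.txt"
```

If a tool is reported as missing, the usual cause is that the virtual environment is not active in the current shell.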
Set up Dagster
cd ./dagster_dbt_jaffle/
dagster dev # or: dagit
Set up dbt
cd ./dbt_jaffle_shop/
dbt deps
dbt build
Run and test dbt models
dbt run
dbt test
dbt test --select customers
Generate and view docs
dbt docs generate
dbt docs serve --port 8081
dbt show --inline "select count(*) from {{ ref('stg_customers') }}"