Extract CSV data, transform with DuckDB SQL, and display results in a Flyte report.
- extract: Loads the Titanic CSV from a public URL using DuckDB's read_csv_auto
- transform: Aggregates survival statistics by passenger class using SQL
- pipeline: Orchestrates extract -> transform and renders the results as an HTML table in a Flyte report
cd tutorials/starter-examples/duckdb-etl
uv venv .venv --python 3.11
source .venv/bin/activate
uv pip install -r requirements.txt
To run remotely, configure your Flyte cluster endpoint:
flyte create config \
--endpoint <your-endpoint> \
--auth-type headless \
--builder remote \
--domain development \
--project flytesnacks
Don't have a cluster? Request access at flyte.org.
Remote:
uv run flyte run duckdb_etl.py pipeline
Local:
uv run flyte run --local duckdb_etl.py pipeline
- Fully self-contained, with no external services or accounts needed
- DuckDB can query pandas DataFrames directly with SQL