Question 1

What does this generate?

Accepted Answer

From a file's inferred schema, it produces three dbt artifacts: a sources.yml (the source definition), a staging model (models/staging/stg_<table>.sql, which selects from that source), and a schema.yml models block documenting each column with its data type. It also emits the underlying CREATE TABLE statement for reference.

Question 2

What's the difference between sources.yml and schema.yml?

Accepted Answer

sources.yml declares a raw table as a source so models can reference it with {{ source('source_name', 'table_name') }}. The first argument is the source's name as declared in sources.yml, which need not be the same as the warehouse schema. schema.yml (a models block) documents a model's columns and is where you add tests and descriptions. This tool emits both, wired together: the staging model reads from the source declared in sources.yml.
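
To make the wiring concrete, here is a minimal Python sketch that renders the three pieces from a list of column names and types. It is an illustration only: the function names, the source name raw, the table orders, and the your_database / your_schema placeholders are all assumptions, and the tool's real output may be laid out differently.

```python
# Illustrative sketch only: names and layout are assumptions,
# not the tool's actual implementation or exact output.

def render_sources_yml(source_name: str, table: str) -> str:
    """Declare a raw table as a dbt source."""
    return (
        "version: 2\n"
        "sources:\n"
        f"  - name: {source_name}\n"
        "    database: your_database  # placeholder, adjust to your warehouse\n"
        "    schema: your_schema      # placeholder, adjust to your warehouse\n"
        "    tables:\n"
        f"      - name: {table}\n"
    )


def render_staging_sql(source_name: str, table: str, columns: list[tuple[str, str]]) -> str:
    """Staging model that selects every column from the declared source."""
    select_list = ",\n".join(f"    {name}" for name, _ in columns)
    # Quadruple braces in an f-string produce the literal {{ and }} that dbt's Jinja expects.
    return (
        "select\n"
        f"{select_list}\n"
        f"from {{{{ source('{source_name}', '{table}') }}}}\n"
    )


def render_schema_yml(table: str, columns: list[tuple[str, str]]) -> str:
    """Models block documenting each column of the staging model with its data type."""
    lines = ["version: 2", "models:", f"  - name: stg_{table}", "    columns:"]
    for name, data_type in columns:
        lines.append(f"      - name: {name}")
        lines.append(f"        data_type: {data_type}")
    return "\n".join(lines) + "\n"


# Hypothetical example table.
columns = [("order_id", "BIGINT"), ("ordered_at", "TIMESTAMP")]
print(render_sources_yml("raw", "orders"))
print(render_staging_sql("raw", "orders", columns))
print(render_schema_yml("orders", columns))
```

The source name and table name passed to render_sources_yml are the same two values the staging model passes to source(), which is what ties the two files together.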

Question 3

Does it work with dbt Core and dbt Cloud?

Accepted Answer

Yes — the output is plain dbt YAML and SQL that drop straight into a models/ directory in any dbt project (Core or Cloud). Adjust the source database/schema placeholders in sources.yml to match your warehouse.

Question 4

How are the column types determined?

Accepted Answer

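The schema is inferred in your browser: DuckDB-WASM reads the file and reports each column's name and type. Those DuckDB types (for example VARCHAR or BIGINT) are written into the YAML data_type fields, so treat them as a starting point and adjust them where your warehouse uses different type names (BigQuery, for instance, uses STRING and INT64).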
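
If you want to refine the types for a specific warehouse, one option is a small lookup applied to the generated data_type values. The sketch below assumes BigQuery as the target and covers only a handful of common types; the mapping table and the function name are hypothetical and are not part of the tool.

```python
# Assumed, partial mapping from DuckDB type names to BigQuery type names.
# Extend or replace it for your own warehouse.
DUCKDB_TO_BIGQUERY = {
    "VARCHAR": "STRING",
    "BIGINT": "INT64",
    "INTEGER": "INT64",
    "DOUBLE": "FLOAT64",
    "BOOLEAN": "BOOL",
    "DATE": "DATE",
}


def to_bigquery_type(duckdb_type: str) -> str:
    """Return the mapped BigQuery type, or the input unchanged when no mapping is known."""
    return DUCKDB_TO_BIGQUERY.get(duckdb_type.strip().upper(), duckdb_type)


# Hypothetical columns as they might appear in the generated YAML.
for name, duckdb_type in [("order_id", "BIGINT"), ("customer", "VARCHAR")]:
    print(f"{name}: {to_bigquery_type(duckdb_type)}")
```

Types without an entry pass through unchanged, so parameterized types such as DECIMAL(18,3) are left for you to review by hand.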

Question 5

Is my file uploaded?

Accepted Answer

No. Everything runs locally via DuckDB-WASM. Only the column names and types are used to build the YAML and SQL — your data never leaves your device.

dbt sources.yml & staging model generator
