You can write tests for your DBT models by following DBT’s documentation.
When a pipeline runs, each DBT model block will run and then each test associated with that DBT model will be ran.
For example, if you add a DBT block to your pipeline for a DBT model named my_third_dbt_model
and a test file named test_my_second_dbt_model.sql
with the following folder structure:
demo_project/
| dbt/
| | demo/
| | | models/
| | | | example/
| | | | | my_third_dbt_model.sql
| | | tests/
| | | | example/
| | | | | my_third_dbt_model.sql
Then whenever your pipeline runs, the test in test_my_second_dbt_model.sql
will also be ran after
the DBT run for the model my_third_dbt_model
is complete.
Any errors in the DBT test will also fail the pipeline run. The output of the test will be logged and viewable when inspecting the pipeline’s run and logs.
In your tests folder, you must mirror the path of the model you want to test.
For example, if your model is named my_third_dbt_model
and if its located under the folder
named example
, then your test file must be located here:
models/example/my_third_dbt_model.sql
tests/example/my_third_dbt_model.sql
Note that the model file and test file have the same name (e.g. my_third_dbt_model.sql
)
and are nested under a folder with
identical names (e.g. example
).
The content of the DBT model named my_third_dbt_model
is:
SELECT 1 AS n_constant_number
The content of the test for my_third_dbt_model.sql
that will cause a test failure is:
SELECT n_constant_number
FROM {{ ref('my_third_dbt_model') }}
GROUP BY n_constant_number
HAVING (n_constant_number = 1)
Here is the sample output of a failed test for the above files:
23:57:21 Running with dbt=1.3.0
23:57:21 Found 3 models, 4 tests, 0 snapshots, 0 analyses, 289 macros, 0 operations, 0 seed files, 6 sources, 0 exposures, 0 metrics
23:57:21
23:57:23 Concurrency: 1 threads (target='dev')
23:57:23
23:57:23 1 of 1 START test my_third_dbt_model ........................................... [RUN]
23:57:23 1 of 1 FAIL 1 my_third_dbt_model ............................................... [FAIL 1 in 0.76s]
23:57:24
23:57:24 Finished running 1 test in 0 hours 0 minutes and 3.08 seconds (3.08s).
23:57:24
23:57:24 Completed with 1 error and 0 warnings:
23:57:24
23:57:24 Failure in test my_third_dbt_model (tests/example/my_third_dbt_model.sql)
23:57:24 Got 1 result, configured to fail if != 0
23:57:24
23:57:24 compiled Code at target/compiled/demo/tests/example/my_third_dbt_model.sql
23:57:24
23:57:24 Done. PASS=0 WARN=0 ERROR=1 SKIP=0 TOTAL=1
Traceback (most recent call last):
File "/home/src/mage_ai/cli/main.py", line 146, in <module>
main()
File "/home/src/mage_ai/cli/main.py", line 116, in main
ExecutorFactory.get_pipeline_executor(pipeline).execute(
File "/home/src/mage_ai/data_preparation/executors/pipeline_executor.py", line 28, in execute
asyncio.run(self.pipeline.execute(
File "/usr/local/lib/python3.10/asyncio/runners.py", line 44, in run
return loop.run_until_complete(main)
File "/usr/local/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete
return future.result()
File "/home/src/mage_ai/data_preparation/models/pipeline.py", line 233, in execute
await execution_task
File "/home/src/mage_ai/data_preparation/models/block/__init__.py", line 147, in run_blocks
await asyncio.gather(*remaining_tasks)
File "/home/src/mage_ai/data_preparation/models/block/__init__.py", line 94, in execute_and_run_tests
run_dbt_tests(
File "/home/src/mage_ai/data_preparation/models/block/dbt/utils/__init__.py", line 466, in run_dbt_tests
raise Exception('DBT test failed.')
Exception: DBT test failed.
The content of the test for my_third_dbt_model.sql
that will yield a passing test is:
SELECT n_constant_number
FROM {{ ref('my_third_dbt_model') }}
GROUP BY n_constant_number
HAVING (n_constant_number = 0)
Here is the sample output of a successful test for the above files:
23:59:56 Running with dbt=1.3.0
23:59:56 Found 3 models, 4 tests, 0 snapshots, 0 analyses, 289 macros, 0 operations, 0 seed files, 6 sources, 0 exposures, 0 metrics
23:59:56
23:59:57 Concurrency: 1 threads (target='dev')
23:59:57
23:59:57 1 of 1 START test my_third_dbt_model ........................................... [RUN]
23:59:58 1 of 1 PASS my_third_dbt_model ................................................. [PASS in 0.72s]
23:59:59
23:59:59 Finished running 1 test in 0 hours 0 minutes and 3.00 seconds (3.00s).
23:59:59
23:59:59 Completed successfully
23:59:59
23:59:59 Done. PASS=1 WARN=0 ERROR=0 SKIP=0 TOTAL=1