You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: _posts/2024-12-08-dbt-expectations.md
+9-7Lines changed: 9 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,17 +3,13 @@ layout: post
3
3
title: Using dbt expectations as part of a dbt build.
4
4
---
5
5
6
-
<i> The objective of the blog post is to give a practical overview of the data transformation testing tool Great Expectations/dbt expectations. </i>
6
+
<i> The objective of the blog post is to give a practical overview of the data transformation testing tool Great Expectations (specifically the open source version dbt expectations. </i>
7
7
8
8
### Why data testing?
9
9
10
-
Having been involved in data transformations in the past (e.g. moving data from on prem to the Azure cloud) I'm aware of the potential complexity of ensuring the quality of data from source to target, verifying the transformations at each stage and maintaining data integrity.
10
+
Having been involved in data transformations in the past (e.g. moving data from on prem to the Azure cloud) I'm aware of the potential complexity of ensuring the quality of data from multiple sources to target, verifying the transformations at each stage and maintaining data integrity.
11
11
12
-
Given
13
-
14
-
### Great Expectations
15
-
16
-
[Great Expectations.io](https://greatexpectations.io/) and its open source version [dbt expectations](https://github.com/calogica/dbt-expectations) are frameworks that enable automated tests to be embedded in ingestion/transformation pipelines.
12
+
In the context of these data testing challenges, [Great Expectations.io](https://greatexpectations.io/) and its open source version [dbt expectations](https://github.com/calogica/dbt-expectations) are frameworks that enable automated tests to be embedded in ingestion/transformation pipelines.
17
13
18
14
<GEImage>
19
15

@@ -72,6 +68,12 @@ In a specific example, the failing sql code is run directly against the table (i
72
68
73
69
### Lineage Graph (Data Flow DAG)
74
70
71
+
In the sections above we've looked at practical tests in dbt expectations which can be embedded in the data transformation pipeline, they can also be featured in the 'lineage graph' alongside the source tables, dimension, fact tables etc. to show where and when the tests run, what table it relates to etc.
72
+
73
+
Provided the test in question is included in the schema.yml and has a description value, we can see it included on the lineage graph generated by dbt:
74
+
75
+

76
+
75
77
Source data in green -> dependencies
76
78
77
79
Select what types of elements to include in the graph, refresh to only show selection
0 commit comments