This example shows all codes to create a Data Governance Pipeline using Dataplex and Big Query in Google Cloud Platform in a company of Renewable Energy in Brazil - Casa dos Ventos. This repository is based from this post on google here
The dashboard has 2 pages:
- Data Governance of BigQuery
- Dataplex Analysis
The dashboard uses 3 tables (1,2,3) and a view (4) that can be created using the following scripts:
- bigquery_tables_analysis
- bigquery_views_analysis
- dataplex_assets_analysis
- check_bq_datasets_in_dataplex
bigquery_tables_analysis - Table with information about all tables in organization (snapshot of the day).
bigquery_views_analysis - Table with information about all views in organization (snapshot of the day).
dataplex_assets_analysis - Table with information about all assets in organization's dataplex (snapshot of the day).
check_bq_datasets_in_dataplex - View with join between datasets and assets in dataplex to analyse new datasets not mapped in dataplex.
Create a copy of this Dashboard.
After clicking on the Copy button, you will find a message asking you to choose a new data source. Select the data sources created.
Click on create report. Rename the report (dashboard) to a name of your choice.
To learn about Casa dos Ventos, visit our website here