Skip to content

Glue Data Quality Example - Deploy to your AWS Account w/ Terraform to test

Notifications You must be signed in to change notification settings

jaredfiacco2/AWS_Glue_DQ

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

1. Update terraform.tfvars file

region          = "us-east-1"
tf_state_bucket = "<my-bucket-for-tfstate>"
glue_asset_bucket = "aws-glue-assets-<accountNumber>-us-east-1"
data_bucket   = "<my-bucket-for-taxi-data>"
glue_role_arn = "arn:aws:iam::<accountNumber>:role/GlueRole"

2. Add GitHub Environment Variable for "TF_API_TOKEN"

Follow these instructions to set things correctly.

3. Update GitHub Actions 'TF_CLOUD_ORGANIZATION' to your repo name

Ex. jaredfiacco2 if your username is jaredfiacco2

4. Run Glue Crawler

5. Run Glue Job

6. Check Data Quality

Releases

No releases published

Packages

No packages published