Skip to content

komminarlabs/terraform-aws-glue-job

Repository files navigation

terraform-aws-glue-job

Terraform module to create and manage a AWS Glue job

Requirements

Name Version
terraform >= 0.13.0
aws >= 3.10

Providers

Name Version
aws >= 3.10

Inputs

Name Description Type Default Required
name The name of the Glue job string n/a yes
script_location The S3 path to the script that is executed by the job string n/a yes
tags A mapping of tags to assign to all resources map(string) n/a yes
command_name The name of the job command. Defaults to glueetl string "glueetl" no
connections A list with connections for this job list(string) [] no
default_arguments A map with default arguments for the job map(string) {} no
glue_version The Glue version to use string "2.0" no
max_capacity The maximum number of data processing units that can be allocated number null no
max_retries The maximum number of times to retry the failing job number 0 no
number_of_workers The number of workers that are allocated when the job runs string null no
python_version The Python version (2 or 3) being used to execute a Python shell job string "3" no
role_arn An optional Glue execution role string null no
role_policy A valid Glue IAM policy JSON document string null no
schedule A cron expression used to specify the schedule for the glue trigger string null no
schedule_active Whether the glue trigger should be active bool true no
trigger_type The type ('ON_DEMAND' or 'SCHEDULED') of the trigger string null no
worker_type The type ('Standard' or 'G.1X' or 'G.2X') of predefined worker that is allocated when the job runs string null no

Outputs

Name Description
arn The Glue job ARN
id The Glue job name
trigger_arn The Glue trigger ARN
trigger_id The Glue trigger name

Requirements

Name Version
terraform >= 1.3
aws >= 4.62.0

Providers

Name Version
aws >= 4.62.0

Modules

No modules.

Resources

Name Type
aws_cloudwatch_log_group.default resource
aws_glue_job.default resource
aws_glue_trigger.default resource
aws_iam_role.default resource
aws_iam_role_policy.default resource
aws_iam_role_policy_attachment.default resource
aws_iam_policy_document.default data source

Inputs

Name Description Type Default Required
name The name of the Glue job string n/a yes
script_location The S3 path to the script that is executed by the job string n/a yes
tags A mapping of tags to assign to all resources map(string) n/a yes
command_name The name of the job command. Defaults to glueetl string "glueetl" no
connections A list with connections for this job list(string) [] no
default_arguments A map with default arguments for the job map(string) {} no
glue_version The Glue version to use string "4.0" no
kms_key_id The kms key id of the AWS KMS Customer Managed Key to be used to encrypt the log data string null no
log_retention_days The cloudwatch log group retention in days number 365 no
max_capacity The maximum number of data processing units that can be allocated number null no
max_retries The maximum number of times to retry the failing job number 0 no
number_of_workers The number of workers that are allocated when the job runs string null no
python_version The Python version (2 or 3) being used to execute a Python shell job string "3" no
role_arn An optional Glue execution role string null no
role_policy A valid Glue IAM policy JSON document string null no
schedule A cron expression used to specify the schedule for the glue trigger string null no
schedule_active Whether the glue trigger should be active bool true no
security_configuration The name of the Security Configuration to be associated with the job string null no
trigger_type The type ('ON_DEMAND' or 'SCHEDULED') of the trigger string null no
worker_type The type ('Standard' or 'G.1X' or 'G.2X') of predefined worker that is allocated when the job runs string null no

Outputs

Name Description
arn The Glue job ARN
id The Glue job name
trigger_arn The Glue trigger ARN
trigger_id The Glue trigger name