Glue scripts for converting AWS Service Logs for use in Athena
Updated Feb 1, 2024 - Python
Build and deploy a serverless data pipeline on AWS with no effort.
The Sensitive Data Protection on AWS solution allows enterprise customers to create data catalogs and to discover, protect, and visualize sensitive data across multiple AWS accounts. The solution eliminates the need for manual tagging to track sensitive data such as Personally Identifiable Information (PII) and classified information.
Extract, transform, and load data for analytic processing using AWS Glue
Terraform configuration that creates several AWS services, uploads data to S3, and starts the Glue Crawler and Glue Job.
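The crawl-then-run sequence such a configuration kicks off can be sketched in Python with boto3. This is a minimal illustration, not code from the repo: the crawler and job names are placeholders, and the `glue` parameter exists so the function can be exercised with a stub client instead of a live AWS account.

```python
def run_glue_pipeline(crawler_name, job_name, glue=None):
    """Start a Glue crawler, then a Glue job, returning the job run id.

    `glue` may be any object with the boto3 Glue client's
    start_crawler/start_job_run methods; when omitted, a real
    client is created (requires AWS credentials).
    """
    if glue is None:
        import boto3  # deferred so the function can be unit-tested with a stub
        glue = boto3.client("glue")
    # Crawl the raw data in S3 so the job can read the catalogued tables.
    glue.start_crawler(Name=crawler_name)
    # Kick off the ETL job. In practice you would wait for the crawler to
    # finish first (e.g. poll get_crawler until State returns to READY).
    run = glue.start_job_run(JobName=job_name)
    return run["JobRunId"]
```

In a Terraform-driven setup the same sequencing is usually expressed declaratively (a trigger or workflow resource) rather than in application code; the sketch above just makes the ordering explicit.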
A data pipeline built to serve a business team.
Terraform module to create and manage an AWS Glue job
Terraform module which creates Glue Job resources on AWS.
ETL using application streaming and creating a Data Lake
Used AWS Glue to perform ETL operations and load the resulting data into Amazon Redshift. In the second phase, used AWS CloudWatch rules and Lambda to run Glue jobs automatically.
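A scheduled trigger like this is typically a CloudWatch Events (EventBridge) rule invoking a small Lambda handler that starts the Glue job. A minimal sketch, assuming a job name supplied via the environment (the `redshift-load-job` default is a placeholder, not from the repo):

```python
import os

# Job name comes from the Lambda environment; the default is a placeholder.
GLUE_JOB_NAME = os.environ.get("GLUE_JOB_NAME", "redshift-load-job")

def lambda_handler(event, context, glue=None):
    """Entry point invoked by a CloudWatch Events rule on a schedule.

    Starts the Glue job and returns its run id so the result is
    visible in the Lambda invocation logs.
    """
    if glue is None:
        import boto3  # deferred so the handler can be unit-tested with a stub
        glue = boto3.client("glue")
    run = glue.start_job_run(JobName=GLUE_JOB_NAME)
    return {"job": GLUE_JOB_NAME, "run_id": run["JobRunId"]}
```

The Lambda's execution role needs `glue:StartJobRun` permission on the target job for this to succeed.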
This project was designed to establish an AWS architecture resulting from the migration of a database hosted in an existing local (on-premises) environment.
This project aims to automate infrastructure creation.
This project aims to analyze the popularity of YouTube content across different regions by leveraging datasets sourced from Kaggle. It employs a systematic approach to data preprocessing, cleaning, and analysis using various AWS (Amazon Web Services) services including S3, Lambda, Glue, and others, to build an automated ETL pipeline.
Glue Data Quality Example - Deploy to your AWS Account w/ Terraform to test