AWS hosted enterprise Data Lake with both batch and realtime data pipelines.
-
Updated
Jul 26, 2020
AWS hosted enterprise Data Lake with both batch and realtime data pipelines.
A simple shell script to delete multiple tables based on table name prefix.
politician stock market activity web scraping project
Data Lakehouse solution for data produced by STEDI Step Trainer sensors and the mobile app so that it can train the machine learning module.
a toolkit that provides an object-oriented interface for working with parquet datasets on AWS
Implementation of ETL data pipeline to load data from S3 to snowflake and refresh tableau datasource in AWS
Data lake project for a US based Insurance Company
Intro to streaming data with Kafka, Spark and AWS Glue
The Practical Data Science Specialization brings together these disciplines using purpose-built ML tools in the AWS cloud. It helps you develop the practical skills to effectively deploy your data science projects and overcome challenges at each step of the ML workflow using Amazon SageMaker. This Specialization is designed for data-focused develop
AWS has Athena service which can query structured data from S3. The DynamoDB is managed NoSQL database. So we have to convert Unstructured data to Structured data. The code written in python & performs this objective.
AWS Athena, Glue Database, Glue Crawler deployment on existing S3 bucket through Serverless (sls) Framework.
This is a sample project to demonstrate how to update DynamoDB with AWS Glue
Neste projeto, usaremos um conjunto de dados de comércio eletrônico para simular os registros de compras do usuário, visualizações de produtos, histórico de carrinho e jornada do usuário na plataforma online para criar dois pipelines analíticos, Lote e Tempo Real.
An end-to-end solution for managing and analyzing YouTube video data from Kaggle, leveraging AWS services and visualized through Quicksight and Tableau
Project that incorporates TerraForm to create AWS infrastructure using S3, Lambda, and DynamoDB tables for ocean and river data 🐢
AWS Athena, Glue Database, Glue Crawler and S3 buckets deployment through CloudFormation stack on AWS console.
Add a description, image, and links to the aws-glue topic page so that developers can more easily learn about it.
To associate your repository with the aws-glue topic, visit your repo's landing page and select "manage topics."