Hackolade plugin for AWS Glue Data Catalog
Updated Jun 14, 2024 - JavaScript
PySpark script to aggregate small parquet files in a prefix into larger files. Designed to be run on AWS Glue
Process DynamoDB change streams via AWS Glue with Iceberg to keep a copy of a collection in S3 up to date
Host a Docker container for the Spark history server / Spark UI of AWS Glue jobs
📊🌈🏛 This project develops a data warehouse for a bank using Amazon Redshift, VPC, Glue, S3 and DBT, following a ⭐ Star Schema architecture. The goal is to store, manage, and optimize data to support decision making and reporting 🏵️
Dataset collection and preprocessing framework for NLP extreme multitask learning
A GLUE project for comparative genomic analysis of circular Rep-encoding single-stranded DNA (CRESS DNA) viruses
An AWS Glue Studio Connector for Accessing Exasol Database
The aim of this project is to combine secure data management with insightful analysis of YouTube video categories and trends using an ETL pipeline.
Apache Hudi examples designed to be run on AWS Glue via Glue Jobs
KANs for text classification on GLUE tasks
A coding-free framework built on PyTorch for reproducible deep learning studies. 🏆25 knowledge distillation methods presented at CVPR, ICLR, ECCV, NeurIPS, ICCV, etc. are implemented so far. 🎁 Trained models, training logs and configurations are available for ensuring reproducibility and benchmarking.
A GLUE project for comparative genomic analysis of flaviviruses