Pinned
-
Data-Engineering-on-GCP (Public): A repo containing auto-triggered Airflow ETL tasks that flatten datasets located in GCP storage and create analytical views in BigQuery
Python 1
-
data_analysis_tool (Public): A tool for analyzing any structured or semi-structured dataset located in cloud storage or any SQL RDBMS. Analysis results can be gathered in seconds.
Jupyter Notebook 1
-
CDC_stream_data_simulation (Public): A Dataflow task simulation that combines CDC formation with streaming data reads and writes
Jupyter Notebook 1
-
etl-s3-airflow-snowflake-powerbi-marketing-data (Public): An end-to-end data analytics project that consumes data from AWS S3, runs an ETL process, and creates a data mart on Snowflake, followed by a Power BI report built on that data mart
Python 1
-
sports_app_data_analysis_tool (Public): A repo with a Python class structure that facilitates analysis of a semi-structured (JSON) football dataset.
Python
-
data_engineering_databricks_pyspark (Public): This repo contains activities related to ETL, data warehouse creation, and advanced analytics