Skip to content
View sauravvenkat's full-sized avatar
Block or Report

Block or report sauravvenkat

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. SparkDataLakes SparkDataLakes Public

    This is an ETL pipeline taking data from S3 data lake, transformed using Spark, and finally uploaded back to S3 into partitioned parquet file format.

    Python 1

  2. CloudDataWarehouse CloudDataWarehouse Public

    This is an ETL pipeline taking source data from an open source music database stored in Amazon S3, transforming the data, and then finally uploading the data into Amazon Redshift.

    Python

  3. Instacart-Market-Basket Instacart-Market-Basket Public

    This is an Exploratory Analysis of the Instacart Market Basket Dataset on Kaggle: https://www.kaggle.com/c/instacart-market-basket-analysis

    Jupyter Notebook

  4. Capital_Bike_Share Capital_Bike_Share Public

    This is an Exploratory Data Analysis of the publicly available Capital Bike Share Dataset

    Jupyter Notebook

  5. DataVisualization DataVisualization Public

    These are data visualizations I've created using Python and D3.js

    HTML