Popular repositories
-
Data-Capture-and-Analysis-of-Cab-Rides
Public
Designed and implemented a big data analytics solution for a mobility startup on AWS using Hadoop, Spark, Hive, and Kafka. Developed streaming and batch data pipelines to ingest, process, and store…
Python
-
Retail-Data-Analysis
Public
Developed a real-time streaming analytics pipeline using Apache Spark to calculate and store KPIs for e-commerce sales data, including total volume of sales, orders per minute, rate of return, and …
Python
-
Linear-Regression
Public
Built a multiple linear regression model to predict the demand for shared bikes in the American market using Python, Pandas, NumPy, and Scikit-Learn.
Jupyter Notebook
-
MapReduce
Public
Completed a big data project using Hadoop, HBase, and Sqoop to ingest, process, and analyze a large taxi ride dataset on an AWS EMR cluster. Developed MapReduce code to perform a variety o…
Python
-
ETL-Project
Public
Developed a batch ETL pipeline to extract, transform, and load transactional data from RDS to Redshift. Used Sqoop to ingest data from RDS to HDFS, PySpark to transform and load data to S3, and Red…
Jupyter Notebook