📁 Extract, Transform, Load (ETL) 👷 refers to a process in database usage, especially in data warehousing. This repository contains a starter kit featuring ETL-related work.
Updated Mar 20, 2017 - Scala
PageRank on Hadoop
MapReduce in Node.js
Lightweight and extensible library to execute MapReduce-like jobs in Python
MapReduce job development, RDD programming, medical data management, sales analysis, and efficient data integration for big data analysis. Spark: big data processing, Sqoop integration, and Spark Structured Streaming for real-time data.
MapReduce jobs in Python to extract insights from NYC Taxi data
MapReduce framework based on Storm that is flexible enough for any MapReduce work. Built with a number of workers and a single master. Uses BerkeleyDB as temporary data storage when processing big data.
Recommends movies to users based on their profiles and the ratings of other users.
MapReduce concepts: secondary sort, counters, multiple MapReduce jobs
Performed business operations using big data technologies: AWS EMR, AWS RDS (MySQL), Hadoop, Apache Sqoop, Apache HBase, MapReduce
Cloud computing coursework on big data, etc.
Cloud and big data 2017/2018: Programming Assignments
Big data technologies that I have experimented with
Hadoop jobs written in Go and run using Hadoop on Docker containers
Big Data, Hadoop, and MapReduce in Python. MapReduce jobs using the mrjob library and the Amazon Elastic MapReduce service.
Beta versions/student projects
Hadoop MapReduce jobs that derive statistics from the Yelp dataset
Big Data Processing and Analytics course term project.
Design and implementation of MapReduce jobs that analyze a COVID-19 dataset created by Our World in Data