Skip to content

This repository contains all the programs that I have written for my university course IE494 - Big Data Processing. This includes concepts like hadoop map-reduce, spark RDDs, spark SQL etc.

Notifications You must be signed in to change notification settings

devanshk01/big-data-processing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Big Data Processing

This repo holds all of my course work for Big Data Processing (IE494).

Structure

big-data-processing
└── map-reduce-programs
    ├── Lab-1
    ├── Lab-2
    └── Lab-3
└── spark-rdd-programs
    └── Lab-4
// more labs/topics will be added soon!

(I’m still working through the course and adding more as I go — feel free to explore what’s there already!)

How to use

  1. Browse through the <topics>/Lab- <n> folders.
  2. Inside each, run the code according to the setup instructions or the file comments.
  3. New labs or topics will land in that <topics> folder as I progress.

Contribute / Collab

Feel free to fork the repo and add new programs, suggest changes, comment, or open a PR — I’ll review and merge useful contributions.

Check this out: my Hadoop setup tutorial repo is here: hadoop-on-windows11 — also see my other repos linked from my GitHub profile.


Enjoy exploring,
Me (Devansh)

About

This repository contains all the programs that I have written for my university course IE494 - Big Data Processing. This includes concepts like hadoop map-reduce, spark RDDs, spark SQL etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages