Big Data Capstone Project

Note: This is a clone of the original project. The code I wrote for this project is mainly in the src/main/scala/detector folder, plus the WeightedRandomizer.scala code.

Big Data Capstone Project

Project Description

In this project each team is tasked with creating random e-commerce transaction data according to the below schema, and publishing it to a topic stored on a shared Kafka Broker. After each team publishes their data to their respective topics the other team consumes the data from the other team's topic. This means in our case we're publishing data to the "team2" topic and consuming data from the "team1" topic. After exchanging and cleaning the data we're performing analysis on the data via our pattern detection classes and exporting the data to create visualizations.

Team Members

Adam Gore @Adam-Gore96
Alex White @AlexWhite252
Brandon Cho @BrandonYCho
Brian Vegh @brianvegh
Douglas Lam @Douglas-Lam
Evan Laferriere @evanlaferriere
Jeffrey Hafner @JeffH001
Md Tahmid Khan @MdTahmidKhan
Patrick Froerer @PJFroerer
Rudy Esquerra @rudyesquerra

Technologies

Scala 2.13.8
Apache Spark 3.2.0
Apache Kafka 3.1.0

Schema

Field name	Description
order_id	Order Id
customer_id	Customer Id
customer_name	Customer Name
product_id	Product Id
product_name	Product Name
product_cateogry	Product Category
payment_type	Payment Type
qty	Quantity ordered
price	Price of the product
datetime	Date and time when order was placed
country	Customer Country
city	Customer City
ecommerce_website_name	Site from where order was placed
payment_txn_id	Payment Transaction Confirmation Id
payment_txn_success	Payment Success or Failure
failure_reason	Reason for payment failure

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
KafkaAnalyzer		KafkaAnalyzer
consumerOutput		consumerOutput
detectorOutput		detectorOutput
input		input
output		output
src/main		src/main
.gitignore		.gitignore
README.md		README.md
build.sbt		build.sbt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Big Data Capstone Project

Table of Contents

Project Description

Team Members

Technologies

Schema

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Big Data Capstone Project

Table of Contents

Project Description

Team Members

Technologies

Schema

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages