
JeffH001/Project_3


Note: This is a clone of the original project. The code I wrote for this project is mainly in the src/main/scala/detector folder, plus the WeightedRandomizer.scala code.

Big Data Capstone Project

Project Description

In this project, each team generates random e-commerce transaction data following the schema below and publishes it to a topic on a shared Kafka broker. Each team then consumes the data from the other team's topic: in our case, we publish to the "team2" topic and consume from the "team1" topic. After exchanging and cleaning the data, we analyze it with our pattern detection classes and export the data to create visualizations.
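As an illustration of the data generation step, here is a minimal sketch of weighted random selection, which is one way to make generated values (for example, payment types) appear with uneven frequencies. It is not the repository's WeightedRandomizer.scala; the object name `WeightedPick`, its `pick` method, and the idea of passing weights as a `Seq` of pairs are all assumptions made for this example.

```scala
import scala.util.Random

// Hypothetical helper for illustration only; not the project's WeightedRandomizer.
object WeightedPick {
  /** Picks one item with probability proportional to its weight. */
  def pick[A](weighted: Seq[(A, Double)], rng: Random): A = {
    require(weighted.forall(_._2 >= 0.0), "weights must be non-negative")
    val positive = weighted.filter(_._2 > 0.0)
    require(positive.nonEmpty, "at least one weight must be positive")
    val total = positive.map(_._2).sum
    // Draw a point in [0, total) and walk the cumulative weights until we pass it.
    val target = rng.nextDouble() * total
    var cumulative = 0.0
    positive.find { case (_, weight) =>
      cumulative += weight
      target < cumulative
    }.map(_._1)
      .getOrElse(positive.last._1) // fallback in case of floating-point rounding
  }
}
```

A call such as `WeightedPick.pick(Seq("Card" -> 0.7, "Wallet" -> 0.3), new Random())` would return "Card" roughly 70% of the time; the values and weights here are made up for the example. Passing the `Random` instance in explicitly makes the generator easy to seed for repeatable test data.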

Team Members

Technologies

  • Scala 2.13.8
  • Apache Spark 3.2.0
  • Apache Kafka 3.1.0

Schema

Field name              Description
order_id                Order Id
customer_id             Customer Id
customer_name           Customer Name
product_id              Product Id
product_name            Product Name
product_category        Product Category
payment_type            Payment Type
qty                     Quantity ordered
price                   Price of the product
datetime                Date and time when order was placed
country                 Customer Country
city                    Customer City
ecommerce_website_name  Site from where order was placed
payment_txn_id          Payment Transaction Confirmation Id
payment_txn_success     Payment Success or Failure
failure_reason          Reason for payment failure
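The schema above could be modeled in Scala roughly as follows. This is a sketch under stated assumptions, not the project's actual code: the names `Transaction` and `TransactionParser`, the field types (`Int` for the quantity, `Double` for the price, `String` for everything else), and the comma-separated wire format with fields in schema order are all guesses made for illustration. The parser also shows one simple form of cleaning by rejecting malformed rows.

```scala
import scala.util.Try

// Field names mirror the schema in camelCase; the types are assumptions.
final case class Transaction(
  orderId: String,
  customerId: String,
  customerName: String,
  productId: String,
  productName: String,
  productCategory: String,
  paymentType: String,
  qty: Int,
  price: Double,
  datetime: String,
  country: String,
  city: String,
  ecommerceWebsiteName: String,
  paymentTxnId: String,
  paymentTxnSuccess: String,
  failureReason: String
)

// Hypothetical parser; assumes one comma-separated record per line, in schema order.
object TransactionParser {
  private val FieldCount = 16

  /** Returns Some(record) for a well-formed line, None for a row that should be dropped. */
  def fromCsv(line: String): Option[Transaction] = {
    // A limit of -1 keeps trailing empty fields, such as a blank failure reason.
    val f = line.split(",", -1).map(_.trim)
    if (f.length != FieldCount) None
    else
      for {
        qty   <- Try(f(7).toInt).toOption
        price <- Try(f(8).toDouble).toOption
        if qty > 0 && price >= 0.0
      } yield Transaction(f(0), f(1), f(2), f(3), f(4), f(5), f(6), qty, price,
        f(9), f(10), f(11), f(12), f(13), f(14), f(15))
  }
}
```

Returning `Option` keeps the cleaning step explicit: rows with the wrong number of fields, a non-numeric quantity or price, or a non-positive quantity come back as `None` and can be filtered out or counted. A plain `split` does not handle quoted commas inside values, so real data with such values would need a proper CSV or JSON parser.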

About

The final Revature "Big Data" project, intended for producing data, sharing that data over Kafka, receiving another team's data over Kafka, cleaning the data, and analyzing that data for patterns.
