USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
-
Updated
Sep 16, 2024 - Python
USC DSCI 553 - Foundations & Applications of Data Mining - Spring 2024 - Prof. Wei-Min Shen
Accident analysis project: modular Python application for analyzing road accident data, providing valuable insights to improve road safety.
This project utilizes the Spark Framework and GraphFrames library to implement the Girvan-Newman algorithm, detecting communities in social networks by analyzing user connections based on common business reviews and optimizing modularity through iterative edge removal.
Final submission. Topic: Apache Spark's Pyspark API
Graph analytics for telecom customer churn prediction
Hybrid Girvan Newman. Code for the "A Distributed Hybrid Community Detection Methodology for Social Networks" paper.
hadoop (GFS)、mapreduce programming model、prestro、apache spark etc
Objectives: Using pyspark, MLlib and graphframes libraries, perform 1) classification and custering tasks using RandomF and Kmeans and 2) graph analysis tasks. This material is from UIUC MCS coursework.
Graph coloring example using GraphFrames of Apache Spark framework
Analysis of Etherum contracts, transactions, gas, scams and scammers' graph
Community detection Based on Girvan-Newman Algorithm
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
A Pyspark implementation of the CNGF Algorithm used for Link Prediction
Add a description, image, and links to the graphframes topic page so that developers can more easily learn about it.
To associate your repository with the graphframes topic, visit your repo's landing page and select "manage topics."