This repository contains the code and projects I developed during CSE-6242, mainly the following:
- Set 1:
  - Analyzing a Twitter dataset
  - Gephi
  - SQLite
  - D3 Warmup
  - OpenRefine
  - Flask and jQuery
- Set 2: D3 Graphs and Visualization
  - Designing a Good Table and Visualizing Data with Tableau
  - Force-directed graph layout
  - Scatter plots
  - Heatmap and Select Box
  - Visualizing Hierarchical Data with a Dendrogram
  - Interactive visualization
  - Choropleth Map of World Population
- Set 3: Hadoop, Spark, Pig and Azure
  - Analyzing a Graph with Hadoop/Java
  - Analyzing a Large Graph with Spark/Scala
  - Analyzing a Large Amount of Data with Pig on AWS
  - Analyzing a Large Graph using Hadoop on Microsoft Azure
  - Regression: Automobile price prediction using Azure ML Studio
  - Testing ARGO graph visualization (BETA)
- Set 4: Scalable PageRank via Virtual Memory (MMap), Random Forest, Scikit-Learn
  - Scalable single-PC PageRank on a 70M-edge graph
  - Random Forest Classifier
  - Using Scikit-Learn