This project is a follow up of my previous GW2 analysis. On this one, I executed the same ML algorithms used previously, using Spark
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.

Guild Wars 2 PvP Analysis - machine learning on Spark


As a follow up of the report I did previously, titled PvP in Guild Wars 2: A data analysis, I decided to run the same machine learning algorithms used in said report, in another framework, Apache Spark, to see how they differ from each other. The original purpose why this was done on the previous report, was to verify if it is possible to predict the outcome of a match using the composition of the team.

The tests were done on Apache Spark 1.4.0, using the Python API.

Link to the report: Guild Wars 2 PvP Analysis - machine learning on Spark - report


This repository contains the Python script used in the analysis, the dataset (in .txt) and a codebook explaining the values of the dataset. Regarding the Python script, the SparkConf parameters, are the ones I found suitable for the analysis, however feel free to change them (in particular the master URL, it has to be the URL of your cluster).