Skip to content

Amatofrancesco99/dsbda2022

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

84 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Science & Big Data Analytics (dsbda) project - 2022 (UniPV)

Objective

The goal of the project is to study the Steam platform gaming community using both standard python, and Big Data tools.
More specifically, considering the users' reviews the objective is to understand, with the statement of some hypotheses, relevant correlation about the players behavior (e.g., identify factors such that a player like a video game, if the user is loyal to the same genre, ...).

Procedure

The stated hypotheses have firstly been studied using a small portion of the overall Big Data source (obtained using snippet of code written with Big Data tools).
Once seen if the selected hypotheses are statistically significant using the small portion of data, the same hypotheses were replicated on the whole data source (using Spark/HDFS/...) and checked if still relevant.

Datasets

The used datasets (Kaggle) are the following ones:

About

The goal of this project is to study the Steam platform gaming community behavior using both standard python, and Big Data tools.

Topics

Resources

License

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •