About the Project

Table of Contents

About The Project
Business Objective
Business Metrics
Dataset
Methods
Exploratory Data Analysis
Preprocessing and Featuring Data
Content-Based Recommendation System
Neighborhood Collaborative Filtering
Context-Aware Factorization Machines
Alternating Least Square
- Alternating Least Square Model
Conclusion
References

About the Project

Steam is currently the largest digital distribution service for PC gaming, operated by Valve Corporation. According to the annual statistics report for 2022, Steam reached a peak online user count of 33 million, with an impressive 44.7 billion gigabytes of game downloads over the year. One of the key reasons for Steam's rapid growth is its strong searchability in the store, which was highlighted in the report. They are actively working on a new recommendation system driven by machine learning to help users find games that match their personal preferences. While the algorithm is just one part of the searchability solution, they are also developing more interactive and user-friendly features, continually evaluating the overall store design.

On the flip side, the recommender system in marketplace is commonly also affected by the Matthew Effect, which means that games developed by large game companies receive more advertising budgets and, consequently, become very popular. Popular games often appear at the top of store web pages and attract more users, including you and your friends, to make purchases. In contrast, games developed by small studios or individual developers may not have the same advertising luck. Some mid-budget users are more inclined to wait for their preferred games to be launched a few months later or simply wait for discounts on their preferred games. This can lead these typical users to become inactive or even churn, even though there may be hidden gems that match their preferences based on user activity logs or given ratings for various games.

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
pics		pics
.gitignore		.gitignore
Rendle2010FM.pdf		Rendle2010FM.pdf
readme.MD		readme.MD
recsys.ipynb		recsys.ipynb

Column Name	Describe	dtype
user_id	User unique identifier	object
app_id	Steam's game unique identifier	object
date	Steam's game release date	datetime
is_recommended	User vote up the game if True	bool
hours	Hour of user spent on game	float64
title	Steam games name	object
rating	Rating given by user on played game	object
price_final	price after discounted	float64
developer	Steam's game developer	object
publisher	Steam's game publisher	object
Categories	Steam's game category	object
genre	Steam's game genre	object
popular_tags	Steam's game popular tags	object
game_description	Steam's game description	object

DandiMahendris/Context-Aware-Factorization-Machine-Recommendation-System

Folders and files

Latest commit

History

Repository files navigation

About the Project

Business Objective

Business Metrics

Dataset

Methods

Exploratory Data Analysis

Ratings

Price Rates

Release date range

Preprocessing and Featuring Data

1. Content-Based Recommendation System

Text Preprocessing: Tokenization

Text Preprocessing: Stemming

Text Preprocessing: Term-Frequency (TF) and Inverse Document Frequency (IDF) Vectorizer

Cosine Similarity

Content Similar Recommendation

Novelty

Novelty and Similar Recommendation

Serendipity

Serendipity and Similar Recommendation

2. Neighborhood Collaborative Filtering

Utility Matrix

Item-to-Item Neighbor CF

User-to-User Neighbor CF

Recommendation System Evaluation: RMSE

3. Context-Aware Factorization Machines

Factorization Machines

FM: Regression Optimization Tasks

FM: Regression Model

FM Regression Recommendation

FM: Classification Objective

FM: Classification Models

FM: Classification Models Performance

Alternating Least Square

ALS Models

Conclusion

References

About

Resources

Stars

Watchers

Forks

Languages