Recommendation System Competition Platform

A data mining course competition platform for designing recommendation algorithms that maximize revenue through multi-iteration learning.

Overview

This platform uses the Sim4Rec simulation framework to:

- Generate synthetic users and items with various properties
- Simulate user responses to recommendations over multiple iterations
- Evaluate recommendation algorithms based on revenue generation
- Compare performance across different algorithms

Students will compete to design recommendation algorithms that earn the most money by recommending the right items to the right users. The system implements a multi-iteration learning environment where algorithms can adapt over time based on user feedback.

Problem Setting: Multi-Iteration Ranking Task

The competition focuses on a sequential recommendation problem where:

- Each recommendation algorithm interacts with users across multiple iterations
- Algorithms make top-k item recommendations to users in each iteration
- User responses (purchases) generate revenue based on item prices
- Algorithms can learn from past interactions to improve future recommendations
- The goal is to maximize cumulative revenue across all iterations

This setup mirrors how recommendation systems operate in production environments, where models are updated continuously as new user interactions arrive.

Training and Evaluation Setup

The platform uses a train-test split evaluation approach:

Training Phase:
- Algorithms interact with users for a fixed number of training iterations
- Recommenders can be retrained after each iteration based on new feedback
- This phase allows algorithms to learn user preferences
Testing Phase:
- Algorithms make recommendations for additional test iterations
- No retraining occurs during the testing phase
- Performance on test iterations determines final evaluation

This approach tests both an algorithm's ability to learn from interactions and its generalization performance on unseen data.
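
The two phases can be sketched as a single loop in which retraining is switched off once the test iterations begin. This is a minimal, self-contained illustration and not the platform's code: `ToyRecommender`, `simulate_responses`, `run_experiment`, and the randomly drawn prices and purchase probabilities are all hypothetical stand-ins for the Sim4Rec simulator and its data.

```python
import random


class ToyRecommender:
    """Hypothetical stand-in: ranks items by how often they were purchased."""

    def __init__(self, n_items):
        self.counts = [0] * n_items

    def fit(self, log):
        # log is a list of (user, item, response) tuples collected so far
        self.counts = [0] * len(self.counts)
        for _user, item, response in log:
            self.counts[item] += response

    def predict(self, user, k):
        # Same top-k list for every user: most purchased items first
        order = sorted(range(len(self.counts)), key=lambda i: -self.counts[i])
        return order[:k]


def simulate_responses(rng, items, buy_prob):
    """Hypothetical user model: each shown item is bought independently."""
    return [1 if rng.random() < buy_prob[i] else 0 for i in items]


def run_experiment(n_users=50, n_items=10, k=3,
                   train_iters=5, test_iters=2, seed=0):
    rng = random.Random(seed)
    prices = [rng.uniform(1.0, 20.0) for _ in range(n_items)]
    buy_prob = [rng.uniform(0.05, 0.6) for _ in range(n_items)]
    model = ToyRecommender(n_items)
    log = []
    revenue = {"train": 0.0, "test": 0.0}
    fits = 0

    for iteration in range(train_iters + test_iters):
        phase = "train" if iteration < train_iters else "test"
        for user in range(n_users):
            recs = model.predict(user, k)
            responses = simulate_responses(rng, recs, buy_prob)
            for item, response in zip(recs, responses):
                revenue[phase] += prices[item] * response
                log.append((user, item, response))
        if phase == "train":
            model.fit(log)  # retraining happens only in the training phase
            fits += 1
    return revenue, fits
```

Calling `run_experiment()` returns the revenue accumulated in each phase together with the number of times the model was refit, which equals the number of training iterations.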

Metrics

The platform evaluates recommenders using the following metrics:

Total Revenue (Primary): Sum of prices for all purchased items
```
Revenue = Sum(price * response)
```
where response is 1 for purchased items and 0 otherwise

Discounted Revenue: Revenue weighted by recommendation position
```
Discounted Revenue = Sum(price * response * (1/log2(rank + 1)))
```
where rank is the 1-based position of the item in the recommendation list

Precision@K: Fraction of the K recommended items that were purchased
```
Precision@K = (# of recommended items that were purchased) / K
```

NDCG@K: Normalized Discounted Cumulative Gain, which measures ranking quality
```
DCG = Sum(relevance_i / log2(i+1))
IDCG = DCG of the ideal ranking
NDCG = DCG / IDCG
```
where i is the 1-based position in the recommendation list

MRR: Mean Reciprocal Rank, the average over users of the reciprocal rank of the first purchased item
```
MRR = Average(1/rank of first relevant item)
```

Hit Rate: Fraction of users for whom at least one recommended item was purchased
```
Hit Rate = (# of users with at least one purchased item) / (# of users)
```
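
The formulas above can be written out in a few lines of plain Python. This is a sketch for checking your own understanding, not the platform's `evaluation.py`; the function names are hypothetical. It makes three assumptions: each list of responses is ordered by recommendation rank, the ideal ranking for NDCG is taken over the recommended list only, and a user with no purchase contributes a reciprocal rank of 0.

```python
import math


def revenue(prices, responses):
    # Sum of prices over purchased items (response is 1 or 0)
    return sum(p * r for p, r in zip(prices, responses))


def discounted_revenue(prices, responses):
    # Position discount 1 / log2(rank + 1) with 1-based rank
    return sum(
        p * r / math.log2(rank + 1)
        for rank, (p, r) in enumerate(zip(prices, responses), start=1)
    )


def precision_at_k(responses, k):
    return sum(responses[:k]) / k


def ndcg_at_k(responses, k):
    dcg = sum(r / math.log2(i + 1)
              for i, r in enumerate(responses[:k], start=1))
    # Assumption: the ideal ranking reorders the same recommended list
    ideal = sorted(responses, reverse=True)[:k]
    idcg = sum(r / math.log2(i + 1) for i, r in enumerate(ideal, start=1))
    return dcg / idcg if idcg > 0 else 0.0


def reciprocal_rank(responses):
    for rank, r in enumerate(responses, start=1):
        if r:
            return 1.0 / rank
    return 0.0  # assumption: no purchase counts as 0


def mrr(responses_per_user):
    return (sum(reciprocal_rank(r) for r in responses_per_user)
            / len(responses_per_user))


def hit_rate(responses_per_user):
    hits = sum(1 for r in responses_per_user if any(r))
    return hits / len(responses_per_user)
```

For a single user shown three items priced 10, 20, and 5 who buys the second and third, `revenue([10, 20, 5], [0, 1, 1])` is 25, while the discounted revenue is lower because both purchases sit below the top position.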

Installation

Setup

Clone this repository:

git clone https://github.com/anonymous-during-review/RevMax-RecSys
cd RevMax-RecSys

Install OpenJDK 17 (the Apache Spark 3.x releases support Java versions only up to 17).
- Check that Java 17 was installed successfully:
```
java -version
```
Install uv (Python package manager).
Run any of the Python files with uv run (dependencies will be installed automatically), for example:

uv run recommender_analysis_visualization.py

If you want to run a Jupyter notebook:

uv run jupyter lab

Getting Started

The main execution flow is in recommender_analysis_visualization.py, which:

- Generates synthetic user and item data
- Performs exploratory data analysis
- Sets up and evaluates baseline recommenders
- Visualizes performance metrics

Run the analysis script to get started:

uv run recommender_analysis_visualization.py

This will:

- Generate a synthetic dataset with users and items
- Create visualizations of user segments, item categories, and interactions
- Run multiple baseline recommenders (Random, Popularity, Content-Based)
- Compare their performance using train-test evaluation
- Generate visualizations of recommender performance

Developing Your Algorithm

Start with the MyRecommender class in recommender_analysis_visualization.py
Implement your recommendation algorithm with:
- __init__: Initialize your algorithm and parameters
- fit: Train your model on historical data
- predict: Generate recommendations for users
Test your algorithm using the run_recommender_analysis function
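
To make the three methods concrete, here is a rough skeleton with the same shape. It is an illustration under stated assumptions, not the actual `MyRecommender` template: the real class most likely works with Sim4Rec/Spark data structures, whereas this sketch uses plain Python tuples and dictionaries, and the class name `MyRecommenderSketch`, the argument names, and the smoothing priors are all hypothetical. Check the template in recommender_analysis_visualization.py for the real signatures.

```python
from collections import defaultdict


class MyRecommenderSketch:
    """Illustrative only; method signatures are assumptions."""

    def __init__(self, prior_purchases=1.0, prior_views=2.0):
        # Smoothing priors keep rarely shown items from scoring 0 or 1
        self.prior_purchases = prior_purchases
        self.prior_views = prior_views
        self.scores = {}

    def fit(self, log, prices):
        # log: iterable of (user, item, response); prices: {item: price}
        views = defaultdict(float)
        purchases = defaultdict(float)
        for _user, item, response in log:
            views[item] += 1
            purchases[item] += response
        self.scores = {}
        for item, price in prices.items():
            rate = ((purchases[item] + self.prior_purchases)
                    / (views[item] + self.prior_views))
            # Score = smoothed purchase rate times price
            self.scores[item] = rate * price

    def predict(self, users, k):
        # Non-personalized: every user receives the same top-k items
        top_k = sorted(self.scores, key=self.scores.get, reverse=True)[:k]
        return {user: list(top_k) for user in users}
```

A short usage example with made-up data:

```python
model = MyRecommenderSketch()
model.fit(
    log=[("u1", "a", 1), ("u1", "b", 0), ("u2", "a", 1), ("u2", "c", 0)],
    prices={"a": 5.0, "b": 50.0, "c": 8.0},
)
recommendations = model.predict(["u1", "u2"], k=2)
```

Note that with these priors the expensive item `b` ranks first despite having no purchases yet, which shows how strongly the smoothing choice interacts with price.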

Checkpoints

The competition consists of three checkpoints, each focusing on a different type of recommendation algorithm:

- Content-based Recommender: Leverage user and item attributes to make recommendations
- Sequence-based Recommender: Use sequential patterns in user interactions
- Graph-based Recommender: Exploit relationships between users and items

Detailed instructions for each checkpoint will be released later.

Leaderboard

A competition leaderboard will track the performance of submitted algorithms. Submissions will be evaluated in hidden environments with the same setup but different random seeds to test robustness. More information on the leaderboard and submission process will be provided later.
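
Because hidden environments use different random seeds, it can help to check how much your own results vary from seed to seed before submitting. The sketch below shows one way to summarize that spread. `evaluate_once` is a hypothetical placeholder that returns a random number; in practice you would replace it with a function that runs your full train-test evaluation for a given seed and returns the test revenue.

```python
import random
import statistics


def evaluate_once(seed):
    """Hypothetical placeholder for one full train-test run."""
    rng = random.Random(seed)
    return sum(rng.uniform(0.0, 20.0) for _ in range(100))


def summarize_across_seeds(evaluate, seeds):
    # Run the same evaluation once per seed and summarize the spread
    results = [evaluate(seed) for seed in seeds]
    return {
        "mean": statistics.mean(results),
        "stdev": statistics.stdev(results) if len(results) > 1 else 0.0,
        "min": min(results),
        "max": max(results),
    }


summary = summarize_across_seeds(evaluate_once, seeds=[0, 1, 2, 3, 4])
```

A large standard deviation relative to the mean suggests the algorithm's ranking on the leaderboard may depend heavily on the seed.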

Baseline Recommenders

Several baseline algorithms are provided for comparison:

- RandomRecommender: Recommends random items
- PopularityRecommender: Recommends items based on popularity
- ContentBasedRecommender: Recommends items similar to previously liked items
- EnhancedUCB: Upper Confidence Bound algorithm with price consideration
- HybridRecommender: Combines multiple recommendation strategies
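
To give a feel for what "Upper Confidence Bound with price consideration" can mean, here is a generic UCB1-style score multiplied by item price. This is not the repository's `EnhancedUCB` implementation, whose details are not described here; the function names, the `exploration` parameter, and the choice to scale the whole bound by price are assumptions made for illustration.

```python
import math


def ucb_price_scores(purchases, views, prices, exploration=1.0):
    """Score = price * (observed purchase rate + UCB1 exploration bonus)."""
    total_views = sum(views.values())
    scores = {}
    for item, price in prices.items():
        n = views.get(item, 0)
        if n == 0:
            # Never shown: force at least one trial
            scores[item] = float("inf")
            continue
        mean = purchases.get(item, 0) / n
        bonus = exploration * math.sqrt(2.0 * math.log(total_views) / n)
        scores[item] = price * (mean + bonus)
    return scores


def top_k(scores, k):
    return sorted(scores, key=scores.get, reverse=True)[:k]


scores = ucb_price_scores(
    purchases={"a": 30, "b": 1},
    views={"a": 100, "b": 4},
    prices={"a": 10.0, "b": 10.0, "c": 3.0},
)
ranking = top_k(scores, 3)
```

In this example the unseen item `c` is ranked first, and `b` outranks `a` even though its observed purchase rate is lower, because it has been shown only four times and so carries a large exploration bonus.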

Tips for Success

- Consider item prices: Since revenue is the primary metric, rank items by expected revenue (purchase likelihood times price) rather than by purchase likelihood alone
- User segmentation: Different user segments may have different preferences
- Content-based features: Use user and item attributes for personalization
- Hybrid approaches: Combine multiple recommendation strategies
- Exploration vs. exploitation: Balance recommending items you already know users like against trying items whose appeal is still uncertain
- Iterative learning: Update your model as new interaction data becomes available
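
Several of these tips can be combined in a few lines. The sketch below estimates expected revenue per user segment and reserves the last recommendation slot for occasional exploration (epsilon-greedy). It is one possible approach under simplifying assumptions, not a recommended solution: the function names, the `segment_of` mapping, and the smoothing scheme are all hypothetical.

```python
import random
from collections import defaultdict


def segment_expected_revenue(log, segment_of, prices, smoothing=1.0):
    """Return {segment: {item: price * smoothed purchase rate}}."""
    views = defaultdict(lambda: defaultdict(float))
    buys = defaultdict(lambda: defaultdict(float))
    for user, item, response in log:
        seg = segment_of[user]
        views[seg][item] += 1
        buys[seg][item] += response
    table = {}
    for seg in set(segment_of.values()):
        table[seg] = {
            item: price * (buys[seg][item] + smoothing)
            / (views[seg][item] + 2 * smoothing)
            for item, price in prices.items()
        }
    return table


def recommend(scores, k, epsilon, rng):
    """Top-k by score; with probability epsilon, explore in the last slot."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    recs, rest = ranked[:k], ranked[k:]
    if rest and rng.random() < epsilon:
        recs[-1] = rng.choice(rest)
    return recs


segment_of = {"u1": "budget", "u2": "premium"}
prices = {"a": 5.0, "b": 40.0}
log = [("u1", "a", 1), ("u1", "b", 0), ("u2", "b", 1), ("u2", "a", 0)]
table = segment_expected_revenue(log, segment_of, prices)
recs = recommend(table["premium"], k=1, epsilon=0.1, rng=random.Random(0))
```

Setting `epsilon` to 0 gives a purely greedy ranking, while larger values spend more recommendations on gathering information about items outside the current top-k.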

Questions and Support

For questions or support, please contact the course instructors or teaching assistants.

Good luck with your algorithms!
