In the documentation, you will find the following notation:
- R : the set of all ratings.
- Rtrain, Rtest and R̂ denote the training set, the test set, and the set of predicted ratings.
- U : the set of all users. u and v denotes users.
- I : the set of all items. i and j denotes items.
- Ui : the set of all users that have rated item i.
- Uij : the set of all users that have rated both items i and j.
- Iu : the set of all items rated by user u.
- Iuv : the set of all items rated by both users u and v.
- rui : the true rating of user u for item i.
- r̂ui : the estimated rating of user u for item i.
- bui : the baseline rating of user u for item i.
- μ : the mean of all ratings.
- μu : the mean of all ratings given by user u.
- μi : the mean of all ratings given to item i.
- σu : the standard deviation of all ratings given by user u.
- σi : the standard deviation of all ratings given to item i.
- Nik(u) : the k nearest neighbors of user u that have rated item i. This set is computed using a
similarity metric <surprise.similarities>
. - Nuk(i) : the k nearest neighbors of item i that are rated by user u. This set is computed using a :py
similarity metric <surprise.similarities>
.
References
Here are the papers used as references in the documentation. Links to pdf files where added when possible. A simple Google search should lead you easily to the missing ones :)
refs.bib