Caltech CS 155 Winter 2018
Miniproject 1: Amazon Review Sentiment Detection
The first project is a Kaggle competition to detect sentiment in Amazon reviews. Details on the task are provided on the Kaggle project page. Our code is in the src directory: a Jupyter notebook trains the individual models, and this source file defines a stacked neural net that uses their results.
Initial summaries of results for the individual models were tracked in this issue, and further results will be in the project report.
Miniproject 2: Visualizing Matrix Factorizations for Movie Ratings
The second project involves finding matrix factorizations of the MovieLens dataset and projecting the resulting factors into two dimensions to visualize and interpret how and why movies differ from each other.
We applied three different implementations of matrix factorization:
- An implementation of SVD that decomposes the ratings matrix into two matrices, U and V, with the singular values incorporated into the matrices themselves.
- SVD with bias terms.
- SciPy's SVD, from which we analyze the matrices U and V, which do not incorporate the singular values.
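The difference between factors that absorb the singular values and factors that do not can be sketched as follows. This is a minimal illustration, not the project's code: it assumes `scipy.linalg.svd` (the source does not say which SciPy routine was used), a made-up toy ratings matrix `Y`, a hypothetical rank `k`, and one common convention of splitting the square roots of the singular values evenly between the two factors.

```python
import numpy as np
from scipy.linalg import svd

# Toy ratings matrix (users x movies); values are made up for illustration.
Y = np.array([
    [5.0, 4.0, 1.0, 1.0],
    [4.0, 5.0, 1.0, 2.0],
    [1.0, 1.0, 5.0, 4.0],
    [2.0, 1.0, 4.0, 5.0],
    [3.0, 3.0, 3.0, 3.0],
])

k = 2  # number of latent factors kept (assumed value)

# Thin SVD: Y = U_full @ diag(s) @ Vt_full, singular values sorted descending.
U_full, s, Vt_full = svd(Y, full_matrices=False)

# Keep the top-k components. These factors do NOT contain the singular values:
# the columns of U_k and the rows of Vt_k are orthonormal.
U_k, s_k, Vt_k = U_full[:, :k], s[:k], Vt_full[:k, :]

# Factors that DO absorb the singular values (assumed convention: sqrt split).
U_abs = U_k * np.sqrt(s_k)       # users x k
V_abs = Vt_k.T * np.sqrt(s_k)    # movies x k

# Both forms give the same rank-k approximation of Y.
Y_hat = U_abs @ V_abs.T
```

With the singular values absorbed, the scale of each latent dimension reflects how much of the ratings it explains; without them, every dimension has unit scale, which changes how spread out the points look when plotted.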
For each of the three factorizations, we produced 2D visualizations with various colorings and labelings to help infer what the two extracted dimensions correspond to.
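One way to obtain such a 2D view is to project the latent factors onto the two leading left singular vectors of the movie factor matrix. The sketch below assumes that approach; it is not necessarily the one used in this project. The function name `project_to_2d`, the matrix sizes, and the random factors are all hypothetical, and U and V are assumed to be stored with one column per user or movie.

```python
import numpy as np

def project_to_2d(U, V):
    """Project k-dimensional factors onto the top two directions of V.

    U: (k, n_users) user factors, V: (k, n_movies) movie factors.
    Returns (U_2d, V_2d) with shapes (2, n_users) and (2, n_movies).
    """
    # Left singular vectors of V, ordered by decreasing singular value.
    A, _, _ = np.linalg.svd(V, full_matrices=False)
    A2 = A[:, :2]              # k x 2 orthonormal basis
    return A2.T @ U, A2.T @ V  # same projection applied to both factors

# Hypothetical random factors standing in for a trained model.
rng = np.random.default_rng(0)
k, n_users, n_movies = 5, 30, 12
U = rng.normal(size=(k, n_users))
V = rng.normal(size=(k, n_movies))

U_2d, V_2d = project_to_2d(U, V)
```

Each column of `V_2d` is then the (x, y) position of one movie and can be passed to a scatter plot, colored or labeled by genre, rating, or popularity.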