CS 155 Winter 2018
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Type Name Latest commit message Commit time
Failed to load latest commit information.



Caltech CS 155 Winter 2018

Code for projects in Caltech's CS155: Machine Learning and Data Mining, developed by team Aw Young Qingzhuo, Ola Kalisz, and Riley Patterson.

Miniproject 1: Amazon Review Sentiment Detection

The first project is a Kaggle competition to detect sentiment in Amazon reviews. Details on the task are provided on the Kaggle project page, and our code is in the src directory, in particular in a jupyter notebook for training individual models and in this source file defining a stacked neural net using these results.

Initial summaries of results for individual models were tracked in this issue, and further results will be in project report.

Miniproject 2: Visualizing Matrix Factorizations for Movie Ratings

The second project involves finding matrix factorizations for the MovieLens dataset and projecting it into two dimensions for visualization and interpretation of how and why movies differ from each other.

We applied three different implementations of matrix factorization:

  • An implementation of SVD in which we decompose into two matrices, U and V, which incorporate the singular values into the matrices themselves.
  • SVD with Bias terms.
  • SVD from SciPy from which we analyze matrices U and V which do not incorporate the singular values.

For each of these, we produced 2D visualizations with various colorings and labelings to help infer what the two extracted dimensions correlated to.

The code for this project is in the source directory for project2, and the results are found in the project2 report directory.