cs155-projects

Caltech CS 155 Winter 2018

Code for projects in Caltech's CS155: Machine Learning and Data Mining, developed by team Aw Young Qingzhuo, Ola Kalisz, and Riley Patterson.

Miniproject 1: Amazon Review Sentiment Detection

The first project is a Kaggle competition to detect sentiment in Amazon reviews. Details on the task are provided on the Kaggle project page. Our code is in the src directory: a Jupyter notebook trains the individual models, and this source file defines a stacked neural net that combines their results.

Initial summaries of results for individual models were tracked in this issue, and further results will be in the project report.
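The stacking idea mentioned above can be sketched as follows. This is a minimal illustration, not the project's code: the base-model predictions are synthetic, `fake_base_model` is a hypothetical stand-in for the trained individual models, and a logistic-regression meta-learner trained by gradient descent is swapped in for the stacked neural net to keep the example small.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
y = rng.integers(0, 2, size=n).astype(float)  # synthetic sentiment labels (0 or 1)

def fake_base_model(noise):
    """Hypothetical base model: the true label plus Gaussian noise, clipped to [0, 1]."""
    return np.clip(y + rng.normal(scale=noise, size=n), 0.0, 1.0)

# One column of predicted probabilities per base model.
base_preds = np.column_stack([fake_base_model(noise) for noise in (0.6, 0.8, 1.0)])
X = np.column_stack([base_preds, np.ones(n)])  # append a bias column

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def log_loss(w):
    p = sigmoid(X @ w)
    eps = 1e-12  # avoids log(0)
    return -np.mean(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps))

# Meta-learner: logistic regression fitted with plain gradient descent.
w = np.zeros(X.shape[1])
initial_loss = log_loss(w)
for _ in range(500):
    grad = X.T @ (sigmoid(X @ w) - y) / n
    w -= 0.5 * grad
final_loss = log_loss(w)

# Final stacked prediction for each review.
stacked_pred = (sigmoid(X @ w) > 0.5).astype(float)
```

In a real pipeline the meta-learner would normally be fitted on held-out predictions from the base models, so that it does not simply learn to trust whichever model overfits the training set most.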

Miniproject 2: Visualizing Matrix Factorizations for Movie Ratings

The second project involves finding matrix factorizations of the MovieLens dataset and projecting them into two dimensions, so that we can visualize and interpret how and why movies differ from each other.

We applied three different implementations of matrix factorization:

  • An SVD-style factorization into two matrices, U and V, with the singular values absorbed into the matrices themselves.
  • SVD with bias terms.
  • SVD from SciPy, for which we analyze the matrices U and V that do not incorporate the singular values.
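The difference between the two conventions for the singular values can be illustrated with a small sketch. The ratings matrix below is made up, and splitting the singular values evenly (a square root on each side) is only one possible way to absorb them; it is an assumption here, not necessarily what the project code does.

```python
import numpy as np
from scipy.linalg import svd

# Toy ratings matrix (users x movies); the values are invented for illustration.
Y = np.array([
    [5.0, 4.0, 1.0, 1.0],
    [4.0, 5.0, 1.0, 2.0],
    [1.0, 1.0, 5.0, 4.0],
    [2.0, 1.0, 4.0, 5.0],
    [3.0, 3.0, 3.0, 3.0],
])
k = 2  # number of latent factors to keep

# Thin SVD from SciPy: Y = A @ diag(s) @ Bt, singular values in descending order.
A, s, Bt = svd(Y, full_matrices=False)
A_k, s_k, Bt_k = A[:, :k], s[:k], Bt[:k, :]

# Convention 1: singular values kept separate from the orthonormal factors.
Y_hat_separate = A_k @ np.diag(s_k) @ Bt_k

# Convention 2: singular values absorbed into the factors (sqrt on each side).
U = A_k * np.sqrt(s_k)     # shape (n_users, k)
V = Bt_k.T * np.sqrt(s_k)  # shape (n_movies, k)
Y_hat_folded = U @ V.T
```

Both conventions give the same rank-k reconstruction of Y. What changes is the scale of the individual factor matrices, which affects how spread out the points look when the factors are plotted.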

For each of these, we produced 2D visualizations with various colorings and labelings to help infer what the two extracted dimensions correspond to.
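One common way to obtain such a 2D view is to take the SVD of the movie factor matrix and project onto its two leading left singular vectors. The sketch below assumes that approach; the random matrix `V` and its dimensions are hypothetical stand-ins for learned movie factors, and the project may have done this differently.

```python
import numpy as np

rng = np.random.default_rng(0)
k, n_movies = 20, 50
# Hypothetical latent movie factors: one column per movie, k latent dimensions.
V = rng.normal(size=(k, n_movies))

# Center each latent dimension so the projection captures variation, not the mean.
V_centered = V - V.mean(axis=1, keepdims=True)

# SVD of the factor matrix: V_centered = A @ diag(s) @ Bt.
A, s, Bt = np.linalg.svd(V_centered, full_matrices=False)

# Project onto the two leading left singular vectors.
V_2d = A[:, :2].T @ V_centered  # shape (2, n_movies)

x_coords, y_coords = V_2d[0], V_2d[1]  # ready for a scatter plot
```

Each movie then has an (x, y) coordinate that can be passed to a scatter plot and colored or labeled by genre, popularity, or average rating.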

The code for this project is in the source directory for project2, and the results are found in the project2 report directory.