
Benchmarks against other implementations #6

Closed · Hydrotoast opened this issue on Jun 6, 2016 · 7 comments

@Hydrotoast (Contributor) commented Jun 6, 2016:

Some other implementations to compare against, such as fastFM.

Tasks:

  • Select 2-3 datasets for comparison
  • Set up experiments

Experiment approach

  1. Select a dataset and split it into training (X_train, y_train) and test (X_test, y_test) sets
  2. Install both libraries
  3. Train both libraries on (X_train, y_train) and measure the training time
  4. Verify that the test-set evaluations on (X_test, y_test) are close enough
  5. Repeat the test 10 times

Implementing the experiment (up for discussion; alternatives welcome)

  1. Write a benchmark script in Bash and use simple wall-clock time for measurements (a sketch follows this list)
  2. Save the script in a new benchmarks/ folder
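
A minimal sketch of what steps 3-5 could look like, assuming a scikit-learn-style fit/predict interface (which fastFM provides). The benchmarks/run_benchmark.py path and the benchmark helper are hypothetical names, and Python is used here (rather than the Bash proposed above) to stay consistent with the fastFM script later in this thread:

# benchmarks/run_benchmark.py (hypothetical) -- sketch of steps 3-5
import time

import numpy as np
from sklearn.metrics import mean_squared_error


def benchmark(make_model, X_train, y_train, X_test, y_test, repeats=10):
    """Fit a fresh model `repeats` times; return mean wall-clock fit time and mean test RMSE."""
    fit_times, rmses = [], []
    for _ in range(repeats):
        model = make_model()                         # fresh model per run (step 5)
        start = time.perf_counter()
        model.fit(X_train, y_train)                  # step 3: time training only
        fit_times.append(time.perf_counter() - start)
        y_pred = model.predict(X_test)
        rmses.append(np.sqrt(mean_squared_error(y_test, y_pred)))  # input to step 4
    return np.mean(fit_times), np.mean(rmses)

Running this once per library on the same split and comparing the returned RMSEs would cover the "close enough" check in step 4.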
@Hydrotoast (Contributor, Author):

This task is OS-dependent, since there are various methods of installing the desired packages and executing them. I suppose the best approach would be to clone the respective repositories and run their install scripts.

@Hydrotoast (Contributor, Author) commented Jun 11, 2016:

A script I wrote recently for benchmarking against fastFM:

import numpy as np

from fastFM import sgd
from scipy.sparse import hstack
from sklearn.datasets import load_svmlight_file
from sklearn.metrics import mean_squared_error
from math import sqrt


# Load the MovieLens 100k train/test splits (libsvm format).
X_train, y_train = load_svmlight_file("ml100k_train.txt.clean")
n_features_train = X_train.shape[1]
X_test, y_test = load_svmlight_file("ml100k_test.txt.clean")
m_test, n_features_test = X_test.shape

# Pad the test matrix with zero columns so it has the same feature
# dimension as the training matrix.
X_test = hstack((X_test, np.zeros((m_test, n_features_train - n_features_test), dtype=np.float64)))

fm = sgd.FMRegression(n_iter=10, init_stdev=0.01, rank=4, l2_reg_w=0.0, l2_reg_V=0.0, step_size=0.1)
fm.fit(X_train, y_train)
y_pred = fm.predict(X_test)

# Report test-set RMSE.
print(sqrt(mean_squared_error(y_test, y_pred)))
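
The script above does not yet time training or repeat runs (steps 3 and 5 of the experiment approach). A minimal sketch of how the fit call could be wrapped to add that, not part of the original script:

import time

fit_times = []
for _ in range(10):                                # step 5: repeat 10 times
    fm = sgd.FMRegression(n_iter=10, init_stdev=0.01, rank=4,
                          l2_reg_w=0.0, l2_reg_V=0.0, step_size=0.1)
    start = time.perf_counter()
    fm.fit(X_train, y_train)                       # step 3: time training only
    fit_times.append(time.perf_counter() - start)

print("mean fit time: %.3fs" % (sum(fit_times) / len(fit_times)))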

@btwardow (Owner):

If it's OS-dependent, what do you think about dockerizing it?

@Hydrotoast (Contributor, Author):

Docker is a good idea for accurate and fair benchmarks; however, I am uncertain how much work would be required to get it working. Perhaps we can get something simple working first and move to Docker if this becomes a popular library across several platforms?

Hydrotoast mentioned this issue on Jun 13, 2016.
@btwardow (Owner):

OK. Once we have a Bash/Python/Julia script for running it, we are only a simple step away from encapsulating it inside a container.

@Hydrotoast (Contributor, Author):

Agreed. Time for me to review Docker.

@Hydrotoast (Contributor, Author):

Technically, the PR for this issue has been merged, although the current implementation is slower than fastFM. I will close this issue now and open a new one after some investigation into the slow code.
