Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rating prediction could stream test-file to save memory #432

Open
jkleint opened this issue Aug 1, 2013 · 2 comments
Open

Rating prediction could stream test-file to save memory #432

jkleint opened this issue Aug 1, 2013 · 2 comments

Comments

@jkleint
Copy link

jkleint commented Aug 1, 2013

I'm making predictions with a command like this:

rating_prediction \
    --recommender="$method" \
    --rating-type=byte \
    --training-file="$1" \
    --test-file="$2" \
    --test-no-ratings \
    --prediction-file="$3"

When running with a large "test-file", it uses a lot of memory; with a small test-file, it uses a little memory. I'm not familiar with the internals of MyMediaLite, but can't each prediction be made independently, facilitating a streaming read of the test-file? This would be both faster and enable the use of larger data sets.

It would also be awesome if I could read from a process or fifo (i.e., streaming read without seeking) so I could store my data compressed and uncompress it on the fly.

@ghost ghost assigned zenogantner Aug 1, 2013
@zenogantner
Copy link
Owner

This makes sense for very large data sets.
I would implement this after some general changes, which will come with 4.0.

Of course I will not keep anyone from working on this, so if there are patches, we could have it in 3.10 or 4.0.

How large are your test files?

@jkleint
Copy link
Author

jkleint commented Aug 7, 2013

A few GB.

@zenogantner zenogantner removed this from the 5.0 milestone Apr 29, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants