Model too big? #10250

GSanchis · 2018-03-26T18:37:04Z

Hi all,

I'm attempting to train a recommender system with MXNet 1.0.0 in Python3, but I'm running into the following problem: the dataset has rough 5M items, 200k users. This means that I am not able to have an embedding size larger than 100, since the model would not fit into memory:

user_embed = mx.symbol.Embedding(name="user_embed", data=user,
                                 input_dim=5000000, output_dim=100)
item_embed = mx.symbol.Embedding(name="item_embed", data=item,
                                 input_dim=200000, output_dim=100)
pred = mx.symbol.sum_axis(pred, axis=1)
pred = mx.symbol.Flatten(pred)
pred = mx.symbol.LinearRegressionOutput(data=pred, label=score)

I know that it's an overly simplistic model, but even this one doesn't fit into GPU memory... well, then a more complex model won't fit either.

Results with embedding size 100 are ok with a smaller amount of items/users, but for the full dataset they are not anymore, so I assume the problem is that the embeddings do not have enough expressive power.

Is there a way to reduce that memory footprint? Perhaps loading the embeddings "on-demand", i.e., only those that are actually required for a specific batch?

Thanks in advance!!

The text was updated successfully, but these errors were encountered:

anirudhacharya · 2018-03-26T19:27:59Z

@GSanchis this is not an issue with mxnet. Such questions are better asked in the discussion forum here - https://discuss.mxnet.io/ I encourage you to ask this question there.

@sandeep-krishnamurthy @nswamy - please tag this as 'Question' and close it.

eric-haibin-lin · 2018-03-27T00:51:43Z

SparseEmbedding could help reduce the memory footprint for the gradient. See example usage in https://github.com/apache/incubator-mxnet/tree/master/example/sparse/matrix_factorization

GSanchis · 2018-03-27T11:16:28Z

@anirudhacharya Sorry, I wasn't actually aware of that forum.

@eric-haibin-lin I'm aware of the SparseEmbedding, but... I'll post the "but" in the forum, to avoid keeping this issue open.

Yes, please close the question.

nswamy closed this as completed Mar 27, 2018

nswamy added the Question label Mar 27, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model too big? #10250

Model too big? #10250

GSanchis commented Mar 26, 2018

anirudhacharya commented Mar 26, 2018

eric-haibin-lin commented Mar 27, 2018

GSanchis commented Mar 27, 2018

Model too big? #10250

Model too big? #10250

Comments

GSanchis commented Mar 26, 2018

anirudhacharya commented Mar 26, 2018

eric-haibin-lin commented Mar 27, 2018

GSanchis commented Mar 27, 2018