questions about the training process #3
Hi, we are not using BPTT to train the network (sequences in recommender systems are rather short, and BPTT did not pay off relative to the additional complexity). I suggest you take a look at this paper to get a better idea of the training process. It is essentially the same: HGRU just uses an additional layer to keep track of the user at each step of the sessions included in the mini-batch.
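For readers trying to picture this, here is a minimal NumPy sketch of session-parallel mini-batches with a per-slot user state. This is not the repository's Theano code; the toy data, `gru_step`, and all other names are illustrative stand-ins for the real GRU cells and embeddings.

```python
import numpy as np

rng = np.random.default_rng(0)
HIDDEN = 4

# Toy data: each user has a list of sessions; each session is a list of item ids.
users = {
    "u0": [[1, 2, 3], [4, 5]],
    "u1": [[6, 7], [8, 9, 10]],
}
emb = rng.normal(size=(16, HIDDEN))  # stand-in for a learned item embedding table

def gru_step(h, x):
    # Stand-in for a real GRU cell; any state/input mix illustrates the flow.
    return np.tanh(0.5 * h + 0.5 * x)

user_ids = list(users)
BATCH = len(user_ids)
session_h = np.zeros((BATCH, HIDDEN))       # session-level state, one row per slot
user_h = np.zeros((BATCH, HIDDEN))          # user-level state, one row per slot
cursor = [[uid, 0, 0] for uid in user_ids]  # per slot: user id, session idx, position

for step in range(5):  # 5 steps walk through every item of both toy users
    # Gather one item per slot: input length is 1 per mini-batch.
    batch_x = np.array([emb[users[uid][s][t]] for uid, s, t in cursor])

    # One forward step; in training, gradients flow through this step only.
    session_h = gru_step(session_h, batch_x)

    # Advance cursors. At a session boundary, fold the finished session's
    # final state into the user-level state and re-initialize the next
    # session's state from it -- this is the "additional layer that keeps
    # track of the user" in each slot of the mini-batch.
    for slot, (uid, s, t) in enumerate(cursor):
        if t + 1 < len(users[uid][s]):
            cursor[slot][2] = t + 1
        elif s + 1 < len(users[uid]):
            user_h[slot] = gru_step(user_h[slot], session_h[slot])
            session_h[slot] = user_h[slot]
            cursor[slot][1], cursor[slot][2] = s + 1, 0
        # (a finished user would be replaced by the next one; omitted here)
```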
Hi,
That's right, most users had few sessions (5-10). Nevertheless, in other experiments (not reported in the paper) we saw that the model also behaves well for users with more sessions (up to 20-30), even though training does not use BPTT. Interestingly, the gain of HGRU over GRU grows with the number of sessions in the user history in the scenarios we have tested (similar to video recommendation). Massimo
Hmm, that's really interesting. Have you tried using BPTT in the HGRU when users have up to 20-30 sessions? If so, does training without BPTT behave better than training with BPTT?
No, I did not, so I cannot help you with that, sorry.
Hi,
I'm not familiar with Theano, so I have some questions about the training process.
According to the code at lines 919-936 of hgru4rec.py, the input length is set to 1 in each mini-batch, which means each mini-batch only consists of data from one time step. I am wondering: in this way, can the error backpropagate through time?
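A reading of that setup, sketched below for clarity (in PyTorch rather than the repository's Theano, purely to keep the example short; all names are illustrative): the hidden state is carried between mini-batches as a plain value, so the computation graph is cut at every step and gradients cover only one transition, i.e. no BPTT.

```python
import torch
import torch.nn as nn

cell = nn.GRUCell(input_size=8, hidden_size=8)
opt = torch.optim.SGD(cell.parameters(), lr=0.1)

h = torch.zeros(2, 8)            # batch of 2, state carried between steps
for step in range(10):
    x = torch.randn(2, 8)        # stand-in for the embedded items at this step
    h = cell(x, h)
    loss = h.pow(2).mean()       # stand-in for the actual ranking loss
    opt.zero_grad()
    loss.backward()
    opt.step()
    # Detaching truncates the graph: the next step starts from h's value
    # only, so the error cannot propagate back through earlier time steps.
    h = h.detach()
```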