
Negative sampling #36

Closed
nonamestreet opened this issue Aug 5, 2017 · 5 comments

@nonamestreet

@maciejkula I guess we should remove the items found in the training dataset before negative sampling. Otherwise, it might make the learning less effective?

@maciejkula
Owner

For implicit models, the only user-item pairs your dataset should contain are those where an interaction has been observed. You should not include pairs where an interaction is missing. Does this answer your question?
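For concreteness (not from this thread, and only based on my reading of Spotlight's Interactions API), a dataset for an implicit model would list only the observed pairs, something like:

    import numpy as np
    from spotlight.interactions import Interactions

    # Only observed (user, item) pairs are listed; any pair not listed is
    # implicitly a candidate negative at sampling time.
    user_ids = np.array([0, 0, 1, 2], dtype=np.int32)
    item_ids = np.array([5, 9, 5, 3], dtype=np.int32)
    dataset = Interactions(user_ids, item_ids)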

@nonamestreet
Author

nonamestreet commented Aug 5, 2017

Thank you for the reply. The negative sampling part of the code samples from all items. I guess it should exclude the observed interactions, as they are definitely not negative. For example, for user A who has already bought items 1, 2, and 3, when sampling the negative prediction, shouldn't we exclude items 1, 2, and 3?

 
    def _get_negative_prediction(self, user_ids):

        # Sample candidate negative item ids uniformly from the whole
        # catalogue; observed positives are not excluded here.
        negative_items = sample_items(
            self._num_items,
            len(user_ids),
            random_state=self._random_state)
        negative_var = Variable(
            gpu(torch.from_numpy(negative_items), self._use_cuda)
        )
        # Score the sampled items against the users with the model.
        negative_prediction = self._net(user_ids, negative_var)

        return negative_prediction
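Not Spotlight code, but a minimal sketch of what excluding observed items could look like, assuming a hypothetical `seen_items` dict mapping each user id to the set of items they have interacted with; it rejection-samples until an unseen item is drawn:

    import numpy as np

    def sample_negatives_excluding_seen(num_items, user_ids, seen_items,
                                        random_state=None):
        # seen_items: dict of user id -> set of observed item ids (hypothetical helper).
        random_state = random_state or np.random.RandomState()
        negative_items = random_state.randint(0, num_items, len(user_ids))
        for i, user_id in enumerate(user_ids):
            seen = seen_items.get(int(user_id), ())
            # Rejection sampling: redraw until the item is not an observed positive.
            while negative_items[i] in seen:
                negative_items[i] = random_state.randint(0, num_items)
        return negative_items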

@maciejkula
Owner

In principle you are right. In practice, as long as the number of all items is (much) larger than the number of positive items (which is usually the case), I haven't found this omission detrimental to accuracy. I suspect that you could even make the argument that this approach gives you an implicit regularizer.
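To put a rough number on that argument (illustrative figures, not from this thread): with uniform sampling, the chance that a drawn "negative" is actually one of the user's positives is roughly positives / num_items.

    # Illustrative numbers only: collision rate of uniform negative sampling.
    num_items = 50_000          # catalogue size (assumed)
    positives_per_user = 20     # observed interactions for one user (assumed)
    print(positives_per_user / num_items)  # 0.0004 -> ~0.04% of negatives are false

On a small catalogue with heavy users that fraction grows quickly, which is where the omission would start to matter.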

@nonamestreet
Author

Yeah, I agree. It can only be a potential problem when the number of items is small. Thank you!

@KylinA1

KylinA1 commented Jun 21, 2019

In principle you are right. In practice, as long as the number of all items is (much) larger than the number of positive items (which is usually the case), I haven't found this omission detrimental to accuracy. I suspect that you could even make the argument that this approach gives you an implicit regularizer.

Hi, I tried both on the MovieLens 1M dataset, and I found that this sampling significantly reduces performance.
