"interact with in the next week" #16

Open
talestsp opened this issue Mar 16, 2016 · 7 comments

Comments

@talestsp

Hi data people,

I didn't understand what "next week" means in this sentence:

"the recommender should predict those job postings (items) that the user will interact with in the next week."

Thanks :)

@dkohlsdorf
Contributor

Well, this is a little hypothetical :). We held out the interaction data for the one week after the dataset ends.
These are the interactions you have to predict for the target users.

Hope that helps

Daniel

@talestsp
Author

Hi Daniel,

Does that mean it is the same week number for everybody? Or is it each
user's last week number plus 1?

Thanks again :)

Tales Tenorio de Souza Pimentel
https://br.linkedin.com/pub/tales-tenorio-pimentel/63/5a6/661

@zhengrongqin

@dkohlsdorf
Hi Daniel,
I don't understand what you mean. From which dataset did you hold out the one week of interaction data? items.csv?
Thanks

@dkohlsdorf
Contributor

The data you can access and learn from was extracted from Aug - Nov 2015.
We held out the data for the week after the end date in November.
And yes, we only held out interactions, since the interactions are the ground truth for the predictions.
All users and items you need to make your predictions are given to you.

Daniel
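
For readers who want to mimic this hold-out locally, here is a minimal sketch. It assumes the interactions are available as rows with `user_id`, `item_id`, `interaction_type` and `week` fields (the column names used in the SQL query further down this thread); the rows themselves and the helper names `split_by_last_week` and `ground_truth` are made up for illustration and are not the organizers' code.

```python
# Toy interaction rows: (user_id, item_id, interaction_type, week).
# The values are invented; only the column names come from this thread.
interactions = [
    (1, 10, 1, 43),
    (1, 11, 2, 44),
    (2, 10, 1, 44),
    (1, 10, 3, 45),
    (2, 12, 1, 45),
]


def split_by_last_week(rows):
    """Hold out the most recent week as a local validation set (hypothetical helper)."""
    last_week = max(week for _, _, _, week in rows)
    train = [r for r in rows if r[3] < last_week]
    held_out = [r for r in rows if r[3] == last_week]
    return train, held_out


def ground_truth(rows):
    """Map each user to the set of items they interacted with (hypothetical helper)."""
    truth = {}
    for user_id, item_id, _, _ in rows:
        truth.setdefault(user_id, set()).add(item_id)
    return truth


train, held_out = split_by_last_week(interactions)
validation_truth = ground_truth(held_out)
print(len(train), validation_truth)
```

Training only on `train` and scoring predictions against `validation_truth` imitates the setup described above, with the last available week standing in for the hidden test week.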

@skywaLKer518

One follow-up question: is it possible that those "interactions in the next week" already appear in the users' interaction history, or do we only need to predict "new ones"? From the description it seems to be the former, but we would like to confirm. Thanks!

@dkohlsdorf
Contributor

No, the interactions for the test set are not in the data.

@pjankiewicz

@dkohlsdorf To be more specific: are you saying that no (user_id, item_id) pair appears in both the test interactions and the training interactions? I did a quick check in which I created a submission containing only training interactions, and it scores 154890.45. Assuming I didn't make a mistake, is this to be expected?

For example, this query returns 35999 interactions.

-- Count (user_id, item_id) pairs from week 45 that also occur in an earlier week
SELECT COUNT(*) FROM (
  SELECT DISTINCT user_id, item_id FROM interactions WHERE week = 45 AND interaction_type IN (1,2,3)
  INTERSECT
  SELECT DISTINCT user_id, item_id FROM interactions WHERE week < 45 AND interaction_type IN (1,2,3)
) AS T;

So I assume that a similar number of repeated pairs exists for week 46 (the test week).
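
To try the overlap check above without a database server, here is a minimal sketch that runs the same query against an in-memory SQLite table. The table layout is assumed from the query itself, and the six rows are invented toy data, so the count it prints says nothing about the real dataset.

```python
import sqlite3

# In-memory table with the columns the query refers to (assumed layout).
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE interactions ("
    "user_id INTEGER, item_id INTEGER, interaction_type INTEGER, week INTEGER)"
)

# Invented toy rows: (user_id, item_id, interaction_type, week).
rows = [
    (1, 10, 1, 43),  # pair (1, 10) seen before week 45 ...
    (1, 10, 2, 45),  # ... and again in week 45: a repeated pair
    (2, 11, 1, 44),
    (2, 12, 1, 45),  # pair (2, 12) is new in week 45
    (3, 13, 4, 44),  # type 4 is outside the (1, 2, 3) filter
    (3, 13, 1, 45),  # so (3, 13) does not count as repeated
]
conn.executemany("INSERT INTO interactions VALUES (?, ?, ?, ?)", rows)

# Same logic as the query above: pairs in week 45 that also occur earlier.
QUERY = """
SELECT COUNT(*) FROM (
  SELECT DISTINCT user_id, item_id FROM interactions
  WHERE week = 45 AND interaction_type IN (1, 2, 3)
  INTERSECT
  SELECT DISTINCT user_id, item_id FROM interactions
  WHERE week < 45 AND interaction_type IN (1, 2, 3)
) AS T
"""
repeated_pairs = conn.execute(QUERY).fetchone()[0]
print(repeated_pairs)  # 1 for the toy rows above
```

Changing `45` to another week in both branches gives the repeat count for that week, which is one way to see how stable the number is across weeks before extrapolating to week 46.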
