About the Tmall datasets #3

xiaxin1998 · 2020-12-21T18:27:26Z

Hi,
Thanks for your sharing of this paper!
About the datasets Tmall, I find the website you provide in your paper and download the datasets. But I find that in the test dataset, there are no given labels for it. Only the training set has labels. So would you mind share the Tmall test dataset with sessions' labels for us?
Thanks!

Mikrokosmos1997 · 2020-12-22T04:55:18Z

Thank you for your interest in our paper.
The label information in the raw data indicates whether the user is a repeat buyer, which is not related to our task and has not been used in our work. For the Tmall dataset used in our work, the label of each session is the last item in the session.

xiaxin1998 · 2020-12-22T11:02:44Z

Thanks four your response!
But when I preprocess the datasets, the statistics are different with your papers.
For Tmall datasets, I use the train_format2.csv and test_format2.csv in the website.
For the preprocess, I firstly get all the sessions, and leave the first 120000 sessions.
For the 120000 sessions, I filter sessions whose lengths are 1 and larger than 40. Then I filter the items who appear less than 5 times. And then split the sessions.
But I still have different datasets with yours showed in your paper, Table1.
So would you mind share the preprocessed datasets for us or tell me my mistakes in the preprocess?

xiaxin1998 · 2020-12-22T11:09:43Z

If you share the preprocessed datasets for us, we'll appreciate. It is more convenient for us to make comparisons with your model.

xiaxin1998 · 2020-12-22T12:02:57Z

UPDATE: in the train_format2.csv, there are only 4995 items, but in your paper, there should be 40728 items.

Mikrokosmos1997 · 2020-12-22T12:31:40Z

Thank you for your interest.
We have updated the preprocessed Tmall datasets. You can also download the raw Tmall data from the dropbox link in the paper Evaluation of Session-based Recommendation Algorithms.

xiaxin1998 · 2020-12-22T12:38:18Z

Thank you very much!!!!!!!!!!

Mikrokosmos1997 closed this as completed Dec 22, 2020

This was referenced Apr 27, 2021

Tmall数据集及处理 #12

Closed

Original csv of Tmall and Nowplaying #13

Closed

xiaxin1998 mentioned this issue Sep 12, 2021

Data_preprocess xiaxin1998/DHCN#7

Closed

Mikrokosmos1997 mentioned this issue Oct 22, 2021

关于数据集 #18

Closed

yangbo1973 mentioned this issue Jul 27, 2022

数据集（类别信息） yangbo1973/CM-HGNN#4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About the Tmall datasets #3

About the Tmall datasets #3

xiaxin1998 commented Dec 21, 2020

Mikrokosmos1997 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

Mikrokosmos1997 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

About the Tmall datasets #3

About the Tmall datasets #3

Comments

xiaxin1998 commented Dec 21, 2020

Mikrokosmos1997 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020

Mikrokosmos1997 commented Dec 22, 2020

xiaxin1998 commented Dec 22, 2020