Skip to content

lyingCS/KuaiComt.github.io

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

KuaiComt

KuaiComt is a comprehensive short video recommendation dataset that includes abundant comment text and interaction data. It contains real user behavior logs collected from the short-video mobile app Kuaishou, a leading short video app in China with over 400 million daily active users. On average, users spend over 120 minutes on the app each day, with more than 7 minutes (over 5%) spent in the video comments section. The comments section boasts a UV penetration rate of over 60%.

This is the first recommendation dataset that not only records item text and interaction data but also includes abundant comment text and interaction data!

Overview

The following figure provides an example of the dataset. When users enter the app, they can scroll up and down to browse different videos. Additionally, users can click the comment button on the right side of the video to enter the comments section, where they can scroll through comments and engage in interactive behaviors such as likes and replies.

kuaidata

Download the data:

KuaiComt has been shared at https://zenodo.org/records/13922581.

DOI

OPTION 1. Download via your browser:

You can download the dataset from this link.

OPTION 2: Download via the 'wget' command tool:

For the KuaiComt dataset:

wget https://zenodo.org/record/13922581/files/KuaiComt.zip

unzip KuaiComt.zip

Citation

If you find our dataset useful, please cite the paper:

@inproceedings{zhang2025comment,
  title={Comment Staytime Prediction with LLM-enhanced Comment Understanding},
  author={Zhang, Changshuo and Lin, Zihan and Liu, Shukai and Liu, Yongqi and Li, Han},
  booktitle={Companion Proceedings of the ACM on Web Conference 2025},
  pages={586--595},
  year={2025}
}

License

CC BY-NC-SA 4.0

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

CC BY-NC-SA 4.0

About

A Joint Video and Comment Recommendation Dataset (WWW 2025)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors