Dataset of study: The Lifecycle of a Youtube Video: Phases, Content and Popularity

yuhonglin/ytphasedata

Dataset + Code on YouTube popularity history

We release the software and dataset from the following paper:

Honglin Yu, Lexing Xie and Scott Sanner, The Lifecycle of a Youtube Video: Phases, Content and Popularity (ICWSM-15) [paper | slides | poster]

Software

  1. Algorithm for power-law phase segmentation
  2. YouTube history crawler

Data

The data is hosted on Dropbox.

File Description

All files are in Python's pickle format.
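
A minimal sketch of loading one of these files in Python 3. The helper name `load_pickle` is hypothetical, and the `encoding="latin1"` argument is an assumption: it is only needed if the files were written by Python 2, which this README does not state. As with any pickle, load only files you trust.

```python
import pickle


def load_pickle(path):
    """Load one dataset file and return the object stored in it."""
    with open(path, "rb") as f:
        # encoding="latin1" lets Python 3 read pickles written by Python 2.
        # Whether these files need it is an assumption, not a documented fact.
        return pickle.load(f, encoding="latin1")
```

For example, `load_pickle("videoID_category.pickle")` should return the dictionary described below, provided the file has been downloaded to the working directory.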

videoID_category.pickle

  1. Data type : dictionary
  2. Key : videoIDs
  3. Value : category obtained from the Google API

videoID_segInfo.pickle

  1. Data type : dictionary
  2. Key : videoIDs
  3. Value : list of segment descriptions in chronological order. For example, the value for video XXTey9OjuGc is [(0, 4, -0.373851, 2.98491, 48.4606, 1), (5, 734, 98.0916, -0.955789, 7.4257, 0)], which means its viewcount history contains two segments. For the meaning of each segment parameter, please see this readme file on GitHub.
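
The sketch below walks over the example value above and reports how many days each segment covers. It assumes that the first two entries of each tuple are the inclusive start and end day indices; this is inferred from the example (0 to 4, then 5 to 734) and is not confirmed here. The remaining four entries are left uninterpreted, and `segment_spans` is a hypothetical helper name.

```python
def segment_spans(segments):
    """Return (start_day, end_day, n_days) for each segment tuple.

    Assumes seg[0] and seg[1] are inclusive day indices (an inference
    from the README example, not a documented fact).
    """
    spans = []
    for seg in segments:
        start, end = seg[0], seg[1]
        spans.append((start, end, end - start + 1))
    return spans


# The value given above for video XXTey9OjuGc.
example = [(0, 4, -0.373851, 2.98491, 48.4606, 1),
           (5, 734, 98.0916, -0.955789, 7.4257, 0)]

print(segment_spans(example))  # [(0, 4, 5), (5, 734, 730)]
```

Under that assumption the two segments together cover 5 + 730 = 735 days.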

videoID_viewcount.pickle

  1. Data type : dictionary
  2. Key : videoIDs
  3. Value : viewcount history starting from the day of upload (735 days per video)
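
A quick sanity check after loading could confirm that every series has the stated 735 entries. This is a sketch only: `series_with_unexpected_length` is a hypothetical helper, and it assumes each value is a sequence that supports `len()`.

```python
def series_with_unexpected_length(viewcounts, expected_days=735):
    """Return the video IDs whose series does not have expected_days entries."""
    return [video_id
            for video_id, series in viewcounts.items()
            if len(series) != expected_days]
```

Here `viewcounts` stands for the dictionary stored in videoID_viewcount.pickle; an empty result means every video has the expected number of days.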
