Skip to content

Sample data set

Compare
Choose a tag to compare
@klauspost klauspost released this 31 Jul 11:08
· 30 commits to master since this release

The file called github-ranks-backup.bin.zst is a backup of 16 tables containing aggregated github event data.

Restore the database using this command:

rankdb-cli --timeout=5m restore multilist --src="file" --src_file="github-ranks-backup.bin.zst"

Elements are indexed by internal github user/repo integer id.

The tiebreaker is the inverse element id, meaning lower ID wins over higher.

"repos"

Contains events per repo for June 2019. Separated by event type:

"repos-CommitCommentEvent"
"repos-CreateEvent"
"repos-DeleteEvent"
"repos-ForkEvent"
"repos-GollumEvent"
"repos-PublicEvent"
"repos-PushEvent"

All of these lists have the following metadata defined:

      "metadata": {
        "content": "repos",
        "type": "actions"
      }
"users"

Contains events per user. Separated by event type:

"users-CommitCommentEvent"
"users-CreateEvent"
"users-DeleteEvent"
"users-ForkEvent"
"users-GollumEvent"
"users-PublicEvent"
"users-PushEvent"

All of these lists have the following metadata defined:

      "metadata": {
        "content": "users",
        "type": "actions"
      },

Fun requests:

Get top users:

http://127.0.0.1:8080/lists/users/range?from_top=0&limit=25

Get top repos:

http://127.0.0.1:8080/lists/repos/range?from_top=0&limit=25

Get user (and 5 neighbors) at the top 1 percentile:

http://127.0.0.1:8080/lists/users/percentile?from_top=1.0&range=5

Get rank for user 649556:

http://127.0.0.1:8080/lists/users/elements/649556

Returns elements with ID 649556 on all lists matching {"content": "users"} metadata:

http://127.0.0.1:8080/xlist/elements/649556?match_metadata=%7B%22content%22%3A%20%22users%22%7D