
zhangh43/github-analysis


Github Analysis

This repo focuses on analyzing GitHub history data from GitHub Archive.

Download

We use two scripts, generate_url.py and download_url.py, to download GitHub history data from GitHub Archive.

# Take the year 2016 as an example
mkdir 2016
python generate_url.py 2016
python download_url.py url2016 2016 120
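
For readers who want to see what a URL list for one year might look like, here is a minimal sketch. It is not the code of generate_url.py. The function name `hourly_archive_urls` and the `BASE_URL` constant are hypothetical, and the sketch assumes GH Archive's hourly file naming (`YYYY-MM-DD-H.json.gz`, with the hour not zero-padded) served from `data.gharchive.org`.

```python
from datetime import datetime, timedelta

# Assumption: GH Archive serves one gzip file per hour from this host.
BASE_URL = "https://data.gharchive.org"


def hourly_archive_urls(year):
    """Return one archive URL per hour of the given year (hypothetical helper)."""
    current = datetime(year, 1, 1)
    end = datetime(year + 1, 1, 1)
    urls = []
    while current < end:
        # The hour is written without zero padding, e.g. 2016-01-01-0.json.gz
        urls.append(f"{BASE_URL}/{current:%Y-%m-%d}-{current.hour}.json.gz")
        current += timedelta(hours=1)
    return urls


if __name__ == "__main__":
    urls = hourly_archive_urls(2016)
    print(len(urls), "URLs, first:", urls[0])
```

A full year produces several thousand files, so writing the list to a file first and downloading from that file lets an interrupted download resume without regenerating anything.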

Preprocessing

We preprocess the GitHub history data in the following steps:

  1. Extract [event, user, repo] triples from the original data.
python preprocess.py
  2. Filter important users and repos.

  3. Find the strong relationships between users and repos.
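
The three steps above can be sketched as follows. This is an illustration under stated assumptions, not the logic of preprocess.py. It assumes each archive line is a JSON event with `type`, `actor.login`, and `repo.name` fields (the GH Archive format since 2015). It also assumes "important" means having at least a minimum number of events, and that a "strong relationship" is a user and repo pair with at least a minimum number of shared events. All function names and thresholds are hypothetical.

```python
import gzip
import json
from collections import Counter


def extract_triples(lines):
    """Step 1: yield (event, user, repo) from JSON-encoded event lines."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines
        if not isinstance(record, dict):
            continue
        event = record.get("type")
        user = (record.get("actor") or {}).get("login")
        repo = (record.get("repo") or {}).get("name")
        if event and user and repo:
            yield event, user, repo


def read_archive(path):
    """Read triples from one downloaded .json.gz archive file."""
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        yield from extract_triples(handle)


def filter_important(triples, min_user_events=2, min_repo_events=2):
    """Step 2: keep triples whose user and repo both meet an activity threshold."""
    triples = list(triples)
    user_counts = Counter(user for _, user, _ in triples)
    repo_counts = Counter(repo for _, _, repo in triples)
    return [
        t for t in triples
        if user_counts[t[1]] >= min_user_events
        and repo_counts[t[2]] >= min_repo_events
    ]


def strong_relationships(triples, min_events=2):
    """Step 3: count events per (user, repo) pair and keep the frequent pairs."""
    pair_counts = Counter((user, repo) for _, user, repo in triples)
    return {pair: n for pair, n in pair_counts.items() if n >= min_events}
```

A typical use would chain the steps, for example `strong_relationships(filter_important(read_archive(path)))`, with the thresholds tuned to the size of the data.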
