Skip to content

klangner/github-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GitHub-Analysis

This project contains code for parsing and analysis data availabe from githubarchive.org

Installation

Run application

Run the application to extract events for particular repository with command:

java -Xmx2G -jar gha.jar params.yaml

Sample params.yaml:

# Extract events from repository rails/rails into events.csv where data in range from 2011-11 to 2012-1

command: find_events
repository: rails/rails
date:
  from: 2011-11
  to: 2012-1
datapath: /home/klangner/datasets/github
output: events.csv

See more

Packages

  • com.matrobot.gha.archive - Contains code to parse githubarchive.org files and create intermediate csv and json files which can be processed by other packages or external programs like R or weka.
  • com.matrobot.gha.insights - Contains code for analyzing data and find patterns in it.
  • com.matrobot.gha.ghapi - Get data from GitHub API.

Important links

Dependencies

This project uses Maven to resolve dependencies

Contribution

If you want to help with this projects, then great! The best way to do it is by creating fork and sending pull requests.

If you are beginner and want to learn Java, then let me know. I can prepare easy tasks for you which will help you became better developer.

About

Analize github activity.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages