Skip to content

abiUni/mining_cr_data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Github code review data scraper

This repo contains a modification of the ETRC tool (GitHub repo) to download code review data for multiple repositories given a csv file with repository names and pull request numbers.

To create the postgres database:

brew doctor
brew update 
brew install postgres@15
export PATH="/usr/local/opt/postgresql@15/bin:$PATH"
brew services start postgresql@15
psql postgres
CREATE ROLE scraper WITH LOGIN PASSWORD 'scraper23';
CREATE DATABASE githubcodereview;
\c githubcodereview
GRANT ALL ON SCHEMA public TO scraper;

To clean public schema:

psql postgres
\c githubcodereview
DROP SCHEMA public CASCADE;
CREATE SCHEMA public;
GRANT ALL ON SCHEMA public TO scraper;

To build project with:

./gradlew clean build  --stacktrace

To run project under ./build/libs/ (add github token in application.properties)

cp ./src/main/resources/application.properties ./build/libs/
cp ./src/main/resources/prs_with_review_comments.csv ./build/libs/
cd ./build/libs
chmod +x  prscraper-1.0.1.jar
java -jar prscraper-1.0.1.jar

New, configured run instructions:o Move to python-script project

mv build/libs/prscraper-1.0.1.jar ../cr-order/pr-scraper/

Then from there in root execute:

java -jar pr-scraper/prscraper-1.0.1.jar --reposToMine=adap/flower

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages