This repo contains a modified version of the ETRC tool (GitHub repo) that downloads code review data for multiple repositories, given a CSV file with repository names and pull request numbers.
To create the postgres database:
brew doctor
brew update
brew install postgresql@15
export PATH="/usr/local/opt/postgresql@15/bin:$PATH"
brew services start postgresql@15
psql postgres
CREATE ROLE scraper WITH LOGIN PASSWORD 'scraper23';
CREATE DATABASE githubcodereview;
\c githubcodereview
GRANT ALL ON SCHEMA public TO scraper;
To clean the public schema:
psql postgres
\c githubcodereview
DROP SCHEMA public CASCADE;
CREATE SCHEMA public;
GRANT ALL ON SCHEMA public TO scraper;
To build the project:
./gradlew clean build --stacktrace
To run the project from ./build/libs/ (first add your GitHub token to application.properties):
cp ./src/main/resources/application.properties ./build/libs/
cp ./src/main/resources/prs_with_review_comments.csv ./build/libs/
cd ./build/libs
chmod +x prscraper-1.0.1.jar
java -jar prscraper-1.0.1.jar
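The two copy steps above fail quietly if a file is missing, and the JAR then starts without its config or input CSV. A minimal sketch of a guard follows; `stage_run_files` is a hypothetical helper, not part of this project, and it assumes the two file names used in the commands above.

```shell
#!/bin/sh
# Sketch only: stage_run_files is a hypothetical helper, not part of the project.
# It copies the config and the input CSV next to the JAR and stops early
# if either file is missing from the source directory.
stage_run_files() {
    src_dir=$1    # e.g. ./src/main/resources
    dest_dir=$2   # e.g. ./build/libs
    for f in application.properties prs_with_review_comments.csv; do
        if [ ! -f "$src_dir/$f" ]; then
            echo "missing: $src_dir/$f" >&2
            return 1
        fi
        cp "$src_dir/$f" "$dest_dir/" || return 1
    done
}
```

It could be called as `stage_run_files ./src/main/resources ./build/libs` before changing into ./build/libs and starting the JAR.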
New, configured run instructions: move the JAR to the python-script project:
mv build/libs/prscraper-1.0.1.jar ../cr-order/pr-scraper/
Then, from the root of that project, execute:
java -jar pr-scraper/prscraper-1.0.1.jar --reposToMine=adap/flower
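To mine several repositories with the same command, one option is to call the JAR once per repository. The sketch below assumes `--reposToMine` takes a single owner/name value per run, as in the command above; whether the flag also accepts a list is not stated here. `run_for_repos` and `JAVA_CMD` are hypothetical names introduced for illustration.

```shell
#!/bin/sh
# Sketch only: run_for_repos and JAVA_CMD are hypothetical, not part of the project.
# Runs the scraper once per repository passed as an argument and stops at
# the first failure. Set JAVA_CMD="echo java" to print the commands
# instead of executing them (dry run).
run_for_repos() {
    for repo in "$@"; do
        ${JAVA_CMD:-java} -jar pr-scraper/prscraper-1.0.1.jar "--reposToMine=$repo" || return 1
    done
}
```

For example, `run_for_repos adap/flower` is equivalent to the single command above; further owner/name arguments can be appended.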