Calculates PageRank over citation data.
- Python 2 or 3 (Python 3 preferred)
- igraph
pip install -r requirements.txt
First generate or download the citation links, e.g. using datacapsule-occ.
Then run:
./pagerank.sh \
--links-path=data/doi-citation-links.csv \
--pagerank-output-path=data/doi-citations-pagerank.csv \
--source-column=citing_doi \
--target-column=cited_doi
The output will be data/doi-citations-pagerank.csv with the following columns:
- citing_doi (or the name given by --source-column)
- pagerank - the PageRank score; the file is sorted by this value (descending)
- incoming - number of incoming connections
- outgoing - number of outgoing connections
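
To illustrate what the output columns mean, here is a minimal pure-Python sketch that computes the same four values for a small in-memory list of links. It is not the code this project runs (that uses igraph); it is a plain power-iteration PageRank written under assumptions of my own: a damping factor of 0.85, 100 iterations, duplicate links counted once, and nodes without outgoing links spreading their rank evenly over all nodes. The function name `pagerank_rows` and the sample DOIs are hypothetical.

```python
def pagerank_rows(links, damping=0.85, iterations=100):
    """Return (node, pagerank, incoming, outgoing) rows, sorted by pagerank descending.

    `links` is an iterable of (source, target) pairs, e.g. (citing_doi, cited_doi).
    Assumption: duplicate links are counted once.
    """
    out_links = {}  # node -> set of nodes it links to
    in_count = {}   # node -> number of distinct incoming links
    for source, target in links:
        for node in (source, target):
            out_links.setdefault(node, set())
            in_count.setdefault(node, 0)
        if target not in out_links[source]:
            out_links[source].add(target)
            in_count[target] += 1

    n = len(out_links)
    if n == 0:
        return []

    # Start from a uniform distribution and iterate.
    rank = dict.fromkeys(out_links, 1.0 / n)
    for _ in range(iterations):
        # Rank held by nodes without outgoing links is shared by all nodes.
        dangling = sum(rank[node] for node, targets in out_links.items() if not targets)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = dict.fromkeys(out_links, base)
        for source, targets in out_links.items():
            for target in targets:
                new_rank[target] += damping * rank[source] / len(targets)
        rank = new_rank

    rows = [
        (node, rank[node], in_count[node], len(out_links[node]))
        for node in out_links
    ]
    rows.sort(key=lambda row: row[1], reverse=True)
    return rows


if __name__ == "__main__":
    # Hypothetical sample data: (citing_doi, cited_doi) pairs.
    sample_links = [
        ("10.1/a", "10.1/c"),
        ("10.1/b", "10.1/c"),
        ("10.1/c", "10.1/d"),
    ]
    for node, score, incoming, outgoing in pagerank_rows(sample_links):
        print(node, round(score, 4), incoming, outgoing)
```

The scores from this sketch will not necessarily match the igraph-based output exactly, since the handling of duplicate links and dangling nodes here is an assumption rather than a copy of igraph's behaviour.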
GPL due to igraph. (This may change in the future when switching to another library.)
Code that doesn't rely on GPL-licensed code may be licensed under MIT.