Skip to content
  • Integrated with readthedocs
  • Integrated with Travis CI
  • Improved documentation
  • Cleaned up sorts
Assets 2
  • Added script/maria2csv.py for converting MySQL/MariaDB dumps to CSV format.
  • Major rework of script/createlinks.sh script: Switched from column-number-based access to column-name-based access (this now includes Wikis that have a different DB layout, i.e. columns are not in the standard order).
  • Removed second dictionary from danker/danker.py by altering i and i+1 PageRank score positions with iterations in a single dictionary.
Assets 2

@athalhammer athalhammer released this Apr 24, 2019 · 4 commits to master since this release

  • Moved scripts to own folder script.
  • Renamed lib folder to danker.
  • Added setup.py.
Assets 2

@athalhammer athalhammer released this Mar 27, 2019 · 4 commits to master since this release

  • Major Python code cleanup.
  • Added asserts for sorted input.
  • Implemented test cases.
Assets 2

In previous versions of this code, there was no strict separation between iterations. In this case, the order of the nodes can start to play a role (higher numbers are updated later and can therefore make use of mostly already updated scores from the incoming links). Over multiple iterations, this could introduce a skew that we want to avoid.

Assets 2

@athalhammer athalhammer released this Sep 30, 2018 · 4 commits to master since this release

  • Wikipedia categories were not considered the previous version. Now these important pages (that all have also Wikidata Q-IDs) are also reflected in the computations.

  • Link-files are now compressed after computation. This safes disk space. For the ALL option, also some statistics can be found in the output (i.e., number of links per language).

Assets 2

@athalhammer athalhammer released this Sep 3, 2017 · 4 commits to master since this release

We release the first stable version of danker. Current features include:

  • Compute PageRank on any Wikipedia language edition.
  • Compute PageRank with the BIGMEM option (faster)
  • Compute PageRank over the union set (bag semantics) of links of ALL Wikipedia language editions.
Assets 2
You can’t perform that action at this time.