Skip to content

greencardamom/Pgcount

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Pgcount

by User:GreenC (en.wikipedia.org)

June 2020-2024

MIT License

Info

Pgcount generates Wikipedia:List of Wikipedians by article count

  • Designed for unlimited scalability, Wikipedia database size does not matter.
  • Low memory and CPU use.
  • Designed to fail and recover mid-process, state information is preserved.
  • No SQL or queries, API driven.
  • Caches between runs.
  • Flexible for use with multiple wiki languages.

Dependencies

  • GNU Awk 4.1+
  • BotWikiAwk (version Jan 2019 +)
  • A bot User account with bot permissions for your target wiki.

Installation

  1. Install BotWikiAwk following setup instructions. Add OAuth credentials to wikiget, see the EDITSETUP instructions.

  2. Clone Pgcount. For example: git clone https://github.com/greencardamom/Pgcount

  3. Set ~/Pgcount/pgcount.awk to mode 750, and change the first shebang line to the location of awk on your system

  4. Edit pgcount.awk in the "BEGIN{}" section is a place for you email address to send error reports to, and a few harded coded paths for common unix utilities.

Running

Example crontab entry

4 3 1 * * /home/greenc/toolforge/pgcount/pgcount.awk -h en -d wikipedia.org

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages