No description, website, or topics provided.
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
app
docs
src
.gitignore
LICENSE
Makefile
README.md
Setup.hs
package.yaml
stack.yaml

README.md

wikiwc

A Wikipedia word frequency counter.

This project makes use of Wikipedia_Extractor to pre-process a full Mediawiki dump into basically plain text files. It then parses these files into separate words, and counts the number of occurences of each word.

Usage

As a default, wikiwc downloads the german wikipedia.

$ make WIKILANG=en