Skip to content
/ uveira Public

Offline Wikipedia CLI for dumpster-dive MongoDB imports

Notifications You must be signed in to change notification settings

mrusme/uveira

Repository files navigation

Uveira

Command line client for MongoDB Wikipedia imports via dumpster-dive.

Build

$ go build

Usage

In order for Uveira to be able to connect to the database the config has to be set as environment variables:

export UVEIRA_MONGO_URI="mongodb://user:password@mongodb-host:27017" 
export UVEIRA_DATABASE="enwiki"
export UVEIRA_COLLECTION="pages"

Only then the tool can be used:

$ uveira -help
Usage of ./uveira:
  -te string
        Query by title (exact)
  -to
        List titles only in query results
  -tr string
        Query by title (RegEx)

Examples

Get a page by its exact title (case sensitive!):

$ uveira -te "Tesseract"

Get pages by using a RegEx query on their titles:

$ uveira -tr "^Tesseract.*"

Get only page titles by using a RegEx query on their titles:

$ uveira -tr "^Tesseract.*" -to