Skip to content

rsnyder/wikidata-dumpfile-utilities

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Wikidata dumpfile utilities

Simple utility programs for processing Wikidata dump files.

Setup

Create and activate python virtual environment

python3 -m venv .venv
source .venv/bin/activate

Add dependencies

pip install -r requirements.txt

Run simple pipeline

This pipeline prints the QID and label for all people found in the first 100 entities in the dump file. The Wikidata dump file is assumed to reside in the current directory in the file latest-all.json.bz2. If the dump file is located elsewhere the --path argument must be used to specify the location.

./read.py --limit 100 | ./filter.py --people | ./transform.py

About

Wikidata search

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages