Skip to content

jonasengelmann/worldcat-reconciliation-service

Repository files navigation

Worldcat-reconciliation-service

License

OpenRefine reconciliation service for Worldcat.

Implemented query properties are author and publication_year.

If there is only need to disambiguate conceptual works, i.e. specific realizations and editions are treated as the same entry, an extension service is provided that allows to extract all OCLC numbers of all editions. Thereby a set of OCLC numbers can be used to unambiguously identify a conceptual work, requiring an entry only to be matched to an arbitrary edition of the work. The extension service can be used as follows:

Edit column -> Add columns from reconciled values... -> OCLC Numbers of all Editions

Run via Docker-Compose (Recommended)

cp .env.docker-compose .env
docker-compose up --build

The reconciliation service should now be accessible at http://localhost.

Development

The reconciliation service requires Selenium and Redis. The easiest way is to run these services via Docker:

docker run --rm -it -p 4444:4444 --shm-size 2g seleniarm/standalone-chromium:latest
docker run --rm -it -p 6379:6379 redis:latest

Setup pre-commit hooks:

pre-commit install
pip3 install -r requirement.txt

Start the reconciliation service:

uvicorn main:app --reload --port 8000 --env-file .env.dev

The reconciliation service should now be accessible at http://localhost:8000.

License

This project is licensed under MIT license - see the LICENSE file for more information.