ScienceBeam Orchester

Configuration

An example configuration is provided in the example-config directory. Please copy it to config.
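The copy step can be sketched as a small shell function. This is a minimal sketch, not part of the repository: the function name `copy_example_config` is hypothetical, and the check that avoids overwriting an existing config directory is an added assumption about what is desirable.

```shell
# Hypothetical helper (not part of the repository).
# Copies the example configuration to the target directory,
# but leaves an already existing target untouched.
copy_example_config() {
  src="${1:-example-config}"
  dest="${2:-config}"
  if [ -e "$dest" ]; then
    echo "$dest already exists, leaving it untouched"
  else
    cp -r "$src" "$dest"
    echo "copied $src to $dest"
  fi
}
```

Called without arguments from the repository root, `copy_example_config` copies example-config to config.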

Datasets

Datasets describe where the data to be converted comes from. In general, a dataset describes a set of files.

Datasets are configured in ./config/datasets, with each .sh file describing one dataset.

Tools

Tools are used to convert files. Currently they configure the ScienceBeam pipeline.

Tools are configured in ./config/tools, with each .sh file describing one tool.
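Since each dataset and each tool is one .sh file, the configured entries can be listed from the file names. The following is a minimal sketch: the function name `list_config_names` is hypothetical, and it assumes (this is not confirmed by the README) that the file name without the .sh extension is the name passed to --dataset or --tool.

```shell
# Hypothetical helper (not part of the repository).
# Prints the name of every .sh file in the given directory, without the extension.
# Assumption: that name is what --dataset / --tool expect.
list_config_names() {
  dir="$1"
  for f in "$dir"/*.sh; do
    # Skip the unexpanded glob pattern when the directory has no .sh files.
    [ -e "$f" ] || continue
    basename "$f" .sh
  done
}
```

For example, `list_config_names ./config/datasets` and `list_config_names ./config/tools` would print one name per line.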

Run All

By default, the container for the corresponding tool is started and stopped from within the sciencebeam-orchester container. To run the conversion for all configured datasets and tools:

docker-compose run --rm sciencebeam-orchester ./run-all.sh convert

For an individual dataset and conversion tool:

docker-compose run --rm sciencebeam-orchester \
  ./run-all.sh \
  --dataset pmc-1943-cc-by-sample \
  --tool grobid-tei \
  --force \
  --limit 1000 \
  --workers 10 \
  convert
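To convert one dataset with a handful of tools rather than all of them, the command above can be repeated per tool. The sketch below only prints the commands so they can be reviewed before running. The function name `print_convert_commands` is hypothetical, and whether a given tool name is configured in ./config/tools is an assumption to check locally.

```shell
# Hypothetical helper (not part of the repository).
# Prints, without running it, one convert command per tool for a single dataset.
# First argument: dataset name. Remaining arguments: tool names.
print_convert_commands() {
  dataset="$1"
  shift
  for tool in "$@"; do
    echo "docker-compose run --rm sciencebeam-orchester ./run-all.sh --dataset $dataset --tool $tool convert"
  done
}
```

For example, `print_convert_commands pmc-1943-cc-by-sample grobid-tei scienceparse-v2` prints two commands. Piping the output to `sh` would run them in sequence.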

To produce the evaluation report:

docker-compose run --rm sciencebeam-orchester ./run-all.sh evaluation-report

Running individual containers

Create the containers without starting them:

docker-compose up --no-start

Start:

docker-compose start sciencebeam-orchester

docker-compose start scienceparse-v2

docker-compose run --rm sciencebeam-orchester ./run.sh \
  --dataset pmc-1943 --tool scienceparse-v2 convert
