There are two sets of datasets for Dictionaria: The internal datasets and the external datasets. External datasets are the ones that have been released to the public. Internal datasets are the ones that are still being worked on and that get served to a separate instance of the webapp, so the editors can give the authors a preview on what their dictionaries will look like.
The metadata for both internal and external datasets is contained in the private
dictionaria-intern repository. The
clld initdb script will look for a clone
of this repository in
also contains a readme file on how to curate the datasets.
clld initdb script will ask two questions:
- Do you want to load the internal or external datasets into the database?
- Do you want to add all or just one specific dataset to the database? This is useful for quick testing during development – repopulating the entire database can take a minute or two.
We usually load the internal and external datasets into separate databases:
$ clld initdb development.ini ... [i]nternal or [e]xternal data (default: e): e dictionary id or 'all' for all dictionaries (default: all): all $ clld initdb development-intern.ini ... [i]nternal or [e]xternal data (default: e): i dictionary id or 'all' for all dictionaries (default: all): all