KewBridge is an effort to prioritise, build and document datasets which can be used to train machine learning applications.
Scoping and prioritising this work is likely to involve a fair amount of discussion, which we will try to hold in the open using the discussions feature.
TBC
- July 2023 First student placement (Pausali Sengupta, 8 weeks)
- August 2023 Imageomics workshop
- September 2023 Second student placement (Eren Karabey, 12 months)
- specimens2illustrations - extract illustrations and captions from monographs, process the captions to find the referenced specimens, and segment composite illustration images into their separate components
- gbifocc-datasette - a Datasette instance populated with a GBIF occurrence download and configured to act as a reconciliation endpoint, so that a collector name and number can be translated into the associated GBIF occurrence
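
To illustrate the kind of lookup that gbifocc-datasette is meant to support, here is a minimal sketch of matching a collector name and number to occurrence identifiers. It is not the project's implementation: it uses an in-memory index rather than Datasette or a reconciliation endpoint, the helper names (`normalise`, `build_index`, `reconcile`) are hypothetical, and the sample records are invented. It assumes the download exposes the `gbifID`, `recordedBy` and `recordNumber` columns.

```python
import re
import unicodedata


def normalise(text):
    """Lowercase, strip accents and punctuation, and collapse whitespace."""
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    text = re.sub(r"[^a-z0-9 ]+", " ", text.lower())
    return " ".join(text.split())


def build_index(occurrences):
    """Map (collector name, collector number) to a list of gbifID values."""
    index = {}
    for occ in occurrences:
        key = (normalise(occ["recordedBy"]), normalise(occ["recordNumber"]))
        index.setdefault(key, []).append(occ["gbifID"])
    return index


def reconcile(index, collector, number):
    """Return the gbifID values matching a collector name and number."""
    return index.get((normalise(collector), normalise(number)), [])


# Hypothetical records, invented for illustration only.
occurrences = [
    {"gbifID": "1001", "recordedBy": "Smith, J.", "recordNumber": "123"},
    {"gbifID": "1002", "recordedBy": "Smith, J.", "recordNumber": "124"},
    {"gbifID": "1003", "recordedBy": "García, M.", "recordNumber": "45"},
]

index = build_index(occurrences)
print(reconcile(index, "smith j", "123"))
```

Exact matching on normalised strings is the simplest possible choice here. Real collector names vary far more (initials, name order, teams of collectors), so a working service would likely need fuzzier matching and candidate scoring.
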
TBC
- Nicky Nicolson (n.nicolson@kew.org)