This documents serves to outline the tools currently included in the SDR, their purpose and interfaces.
Contents:
- Teklia Machine Learning tools
- Third party image processing tools
- Input file processing
- Output file processing
- Misc
The Teklia segmentation tool implements allows the identification of regions of interest, identifying alternatively samples or lines of text.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Training model
- Description: Selection of models for use in the Teklia tools
- Type: [text] (options: "pinned_insects", "line_synthesys")
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
The Teklia Handwritten Text Recognition tool reads handwritten text in identified regions utilising the Kaldi library.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Training model
- Description: Selection of models for use in the Teklia tools
- Type: [text] (options: "kaldi_synthesys")
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
The Teklia Named Entity Recognition tool implements the Spacy library.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Training model
- Description: Selection of models for use in the Teklia tools
- Type: [text] (options: "pinned_insects")
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
Mothra, is a tool for automatically measuring the wingspan of lepidopterans. When provided with an image of a lepidopteran alongside scale bar, it attempts to produce a set of measurements.
- Image
- Description: The input image on which Mothra should be run
- Type: [PNG, JPG, JPEG, TIFF, TIF]
- Detailed plot
- Description: Flag to determine if Mothra should produce debugging information on output images
- Type: [bool]
- Mothra Image
- Description: The processed image output by Mothra
- Type: [JPG] (collection)
- Mothra Report
- Description: A csv file produced by Mothra detailing the measurements that it made
- Type: [csv]
This tool reads barcodes from specimen images using the bardecocde library.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- File location
- Description: Name of the input file
- Type: [json]
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
Splits a file line by line into seperate entries in a collection
- Input
- Description: The file which is to be split
- Type: [generic text]
- Split files
- Description: The multiple output files generated by the split
- Type: [generic text] (collection)
Create a specimen data object for de novo digitisation from a SDR CSV file.
- Input
- Description: The SDR specified CSV file to be used in the digitisation
- Type: [csv]
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
WARNING: DEPRECATED
Create a specimen data object for de novo digitisation
- Input
- Description: Input image
- Type: [png]
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
Saves the output of a digitisation to disk as an RO Crate
- Bundle Type
- Description: Selector to specify if wishing to bundle individual datasets or a collection
- Type: [select] (options: "Individual dataset", "List Collection")
- Description: Selector to specify if wishing to bundle individual datasets or a collection
- if Individual dataset, Single Files
- Description: Specify one or more files for bundling
- Type: [data]
- if List Collection, Data Collection
- Description: Specify a collection for bundling
- Type: [data] (collection)
- ROCrate
- Description: A Research Object Crate containing the input datasets / collections
- Type: [zip]
Create a flat csv from a specimen data object
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Input
- Description: A flattened representation of the OpenDS object
- Type: [csv]
Saves the output of a digitisation to (the server's) disk
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Batch ID
- Description: A user supplied identifier, under which to save the output
- Type: [text]
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
WARNING: DEPRECATED
A tool to validate a specimen data object.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- Validation report
- Description: Success or failure of validation
- Type: [text]
WARNING: DEPRECATED
Download an image to the local filesystem and append file info to opends object.
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]
- OpenDS specimen object
- Description: Incrementally built Fair Digital Object collating specimen data and annotations
- Type: [json]