Tools for working with taxonomic names
Reconciling small variations in taxonomic names facilitates the integration of biological names-based data. This tool matches a query list of (parsed) taxonomic names (List A) against a reference list (List B), according to a set of taxonomic rules (described below). The taxonomic rules are most appropriate for plant names, as specified by the International Code of Botanical Nomenclature. Can also perform approximate (fuzzy) matching to identify variations (e.g., misspelling) in binomial names and author strings. An output status code is given for each type of match.
See the man page for more details.
Split biological names into component parts:
- Genus hybrid sign
- Genus name
- Species hybrid sign
- Specific epithet
- Infraspecific rank signifier (“subsp.”, “var.”, etc.)
- Infraspecific epithet
- Name’s author string
Most of the work is done by a single regular expression. See the man page for more details.
All tools are AWK scripts for use with the Gawk flavor of AWK.
matchnames uses the
aregex Gawk extension. Before
aregex.so file must be present in a
$AWKLIBPATH (of both user and root).