Test cases and other data for training and testing address matching algorithms.
Test cases are held in tab-separated format files with the following columns:
- name — a name for the test case which should be unique across all testcases
- text — the address encoded as lines of text, newlines are encoded as '\n'
- addresses — a list of the expected address register values separated by ';'.
A test case may contain additional fields for information.
$ make
The software in this project is open source, covered by LICENSE file.
The data held in this repository is © Crown copyright and available under the terms of the Open Government 3.0 licence.
Data downloaded by the build process may be covered by different copyright and terms.