Skip to content

NAH2NAH/address-matching-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 

Repository files navigation

Address matching data

Test cases and other data for training and testing address matching algorithms.

Test case format

Test cases are held in tab-separated format files with the following columns:

  • name — a name for the test case which should be unique across all testcases
  • text — the address encoded as lines of text, newlines are encoded as '\n'
  • addresses — a list of the expected address register values separated by ';'.

A test case may contain additional fields for information.

Building bulk datasets

$ make

Licence

The software in this project is open source, covered by LICENSE file.

The data held in this repository is © Crown copyright and available under the terms of the Open Government 3.0 licence.

Data downloaded by the build process may be covered by different copyright and terms.

About

Test data for matching addresses

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published