Parse addresses with no address number and two words as street name #258

Open: wants to merge 1 commit into base: master
14 changes: 14 additions & 0 deletions measure_performance/test_data/new_HC_tests.xml
@@ -0,0 +1,14 @@
<AddressCollection>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>rte</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>46</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>992</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>R</USPSBoxGroupType> <USPSBoxGroupID>32</USPSBoxGroupID> <USPSBoxType>Box</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>e3</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>StaR</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>75</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>5Z</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>72</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>1A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>R</USPSBoxGroupType> <USPSBoxGroupID>4C</USPSBoxGroupID> <USPSBoxType>Box</USPSBoxType> <USPSBoxID>54</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>3</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>5</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONtraCT</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>56</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>45C</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>88</USPSBoxGroupID> <USPSBoxType>bOX</USPSBoxType> <USPSBoxID>76E&#8221;</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>4e</USPSBoxGroupID> <USPSBoxType>box</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>32&#8221;</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>rte</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>32</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>1232</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONtraCT</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>1</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>77Z</USPSBoxID></AddressString>
</AddressCollection>

25 changes: 25 additions & 0 deletions training/new_HC_addresses.xml
@@ -0,0 +1,25 @@
<AddressCollection>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>68</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>23A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>RTE</USPSBoxGroupType> <USPSBoxGroupID>24</USPSBoxGroupID> <USPSBoxType>box</USPSBoxType> <USPSBoxID>2A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>rte</USPSBoxGroupType> <USPSBoxGroupID>15B</USPSBoxGroupID> <USPSBoxType>bOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>1A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>StaR</USPSBoxGroupType> <USPSBoxGroupType>Rte</USPSBoxGroupType> <USPSBoxGroupID>12A</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>455B</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>star</USPSBoxGroupType> <USPSBoxGroupType>route</USPSBoxGroupType> <USPSBoxGroupID>24</USPSBoxGroupID> <USPSBoxType>box</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>45</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>StaR</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>68</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>23A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>hc</USPSBoxGroupType> <USPSBoxGroupType>RTE</USPSBoxGroupType> <USPSBoxGroupID>102</USPSBoxGroupID> <USPSBoxType>Box</USPSBoxType> <USPSBoxID>255D</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>12A</USPSBoxGroupID> <USPSBoxType>boX</USPSBoxType> <USPSBoxID>285</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HWY</USPSBoxGroupType> <USPSBoxGroupType>Contract</USPSBoxGroupType> <USPSBoxGroupType>RTE</USPSBoxGroupType> <USPSBoxGroupID>68</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>98A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>99</USPSBoxGroupID> <USPSBoxType>boX</USPSBoxType> <USPSBoxID>22B</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONtraCT</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>95</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>235A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>12A</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>285</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>5a</USPSBoxGroupID> <USPSBoxType>box</USPSBoxType> <USPSBoxID>32</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>b34</USPSBoxGroupID> <USPSBoxType>BoX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>55</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>45ac</USPSBoxGroupID> <USPSBoxType>box</USPSBoxType> <USPSBoxID>653</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>hwy</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>route</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>15B</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>1A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HWY</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>102</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>255A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>stAR</USPSBoxGroupType> <USPSBoxGroupType>RTE</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>95</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>45C</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>STAR</USPSBoxGroupType> <USPSBoxGroupType>RTE</USPSBoxGroupType> <USPSBoxGroupID>102</USPSBoxGroupID> <USPSBoxType>Box</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>95</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>star</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>15B</USPSBoxGroupID> <USPSBoxType>bOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>102</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HIGHWAY</USPSBoxGroupType> <USPSBoxGroupType>CONTRACT</USPSBoxGroupType> <USPSBoxGroupType>rte</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>24</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>2A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>23</USPSBoxGroupID> <USPSBoxType>Box</USPSBoxType> <USPSBoxID>#</USPSBoxID> <USPSBoxID>66A</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>rte</USPSBoxGroupType> <USPSBoxGroupID>#</USPSBoxGroupID> <USPSBoxGroupID>95</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>235A</USPSBoxID></AddressString>
</AddressCollection>
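
For reviewers who want to sanity-check the labeling in the two files above, here is a minimal sketch of reading this tagged format into (token, label) pairs with Python's standard `xml.etree.ElementTree`. The helper names `read_labeled_addresses` and `raw_string` are hypothetical and are not taken from this repository; the project's own loader may behave differently.

```python
import xml.etree.ElementTree as ET

# Two records copied from the files in this diff, used as inline sample data.
SAMPLE = """<AddressCollection>
<AddressString><USPSBoxGroupType>HCR</USPSBoxGroupType> <USPSBoxGroupID>99</USPSBoxGroupID> <USPSBoxType>boX</USPSBoxType> <USPSBoxID>22B</USPSBoxID></AddressString>
<AddressString><USPSBoxGroupType>HC</USPSBoxGroupType> <USPSBoxGroupType>ROUTE</USPSBoxGroupType> <USPSBoxGroupID>3</USPSBoxGroupID> <USPSBoxType>BOX</USPSBoxType> <USPSBoxID>5</USPSBoxID></AddressString>
</AddressCollection>"""


def read_labeled_addresses(xml_text):
    """Return one list of (token, label) pairs per <AddressString>.

    Hypothetical helper: the label is the child element's tag name and
    the token is its text content.
    """
    root = ET.fromstring(xml_text)
    addresses = []
    for address in root.iter("AddressString"):
        addresses.append([(child.text or "", child.tag) for child in address])
    return addresses


def raw_string(labeled):
    """Rebuild the unlabeled address by joining tokens with single spaces."""
    return " ".join(token for token, _ in labeled)


if __name__ == "__main__":
    for labeled in read_labeled_addresses(SAMPLE):
        print(raw_string(labeled), labeled)
```

Joining the tokens back into a plain string makes it easy to eyeball each record and confirm that every token carries the intended label, for example that a standalone `#` is tagged with the same label as the ID that follows it.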