-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Spatial notation "99" is (ab)used as addtional free text field in source data #17
Comments
We could use
|
An even better approach would be imo, if we mapped the strings to actual resources and used |
Which property should be used to link to openstreetmap.org , be it a relation,node or a way ? |
To have this information somewhere here some numbers on the coverage of notation "99" in NWBib data. As of today:
|
As Simon Schneider suggested at the Friday meeting, we should rather link to http://linkedgeodata.org although there are some properties in the data that don't dereference, i.e. that are not part of a documented vocabulary. Relation URIs look like http://linkedgeodata.org/triplify/relation66555. I propose using the same property (
|
Issue hbz/nwbib#17 spatial 99
With lobid/lodmill#499 deployed the data is partly there, see e.g. http://lobid.org/resource/HT015617942. Some stats @acka47:
|
After new indexing at staging we now have 236.838 records with newly generated links to geonames and linkedgeodata (slightly more than the proclaimed 234.082). |
Looks great. I found one entry, where it didn't work, though: http://lobid.org/resource/HT017502874/about. |
Yes, that is one of the 393 problematic cases where the lookup fails. There are two types errors: HTTP status code 400 and 420. The former is a HTTP response code indicating a real problem with the URL, etc |
+1 for this, we have to fix notation 97 and 96 though, see #53 |
Ok, these are other issues. Closing this one. |
Just nopticed that there are now a lot of errors in the data. Examples:
These haven't been there from the beginning but I obviously missed to test properly before saying "+1"... Reopening. |
I remember there was a problem regarding data parsing. The error was fixed, but the geo api lookups resulting in false data was stored in database and this was not cleansed. |
The links for "Düsseldorf" are better. "Köln" links to "Altstadt-Nord" in GeoNames, though. See for example http://test.lobid.org/nwbib/BT000088884. |
This "wrong" link to geonames is a result of the way we build these links - there must be a better way, but for mow it's just like that: I remember that asking geonames with the literals directly too often it results in no result at all. |
You are right that LinkedGeoData is better in terms of geo coordinates as we get polygons from it. On the other side GeoNames is better for multilingual labels (which we currently don't need, though)... |
We have to improve this in the future. Maybe we should leave out the Geonames links in the HTML for now... |
I still think we wrongly invested in this upfront, and think we should not put additional work into this until more pressing issues are solved (first and foremost, reliable daily updates with monitoring). |
See hbz/nwbib#17 * pass-through of '<' and '>' to be used by following morphs * ',' and especially '-' are necessary when lookup nominatim API * avoid conflation of words by substituting '(','/',')' with '%20' * add tests
Pass the unmodified literal to build the query. Clean the literal to be used as JSONP callback ID in the geo morph, NOT already in the preceding morph. See hbz/nwbib#17.
BTW, it is similar with notations 96 & 97, see #53. |
Closing, as the original issue is solved. |
The notation "99" is used in the data (see http://test.lobid.org/nwbib/search?query=n99) but doesn't show up in the classification (SKOS, legacy classification).
This is so because the notation is used as a free text field for information on places, indicating
99
in subfield 'a' and the free text for the place in subfield 'b' of field 700n. In MAB, this looks like:700n |a 99 |b Hilden
Example values are "Hilden", "Köln", "Recklinghausen" and "Nettetal".
Mapping notation '99' to
http://purl.org/lobid/nwbib-spatial#n99
isn't correct then and we should think of a way to get out the free text and add it to the RDF.The text was updated successfully, but these errors were encountered: