-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
map DigitalResources #13
Comments
This is a great analysis. Part number is designed to be for presentation only (ideally), and order number to preserve sequence when there shouldn't be a displayed part number (i.e. for a cover). This accounts for the number mismatch. Nonetheless, sometimes a part number was assigned, incorrectly, for a part type that shouldn't have a displayed number (i.e. cover). I can try to scrub these out and deliver a new file with just the correct part numbers. I will check the records to find the audio URL -- that is a mistake and should be corrected, somehow. |
(I assumed "track" is an audio file). @workergnome @azaroth42 here's a proposed mapping, please comment. <aaa/object/(FKITEMID)> crm:P65_shows_visual_item <(FULLSIZE)>.
<(FULLSIZE)> a crm:E38_Image;
crm:P2_has_type <aaa/thesaurus/part/urlify(PARTTYPE)>;
crm:P1_is_identified_by <aaa/image/id/(DIGITALRESOURCEID)>;
crmx:number "(PARTNUM)"^^xsd:integer; # skip 0
crmx:sort_oder "(ORDER)"^^xsd:integer.
<aaa/image/id/(DIGITALRESOURCEID)> a crm:E42_Identifier;
rdf:value "(DIGITALRESOURCEID)".
<aaa/thesaurus/part/urlify(PARTTYPE)> a skos:Concept;
skos:prefLabel "(PARTTYPE)";
skos:inScheme <aaa/thesaurus/part/>.
<aaa/thesaurus/part/> a skos:ConceptScheme;
skos:prefLabel "Object parts (page, cover, etc)". To be more CRM compliant (but slightly more wordy), we could say the object has several features, and each feature carries one image. Here P2 matches much better (a feature can be a Cover, but an image cannot be a Cover: it's just a depiction of the cover). <aaa/object/(FKITEMID)> crm:P56_bears_feature <aaa/feature/(DIGITALRESOURCEID)>.
<aaa/feature/(DIGITALRESOURCEID)> a crm:E25_Man-Made_Feature;
crm:P2_has_type <aaa/thesaurus/part/urlify(PARTTYPE)>;
crm:P1_is_identified_by <aaa/feature/(DIGITALRESOURCEID)/id>;
crmx:number "(PARTNUM)"^^xsd:integer; # skip 0
crm:P65_shows_visual_item <(FULLSIZE)>.
<(FULLSIZE)> a crm:E38_Image;
crmx:sort_oder "(ORDER)"^^xsd:integer.
<aaa/feature/(DIGITALRESOURCEID)/id> a crm:E42_Identifier;
rdf:value "(DIGITALRESOURCEID)".
<aaa/thesaurus/part/urlify(PARTTYPE)> a skos:Concept;
skos:prefLabel "(PARTTYPE)";
skos:inScheme <aaa/thesaurus/part/>.
<aaa/thesaurus/part/> a skos:ConceptScheme;
skos:prefLabel "Object parts (page, cover, etc)". |
Not for the current work. |
@VladimirAlexiev: Just to be clear, how are you accounting for multiple images per item? It looks like you're planning on using p65 to link to the image (using the FULLSIZE URI as the resource identifier), but we might have multiple images per Item. Maybe I'm not reading the mapping correctly, but it looks like it assumes a one-to-one relationship for items to images. We do have a representative image for each item that is stored in the has_representation field in Items that corresponds to an entry in the Item_DigitalResources spreadsheet, but many items will have more than one matching image (and these would link based on the fkItemID). If there's a part of the mapping that I'm not understanding, I'd appreciate getting a better sense of how multiple records are being handled. Thanks! |
@VladimirAlexiev : The Item_DigitalResources file is now updated to reflect changes to our part numbers (scrubbing them for part types that shouldn't have them). |
@tobiashreiter As any RDF property, p65 can (and in this case will) be used multiple times. |
If so, can you please use has_representation and assign a "classified by" property of AAT:300404670 (preferred terms), so it's clear that this representative image is the manually curated representation for the item? |
In the end I ended up mapping this slightly differently (I was in a rush, and not always looking at the GitHub issues when doing my mapping). Right now, I have a has_representation that links to the primary image for the object, and then links out (also using has_representation) to the digital resource records. I'm happy to drop the additional records if we really can't handle that now. |
@tobiashreiter: "The Item_DigitalResources file was updated on May 15th to use our publicly facing URL".
That file has very rich info and I think should be mapped. Columns:
Analysis of PARTTYPE:
Comparing PARTNUM vs ORDER (ignoring PARTNUM=0):
So they are different in 3% of all rows.
Analysis of PARTTYPE for those that have PARTNUM:
I think we should map both PARTNUM and ORDER:
Every image is used only once:
So we can use FULLSIZE rather than DIGITALRESOURCEID for the URL.
The text was updated successfully, but these errors were encountered: