Replies: 7 comments 21 replies
-
As cataloguing processes move towards entity management, the use and addition of persistent identifiers and URIs in records (or metadata works) is a necessity. Further, enabling linked data and explore task presupposes that third-party identifiers and URIs will be likely used. Library of Congress has published its data in id.loc.gov and has also provided guidelines for the use of the $0 $1 subfields, i.e., Report, FAQs and Wikidata page. LoC differentiates between a RWO and the information resource describing it. LoC suggests that the URIs for the RWOs can be used in $1, while the URIs for the information resource (bibliographic and authority records) can be used in $0. In the MARC proposals related to the use of $0 and $1 subfields, e.g., MARC Proposal No. 2017-08, the previously-mentioned distinction is also made by providing these two definitions:
In these 2 definitions, two modelling issues must be highlighted:
When checking the 1st issue, one can conclude that LoC does not implement this modelling. LoC does not keep separated the information about the RWO and the authority record referring to that RWO. Let's take Homer as an example (Check Homer RDF files.zip). The authority record is an instance of mads:Authority class (which is not considered RWO by LoC), and of skos:Concept class (which is a RWO). It includes both information about Homer (e.g., gender, occupation, associated language) and information about the use of the heading Homer in LoC (e.g., citation sources, LC classifications, editorial notes, use of heading Homer as subject in LoC bibliographic records). The RWO RDF file presents an instance of mads:RWO, bf:Person, and foaf:Person. The description of Homer as a RWO includes information about Homer (e.g., gender, occupation, associated language) BUT the properties used for this descriptions are not from Bibframe (bf does not have such properties), nor from foaf (foaf has appropriate properties). MADS properties are used instead. Checking the MADS ontology, the relevant properties are modelled having as domain mads:RWO. So, from a model's point of view LoC may be consistent, yet IMHO it is strange modelling. Another important issue is that information about the heading Homer in the LoC catalog is maintained in the RWO, e.g. subjectOf. To be honest, I did not expect a subjectOf relationship in the Homer RWO RDF file, but an author relationship relating Homer to his Works. When checking the 2nd issue, one can conclude that there is a semantic problem. SKOS was developed for KOS, concepts and their relationships between them. An authority record is a metadata WORK. So, I believe that it is wrong to model an authority record as an instance of skos:Concept. The LoC practice that skos:Concepts are not RWOs originates from this wrong modelling. In relation to our mapping exercise, I propose
PS. I have not touched the issue of what is a RWO. To me it is something that can be dereferenced. Since there is not currently a tool to check if and how id.loc.gov URIs are dereferenced, I have focused on the modelling part of this discussion. |
Beta Was this translation helpful? Give feedback.
-
Thank you for beginning this discussion. I don't think we can say we've properly discussed RWOs if it doesn't involve some pain, as it's a beastly topic. Anyway... I disagree with the direction our group has taken that maintains that instances of skos:Concept are necessarily RWOs. The SKOS specification allows ambiguity here, so we do seem to have a choice of creating a vocabulary where URIs are assigned to RWO concepts typed as skos:Concept. However, anything typed as a skos:Concept is not necessarily a real world concept; in fact, I would argue that a skos "concept" is a "concept" in a knowledge organization system and, as such, more appropriately considered documentary in nature. When we retrieve the document -- and this is complicated by the recommendation (Architecture of the World Wide Web, 2004) that a web document is actually content that has, potentially, multiple representations (an html representation, an RDF/XML representation, a Turtle representation, etc.) -- that is exactly what we are retrieving, "content" describing an entry in a KOS. That KOS is a document presenting a model that helps us organize a domain of knowledge but is not a "knowledge model." If you read the SKOS specification's introduction -- especially section 1.3 -- I don't how anyone can maintain that anything typed skos:Concept should be understood as a RWO; in fact, I would expect the reader to emerge thinking a skos:Concept is not an RWO. I would be very pleased if someone could show me one authoritative document that informs us that a skos:Concept is a RWO. You can find affirmations that concepts are RWOs, but a skos:Concept is not the same thing as a concept in the world. |
Beta Was this translation helpful? Give feedback.
-
What is an RWO. I would say that I take the views of a day-to-day librarian who just wants to apply the rules. I think there are more ambitious people that want to change the architecture of the web. In that imaginary, I believe that saying a RWO is something that can be dereferenced is an attempt to rethink our architecture. In the current architecture, there's only one "thing" that can dereferenced and that is a web document. In the current architecture, there are only two things in the world: (1) a web document, (2) everything else. Only web documents can be exchanged on the web. We call everything else a "real world object" and we cannot exchange it over the web; instead, we offer up some description in a web document. We've been given recommendations on how to do this (Cool URIs, 2008): we either redirect to a web document when RWOs are requested over HTTP, or we create fragment identifiers (hash URIs) in a web document where we describe RWOs in that document (of course actual practice mostly does not follow these recommendations). So an RWO, in what I call the current architecture, is anything that cannot be dereferenced directly. So I don't understand the idea that an RWO is anything that dereferences, as dereferencing means accessing a referenced resource. We can't access an actual elephant over http, only a document describing the elephant. |
Beta Was this translation helpful? Give feedback.
-
I don't think we have to define what an RWO is for the purposes of the project, although it is an interesting side discussion and should be part of a basic understanding of RDF and linked open data. [Nonetheless, a couple of points have been made that I think are incorrect:
For the project, we need to know of what RDF type (class/entity) is the instance represented by an IRI. Note that every IRI represents an instance of something (some "thing"). Example: The instance IRIs http://id.loc.gov/rwo/agents/nb2001072552 and http://id.loc.gov/authorities/names/nb2001072552 that are both labelled "Dunsire, Gordon" in id.loc.gov. Both instance IRIs have multiple types. This means the instance is a member of each of the type classes and may be described using typed-domain properties of any of the classes (including their super-classes). The RDF type graphs for each instance are:All I don't think we have to define what an RWO is for the purposes of the project, although it is an interesting side discussion and should be part of a basic understanding of RDF and linked open data. [Nonetheless, a couple of points have been made that I think are incorrect:
For the project, we need to know of what RDF type (class/entity) is the instance represented by an IRI. Note that every IRI represents an instance of something (some "thing"). Example: The instance IRIs http://id.loc.gov/rwo/agents/nb2001072552 and http://id.loc.gov/authorities/names/nb2001072552 that are both labelled "Dunsire, Gordon" in id.loc.gov. Both instance IRIs have multiple types. This means the instance is a member of each of the type classes and may be described using typed-domain properties of any of the classes (including their super-classes). The RDF type graphs for each instance are: The dotted line property is rdfs:type; the unbroken line property is rdfs:subClassOf. Type statements are entailed upwards; for example: http://id.loc.gov/rwo/agents/nb2001072552 rdfs:type bf:Person . These statements should be sufficient to determine the appropriate RDA class with which to type each instance IRI. This is the same as determining a map from the MARC 21 classes to RDA classes. To determine the mapping from each class we have to compare the semantics of the source and target classes. The semantics are indicated by definitions, etc., subclass declarations, and the properties (in that order and recursively) but are not indicated by class labels. The MARC 21 types involve the following classes and definitions:
The equivalent RDA classes are:
The MARC 21 class definitions are significantly less precise than the RDA definitions. The MADS/RDF definitions are circular, inconsistent, and reference the definitions of other properties in the namespace. In particular, the MADS/RDF RWO definition can be unpacked to: RWO: An abstract entity that identifies a Real World Object (RWO) identified by the label of an idea or notion with a controlled label or a former idea or notion with a controlled label. I think this means that a MADS RWO is any thing that can or does have a label, and any thing that cannot be labelled is not an RWO. I note that a blank node is not an instance of an RWO, because there is no label (IRI to stringify). If so, we can ignore the MADS RWO class because an RDA instance must have an appellation (i.e. label). It seems clear from the definitions that RDA Person has a narrower coverage than BF Person, and therefore: rdac:C10004 rdfs:subClassOf bf:Person . All RDA persons are BIBFRAME persons, but not all BIBFRAME persons are RDA persons, so a transformation from BF/MARC 21 to RDA Person is unsafe. I think this analysis is applicable to all entities with authorities in id.loc.gov. At some point in the chain of classes and definitions, fictitious instances of the entity are included in the scope, so the corresponding RDA entity must be a subclass and the instance IRI cannot be transformed safely. |
Beta Was this translation helpful? Give feedback.
-
The key to understanding the id.loc.gov modelling is the property foaf:focus. This is a super-property of madsrdf:identifiesRWO which relates an authority to its RWO. The domain of the property is madsrdf:Authority and the range is madsrdf:RWO. ex:IRI1 madsrdf:identifiesRWO ex:IRI2 . This is a subpropertyof foaf:focus. The domain of the FOAF property is skos:Concept, the range is owl:Thing. ex:IRI1 madsrdf:identifiesRWO ex:IRI2 . This is why an instance IRI is typed as a MADS Authority and as a SKOS Concept, and why MADS Authority is declared a subclass of SKOS Concept. Otherwise, id.loc.gov cannot use foaf:focus. The "focus" property was added to FOAF in 2010, well after the publication of the original ontology; see FOAF spec revised - addtion of foaf:focus, a skos extension linking topical and factual information. [There seems to be a problem with the FOAF Vocabulary Specification which displays an older version without the "focus" property.] It is a bit strange that this property was added to FOAF and not to SKOS: it has a SKOS class as a domain. I think the original modelling decision at id.loc.gov was to use SKOS irrespective of the distinction between a value vocabulary and a dataset that is discussed in Library Linked Data Incubator Group: Datasets, Value Vocabularies, and Metadata Element Sets. This report was published in 2011. SKOS is not really intended for datasets of instances of entities such persons, works, and places, but it is fine for thesauri and other kinds of value vocabulary of instances of concepts such as carrier types. There is an interesting discussion about much of this, also from 2011, in Things & their conceptualisations: SKOS, foaf:focus & modelling choices. I have always been puzzled as to why id.loc.gov did not consider using the SKOSXL ontology: SKOS Simple Knowledge Organization System eXtension for Labels (SKOS-XL) Namespace Document - HTML Variant. This was published in 2009, and is closer to the idea of LRM Nomen. Finally, foaf:focus has no declared super-property. As @szapoun points out, RDA takes a different approach to relating a description to its subject that avoids confusion between labels, concepts, and subject relationships ("aboutness") by using the property rdax:P00030 "RDA entity described with metadata by" with domain RDA Entity and range (metadata) Work. Definitions:
The RDA definition unpacks to: A work that is a metadata statement or a metadata description set for an RDA entity. |
Beta Was this translation helpful? Give feedback.
-
RDA definition of a RWO: An instance of an entity or a term from a vocabulary encoding scheme that is referenced by an Internationalized Resource Identifier for representation in Resource Description Framework. |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.
-
This discussion is related to this discussion $0's and $1's #328
Beta Was this translation helpful? Give feedback.
All reactions