-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
CDC 2.5 and 3.2 profiles: access CV information #104
Comments
Original comment by Taina Jääskeläinen. Could dc: Access rights or dc: Rights elements be used? |
Original comment by Darren Bell (GitHub: darrenbell2). From: Wendy Thomas [[mailto:email address removed](mailto:email address removed)] Ok first there is a place for data access and metadata access in 2.6 codebook/stdyDscr/dataAcces/typeOfAccess codebook/stdyDscr/metadataAcces/typeOfAccess
as for Dublin Core it is the last contained element in "citation"
On Wed, Jan 26, 2022 at 4:38 AM Bell, Darren S <[email address removed](mailto:email address removed)> wrote: Hi Wendy – sorry if this is a dumb question but Taina is talking about using dc:rights in a codebook 2.5 document. I can see that codebook.xsd imports the dcterms schema: <xs:import namespace="http://purl.org/dc/terms/" schemaLocation="dcterms.xsd"/> which in turn imports the dc schema: <xs:import namespace="http://purl.org/dc/elements/1.1/" schemaLocation="dc.xsd"/>
But how (and where) can I use “dc:rights” in a codebook xml document so that can be validated properly? Should I do a namespace and schema declaration for dc.xsd in the root element of the codebook xml document?
Thanks, Darren |
Original comment by Taina Jääskeläinen. Wendy said: Lifecycle: TypeOfAccess is an element of AccessType and is expressed in Access, DefaultAccess, and SampleFrameAccess and Access is available in Item. DDILIFE-3699 addresses this. (Taina: So DDI 3.3. has an element TypeOfAccess but DDI 3.2 does not seem to have it) Codebook: The ability to describe a typeOfAccess for data and metadata has been added to Version 2.6 dcterms:AccessRights is available and I have talked with Darren about how this could be used and how to associate a controlled vocabulary with this item __________________ Another suggestion from a CESSDA deliverable D3 Specification for interoperable access conditions in CDC: The technical capacities of OAI-PMH endpoints must also be examined, configuring OAI-PMH headers12 to set up a True or False filter seems promising. http://www.openarchives.org/OAI/openarchivesprotocol.html#Record |
Original comment by Former user. I am not convinced that OAI setspec is a good approach to solve this, it shifts metadata information over to logical organisation of records and is not interoperable beyond CESSDA |
Original comment by Taina Jääskeläinen. CDC Upgrade project thought that while waiting for DDI 2.6 which solves this, DDI 2.5 could use
I don’t know if archives already have some other content in the element. |
Original comment by Taina Jääskeläinen. Any ideas about this one? |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). We have had some talks on this, and the proposed position in codebook seems like an acceptable solution and is agreed. But there are still some difficulties in order to achieve the desired result in CDC. Not sure they are relevant to this Bitbucket issue itself, but I’ll note them down for anyone interested.
|
Original comment by Katja Moilanen. It would be also possible to do like this Avoin Öppen Open OR use only a not-language-specific code O |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). Oh, then I have misunderstood it. I thought that each instance of the language would require its own @ID. This makes it easier, thank you for this. |
Original comment by Katja Moilanen. If you have same ID for two or more elements the XML document is not valid. This is not valid: …. Avoin Open Öppen …. Having language code attached to string CESSDA_Data_Access (if and only if there is one term from CESSDA Data Access CV used for one study) make IDs valid. This is valid: …. Avoin Open Öppen …. |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). My concern with the languages was based on this beeing a freetext field, so I assume they have to be specified no matter what, but I might be wrong. My codebook knowledge is limited. But in trying to imitate a CV here it would make the most sense that the content of the conditions element is a Code, not a descriptive term. Then it would be easier for CDC to handle it, since its one value instead of many different they need to resolve? |
Original comment by Taina Jääskeläinen. Yes, I agree, it might be better to use the code value only. I’m attaching the CESSDA Access Interoperability Vocabulary to show the code values. It is not published yet, first a review round is to be made. |
Original comment by Taina Jääskeläinen.
|
Original comment by Katja Moilanen. In DDI-C, it is not mandatory to use language tags for freetext fields (if I checked the DDI-L specification correctly, it is not mandatory in it either). If the DDI-C file is monolingual, the language is usually specified in codeBook root element and all elements/attributes inherit the language of the root element. In multilingual files elements have their own language tags and actually in most cases all contents have language either through their own language tag or inherited. So Morten you are correct, that language is defined somewhere. Kuha2 seems to add language tag for all elements so having one element without language might cause a problem. |
Original comment by Taina Jääskeläinen. Hmmm… that complicates things. Would you say Katja then that it would be best to use the descriptive term with language tags or the code value with language tags? |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). Can’t we just exclude language tags from the DDI Profile then for this element? Then CMV/CDC just ignores the information in the cases where it is present, or am I misunderstanding this? edit: I mean that it would ignore the language tag, not the conditions element or @ID attribute |
Original comment by Katja Moilanen. I asked this from Toni who has built the CESSDA metadata aggregator and Kuha2. The information, if the data is openly available or not is useful for other catalogues as well, not only for CDC. E.g. BY-COVID portal (https://www.covid19dataportal.org/) will in future include metadata of the COVID-19 related datasets appearing in CDC. Toni said that using ID in this way does not work in CESSDA metadata aggregator (or for Kuha2) because there one OAI-PMH-XML document has many entries and ID must be unique inside one XML document so misusing ID for separating CESSDA field from other does not work. |
Original comment by Katja Moilanen. How about misusing //conditions/@elementVersion ? It seems to be xs:string so it could always get value “CESSDA” and it does not need to be unique. |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). Well that’s not good news. I don’t really mind what we use as long as it is something that can work at this point. But is there not a higher risk that this attribute is already being used for versioning? I was looking at //conditions/@ddiCodebookUrn but I don’t know if this includes information from the @ID attribute the same way LifecycleUrns do? But there are not many options left here, so @elementVersion works for me.. |
Original comment by Katja Moilanen. I think that codeBook users do not largely use @elementVersion for versioning (it has been available only since DDI2.5). @ddiCodebookUrn contents must be of type xs:anyURI. In any case, some other attribute than ID needs to be misused. |
Original comment by Former user. Sorry to add to the complications, but OpenAIRE requires ‘Rights’, see https://guidelines.openaire.eu/en/latest/data/field_rights.html#rightsuri-ma This has the following allowed values:
According to MDO’s current schema mapping (https://doi.org/10.5281/zenodo.5614658), the information should come from So the solution of this issue should include the specification for that mapping, i.e. how the OpenAIRE CV is to be used in the transformation. |
Original comment by Katja Moilanen. I think that current MDO - OpenAIRE mapping does not include OpenAIRE 16.1. RightsURI at all. I suppose that OpenAIRE 16. Rights is mapped with CDC codeBook/stdyDscr/dataAccs/useStmt/restrctn because both of these are freetext fields. In OpenAIRE 16. Rights and 16.1 RightsURI are not mandatory/required; occurences of the 16. Rights is 0-n and 16.1 RightsURI 0-1. (OpenAIRE 2. Creator, which is mandatory/required may have 1-n occurences). Do we need CESSDA’s own data access CV at all, if there is a need to use OpenAIRE CV? |
Original comment by Taina Jääskeläinen. @sharonbolton @MortenSikt What do you think about replacing the CESSDA CV with OpenAire vocabulary, would this work? It would of course mean asking SPs to do the exercise of mapping their access categories to it. It solves the issue of embargoed access but the definition given for restricted access is rather problematic for CESSDA since it simplifies all types of restrictions to getting an email address in the last sentence. The definitions seem to be focused on articles and other publications. Is someone from MDO in contact with OpenAire? Would be good to discuss a possible amendments of term definitions? Or adapt the CESSDA CV to be more compatible with OpenAire by using closedAccess, embargoedAccess, restrictedAccess and openAccess as code values but adapt the definitions to fit the CESSDA case better? OpenAire Access Rights vocabularySAME AS the e-prints accessRights property in the Scholarly Works Application Profile see: http://purl.org/eprint/accessRights/ |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). If replacing the CESSDA CV with the OpenAire vocabulary solves other issues it is probably better. To me they look as general as the ones we intended, and there should not be a problem for SPs mapping to these. On another note, we will misuse this as katja proposed: //conditions/@elementVersion for specifying that the content of the element is a code for the CV for 2.5. |
Original comment by Katja Moilanen. Especially if the decision will be to use two classes
it does not really matter which vocabulary is used. |
Original comment by Katja Moilanen. Taken from Morten’s email: As for the vagueness of the OpenAire CV definition. The report will include guidelines for SPs on mapping to the CV and this will use the definitions stated in the CESSDA CV for CESSDA Data Access Interoperability Vocabulary. In short, all data with access that fall within “Free download. May require registration to the system and/or accepting the terms of use online to gain access to the data. No restriction on the type of use.” Will be mapped to openAccess. Any other restrictions that are not within that scope, the SPs should map to restrictedAccess. |
Original comment by Taina Jääskeläinen. @MortenSikt OK, with those mapping instructions I guess it would work and CESSDA could then use the OpenAire vocabulary. The misuse of the DDI 2.5 element would be only temporary. The draft DDI 2.6 specification will be under review from next week. As Darren said, it contains ‘TypeOfDataAcdess’. In the most optimal scenario, the end-of-the-year DDI2 profiles could already contain that element. However, this depends on the review cycle. But is there a suitable element in DDI 3.2.? There is in DDI 3.3. |
Original comment by Morten.Jakobsen (GitHub: MortenSikt). Should not be to hard to add it in for 2.6 when that arrives. There is not a suitable element under Archive/Access for DDI 3.2, it is possible to specify the CV using UserAttributePair(https://ddialliance.org/Specification/DDI-Lifecycle/3.2/XMLSchema/FieldLevelDocumentation/schemas/reusable_xsd/elements/UserAttributePair.html), this can basically be used to specify whatever you want. But since a proper element exists in DDI 3.3 I am writing the specification for that standard. |
Original comment by Katja Moilanen. I and Esra checked DDI3.2. We suggest to use ddi:DDIInstance/s:StudyUnit/a:Archive/a:ArchiveSpecific/a:Item/a:Access/a:AccessTypeName/r:String in DDI3.2 for saying if access is “openAccess” or “restrictedAccess” and attribute //a:AccessTypeName/@context to be used for vocabulary name. We know that it is not a perfect solution, but we did not find better one. |
Original comment by Katja Moilanen. Conclusion: CDC will use the same Access Rights vocabulary as OpenAIRE. The name of the vocabulary is “info:eu-repo-Access-Terms vocabulary”. Only the codes “restrictedAccess” and “openAccess” will be used. DDI2.5: DDI2.6: DDI3.2: DDI3.3: Guidance for SPs how to use info:eu-repo-Access-Terms vocabulary vocabulary: |
Original comment by Matthew Morris (GitHub: matthew-morris-cessda). |
Implemented following in 2.5, 2.5_mono, 1.2.2 and 1.2.2_mono profiles: |
Implemented following in CDC 3.2 profile: |
Will close as new 2.6 and 3.3. profiles will be based in 2.5/3.2 and hence will incorporate this information. |
Reopening, as this hasn't been merged into main (or mdo-d6) |
Original report on BitBucket by Taina Jääskeläinen.
CESSDA will have an access vocabulary to which SPs will map their access categories. This will allow CDC users to narrow down their searches to open access data.
So far neither DDI-C nor DDI-L have an element for this but Darren had an idea of how this information could be incorporated into the profiles. Important to have.
CV Element promised for DDI 2.6.
The text was updated successfully, but these errors were encountered: