-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[DS-4104] Google Scholar requires either a correct citation_date or no date - incorrect ones should be avoided #7451
Comments
Is related to: DS-2930 |
bram said: Apologies for making these PR branches in the DSpace repo instead of in my own fork by the way! |
bram said: I need to add an additional change to address: "citation_date is the old & deprecated version of the metatag. It was deprecated since it could be and was interpreted in multiple way.s. The currently recommended version is citation_publication_date. citation_publication_date is preferred when available." e.g. right now, we are exposing citation_date and we should ideally be exposing citation_publication_date |
tdonohue said: I had a similar conversation with the Google Scholar team (Anurag and Monica) just before the holidays, and they had mentioned this as being a major issue. Thanks for the PRs, Bram Luyten (Atmire), I've already given them a +1. |
kshepherd said: I've merged the 5.x and 6.x versions – sorry Bram Luyten (Atmire), didn't notice the comment about citation_publication_date until afterwards so that might need a new PR afterwards. We should port this to master as well, then we can close this ticket |
kshepherd said: I've ported this ticket to master and merged. Not closing ticket just yet as I realise we might want another round of PRs to update the deprecated date field name. |
bram said: Created a separate issue for the renaming of the field: https://jira.lyrasis.org/browse/DS-4518 so we could close out 4104 |
Imported from JIRA [DS-4104] created by bram
Not 100% yet which versions are affected but we're currently crosswalking out too many dates to citation_date:
https://github.com/DSpace/DSpace/blob/master/dspace/config/crosswalks/google-metadata.properties#L42
google.citation_date = dc.date.copyright | dc.date.issued | dc.date.available | dc.date.accessioned
This patterns is "first match", so in case dc.date.copyright isn't filled out, dc.date.issued is consulted, after that the automatically generated deposit dates are offered.
The consequence is that sometimes, the date submitted into the repository gets exposed into citation_date. This makes it hard/problematic for Scholar to match it accurately against the actual publication date for the item.
If there is no accurate publication date, no date should be crosswalked at all. So I propose ONLY crosswalking dc.date.issued by default and remove the fallback options we have today.
Quick Fix (for any site running DSpace 1.7.x or above): Modify your [dspace]/config/crosswalks/google-metadata.properties configuration file. Change the "google.citation_date" setting to ONLY list "dc.date.issued".
google.citation_date = dc.date.issued
The text was updated successfully, but these errors were encountered: