Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[DS-4104] Google Scholar requires either a correct citation_date or no date - incorrect ones should be avoided #7451

Closed
dspace-bot opened this issue Dec 11, 2018 · 8 comments
Milestone

Comments

@dspace-bot
Copy link

Imported from JIRA [DS-4104] created by bram

Not 100% yet which versions are affected but we're currently crosswalking out too many dates to citation_date:

https://github.com/DSpace/DSpace/blob/master/dspace/config/crosswalks/google-metadata.properties#L42

google.citation_date = dc.date.copyright | dc.date.issued | dc.date.available | dc.date.accessioned

This patterns is "first match", so in case dc.date.copyright isn't filled out, dc.date.issued is consulted, after that the automatically generated deposit dates are offered.

The consequence is that sometimes, the date submitted into the repository gets exposed into citation_date. This makes it hard/problematic for Scholar to match it accurately against the actual publication date for the item.

If there is no accurate publication date, no date should be crosswalked at all. So I propose ONLY crosswalking dc.date.issued by default and remove the fallback options we have today.

Quick Fix (for any site running DSpace 1.7.x or above): Modify your [dspace]/config/crosswalks/google-metadata.properties configuration file. Change the "google.citation_date" setting to ONLY list "dc.date.issued".
google.citation_date = dc.date.issued

@dspace-bot
Copy link
Author

Is related to: DS-2930

@dspace-bot
Copy link
Author

bram said:

PR for DSpace 5 #2295

PR for DSpace 6 #2294

@dspace-bot
Copy link
Author

bram said:

Apologies for making these PR branches in the DSpace repo instead of in my own fork by the way!

@dspace-bot
Copy link
Author

bram said:

I need to add an additional change to address:

"citation_date is the old & deprecated version of the metatag. It was deprecated since it could be and was interpreted in multiple way.s. The currently recommended version is citation_publication_date. citation_publication_date is preferred when available."

e.g. right now, we are exposing citation_date and we should ideally be exposing citation_publication_date

@dspace-bot
Copy link
Author

tdonohue said:

I had a similar conversation with the Google Scholar team (Anurag and Monica) just before the holidays, and they had mentioned this as being a major issue. Thanks for the PRs, Bram Luyten (Atmire), I've already given them a +1.

@dspace-bot
Copy link
Author

kshepherd said:

I've merged the 5.x and 6.x versions – sorry Bram Luyten (Atmire), didn't notice the comment about citation_publication_date until afterwards so that might need a new PR afterwards.

We should port this to master as well, then we can close this ticket

@dspace-bot
Copy link
Author

kshepherd said:

I've ported this ticket to master and merged.

Not closing ticket just yet as I realise we might want another round of PRs to update the deprecated date field name.

@dspace-bot
Copy link
Author

bram said:

Created a separate issue for the renaming of the field:

https://jira.lyrasis.org/browse/DS-4518

so we could close out 4104

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant