Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create a single Embedded Metadata translator #686

Closed
aurimasv opened this issue Mar 6, 2014 · 1 comment
Closed

Create a single Embedded Metadata translator #686

aurimasv opened this issue Mar 6, 2014 · 1 comment
Labels
Difficulty: Hard New Translator Pull requests for new translators

Comments

@aurimasv
Copy link
Contributor

aurimasv commented Mar 6, 2014

We have discussed this already to some extent, but because this is a somewhat difficult undertaking, this has not really been done yet.

The general idea is to have a single Embedded Metadata translator that would cover metadata in META tags (as it does now), DOIs on the page (taking over for DOI), and I would argue COinS and probably unAPI.

This would have a couple of advantages:

  • We could gracefully fall back to a lower priority translator if a higher priority translator ends up failing (as reported in Fall back to "embedded metadata" if the DOI translator fails? #685)
  • We could combine metadata from multiple sources (e.g. retrieve high quality metadata via DOI and supplement it with abstract, keywords, and attachments from META tags)
  • More convenient to the users, since they wouldn't have to worry about switching translators (if at all possible) to import what they want.

One difficulty about this approach is figuring out when different "embedded metadata" (DOI, COinS, etc.) describes the same resource so that data could be combined and not displayed as multiple choices to the user. We'll probably end up deciding this based on unique identifiers, like DOI, ISBN, etc. and/or similarity between titles (maybe ignoring case and punctuation). Any ideas on this would be helpful.

@dstillman
Copy link
Member

Superseded by #1092, but all of this applies to that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Difficulty: Hard New Translator Pull requests for new translators
Development

No branches or pull requests

3 participants