Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Multiple manifestations of single edition handled poorly #2303

Open
tfmorris opened this issue Aug 15, 2019 · 6 comments
Open

Multiple manifestations of single edition handled poorly #2303

tfmorris opened this issue Aug 15, 2019 · 6 comments
Labels
Affects: Data Issues that affect book/author metadata or user/account data. [managed] Editions Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] metadata Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Priority: 3 Issues that we can consider at our leisure. [managed] Theme: Identifiers Issues related to ISBN's or other identifiers in metadata. [managed] Type: Feature Request Issue describes a feature or enhancement we'd like to implement. [managed]
Projects

Comments

@tfmorris
Copy link
Contributor

Description

The edition below was scanned twice by the Internet Archive, which is arguably redundant and wasteful of resources, but given that that they do it, we should be able to handle it. ImportBot added a second IA identifier to the edition record, but the edition page only displays the first identifier, making the second scan invisible.

Relevant url?

https://openlibrary.org/books/OL24656394M/L'%C3%A9volution_du_dogme
https://openlibrary.org/people/ImportBot

Expectation

We either need to be able to handle multiple IA scans per edition record or create duplicate edition records with one scan per record. The latter is singularly unattractive from a metadata point of view, but handling the former may require adjusting how IA does "loans" for books which are still in copyright since currently availability and loans are tied to edition records.

Proposal & Constraints

It's more work, but I'd lean towards adding the concept of "copies" to our metadata model with multiple copies per edition.

Stakeholders

@mekarpeles @hornc

@tfmorris tfmorris added Type: Bug Something isn't working. [managed] Affects: Data Issues that affect book/author metadata or user/account data. [managed] Editions Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] labels Aug 15, 2019
@tfmorris
Copy link
Contributor Author

This was discovered while reviewing the duplicates in #1620 (comment)

@cdrini
Copy link
Collaborator

cdrini commented Aug 15, 2019

I agree; I think the best long-term solution would be to have a new type, maybe Copy (or Scan or Digitization), and then have a one-to-many relationship from Edition to Copy. Then each IA id would be associated with a single Copy.

@LeadSongDog
Copy link

So your "copy" would be approximately equivalent to frbr:item where "edition" is approximately frbr:manifestation ? Bringing something of the sort into use would be helpful. It would be useful to determine the referent level for each common identifer type: does it pertain to a single item/copy, or to all similar items/copies (the manifestation).

@xayhewalo xayhewalo added this to Un-Triaged in Triage Oct 20, 2019
@xayhewalo xayhewalo added metadata Priority: 2 Important, as time permits. [managed] State: Backlogged Theme: Identifiers Issues related to ISBN's or other identifiers in metadata. [managed] Needs: Triage This issue needs triage. The team needs to decide who should own it, what to do, by when. [managed] labels Nov 19, 2019
@xayhewalo xayhewalo moved this from Un-Triaged to Needs: Assessment in Triage Nov 19, 2019
@hornc
Copy link
Collaborator

hornc commented Feb 28, 2020

I'm not sure this is an import issue, and not a bug. It looks like the importer did the correct thing here by associating a newly discovered scan with the correct OL record.

I don't think creating a new edition, or new class of thing, is correct. If anything, there is potentially a UI / UX change required as a new feature, but a supporting use-case would help.

From a metadata POV, this seems like the correct behaviour. Source records is a list, and all sources are listed. We only provide one link out to a scan with the ocaid field, and we've got one. Either should be 'good' copies, and there isn't currently an automated way to chose a 'best' one. This example is a public domain scan, so there isn't a availability problem. There might be for borrowable items, but archive.org already has this problem.

@tfmorris
Copy link
Contributor Author

The example is a public domain scan, but this applies to copyrighted scans that Internet Archive is going to "lend" as well. If they've scanned multiple editions, users can join the wait list for any of them (actually they have to do it individually), but if the same edition is scanned twice, only one of the copies can be lent out through OpenLibrary.

@hornc Are you proposing that we just refuse to support lending multiple scans of the same edition?

@mekarpeles mekarpeles removed the Needs: Triage This issue needs triage. The team needs to decide who should own it, what to do, by when. [managed] label Apr 20, 2020
@LeadSongDog
Copy link

Multiple scans of a single physical copy also represent a way to reduce OCR errors, but that’s an IA process, not OL.

@jimchamp jimchamp added Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] and removed Needs: Lead labels Jan 24, 2022
@mekarpeles mekarpeles added Type: Feature Request Issue describes a feature or enhancement we'd like to implement. [managed] and removed Type: Bug Something isn't working. [managed] labels Feb 2, 2022
@mekarpeles mekarpeles added Priority: 3 Issues that we can consider at our leisure. [managed] and removed Priority: 2 Important, as time permits. [managed] labels Sep 15, 2023
@mekarpeles mekarpeles changed the title Multiple copies/scans of single edition handled poorly Multiple manifestations of single edition handled poorly Oct 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Affects: Data Issues that affect book/author metadata or user/account data. [managed] Editions Lead: @mekarpeles Issues overseen by Mek (Staff: Program Lead) [managed] metadata Module: Import Issues related to the configuration or use of importbot and other bulk import systems. [managed] Priority: 3 Issues that we can consider at our leisure. [managed] Theme: Identifiers Issues related to ISBN's or other identifiers in metadata. [managed] Type: Feature Request Issue describes a feature or enhancement we'd like to implement. [managed]
Projects
No open projects
Triage
  
Needs: Assessment
Development

No branches or pull requests

7 participants