Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MARC/MODS data source: Duplicated records in MODS database but with differential data #44

Closed
ronaldtse opened this issue Jul 13, 2024 · 2 comments
Assignees
Labels
fix export from catalog requires setting changes when exporting from library catalog

Comments

@ronaldtse
Copy link

ronaldtse commented Jul 13, 2024

The latest release of the MODS database, since the switch from CrossRef information, at #42 , contains a number of duplicated records about the same bibliographic item, but there are data discrepancies.

We have discovered 417 duplicates when treating the URL as the primary key.

These discrepancies range from minor editorial changes (punctuation) to actual data corrections. Someone needs to go through these records to merge the changes into one item.

The detailed diff is provided here:

The way to obtain this diff is:

$ git clone https://github.com/relaton/loc_mods.git
$ cd loc_mods
$ bundle
$ bundle exec bin/update-nist-mods
$ bundle exec exe/loc-mods detect-duplicates spec/fixtures/
@ronaldtse ronaldtse changed the title Duplicated records in MODS database Duplicated records in MODS database but with differential data Jul 14, 2024
@ronaldtse ronaldtse changed the title Duplicated records in MODS database but with differential data MARC/MODS data source: Duplicated records in MODS database but with differential data Jul 14, 2024
@kmiller621
Copy link
Collaborator

The duplicates are physical representations of the electronic items; I will make sure I only include the electronic records in the next release.

@kmiller621 kmiller621 self-assigned this Jul 29, 2024
@kmiller621 kmiller621 added the fix export from catalog requires setting changes when exporting from library catalog label Jul 29, 2024
@kmiller621
Copy link
Collaborator

Fixed with Aug2024 release

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
fix export from catalog requires setting changes when exporting from library catalog
Projects
None yet
Development

No branches or pull requests

2 participants