Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DSpace 6/7 bulkedit MetadataImport unable to remove duplicate metadata in multi-value fields #7989

Open
alanorth opened this issue Oct 11, 2021 · 0 comments · May be fixed by #8993
Open

DSpace 6/7 bulkedit MetadataImport unable to remove duplicate metadata in multi-value fields #7989

alanorth opened this issue Oct 11, 2021 · 0 comments · May be fixed by #8993
Labels
bug help wanted Needs a volunteer to claim to move forward tools: batch-editing Related to batch / bulk metadata editing, csv import/export tools: import Related to import of data into the system

Comments

@alanorth
Copy link
Contributor

Describe the bug
If an item has duplicate metadata in a multi-value field, for example the same author listed twice:

id,dc.contributor.author[en_US]
b88eb8a6-eb41-4f4f-9f39-9600ca6e82fe,Alan Orth||Alan Orth

... it is not possible to remove the duplicate value via the CSV/bulkedit metadata import. If you correct the item's metadata by removing the duplicate value in the CSV and re-uploading it, DSpace does not detect changes:

$ cat /tmp/duplicate-metadata.csv
id,dc.contributor.author[en_US]
b88eb8a6-eb41-4f4f-9f39-9600ca6e82fe,Alan Orth
$ dspace metadata-import -f /tmp/duplicate-metadata.csv
There were no changes detected

I have verified this issue on DSpace 6.3 and 7.1-SNAPSHOT.

Expected behavior
We should be able to remove duplicate metadata via CSV bulkedit. To work around this we currently have to add a new column with the new data, blank out the original column, then move the data back to the old column (and remove the new column).

Related work
This issue was noticed by at least one user on the dspace-tech mailing list in 2015 (see the thread entitled "No changes were detected" when importing metadata via XMLUI).

I guess the problem is in the compare method in dspace-api/src/main/java/org/dspace/app/bulkedit/MetadataImport.java, but I don't understand the code.

@alanorth alanorth added bug needs triage New issue needs triage and/or scheduling labels Oct 11, 2021
@tdonohue tdonohue added tools: batch-editing Related to batch / bulk metadata editing, csv import/export and removed needs triage New issue needs triage and/or scheduling labels Nov 2, 2021
@tdonohue tdonohue added this to To Do in DSpace 7.x Tier 4: Medium Low Priority via automation Nov 2, 2021
@tdonohue tdonohue added the help wanted Needs a volunteer to claim to move forward label Nov 2, 2021
@tdonohue tdonohue added this to To Do in DSpace 7.x Tier 5: Low Priority via automation Nov 30, 2022
@tdonohue tdonohue added the tools: import Related to import of data into the system label Jul 7, 2023
@ybnd ybnd linked a pull request Jul 28, 2023 that will close this issue
8 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug help wanted Needs a volunteer to claim to move forward tools: batch-editing Related to batch / bulk metadata editing, csv import/export tools: import Related to import of data into the system
Projects
Status: 🙋 Needs Help / Unscheduled
Development

Successfully merging a pull request may close this issue.

2 participants