-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
merge_sets
creates duplicates in annotations.csv
#384
Comments
The merge overwrites the previously generated set files without needing an 'overwrite' argument and without any warning. It could probably cause problems when not careful. We could enforce that the output_set must not already exist. Is it possible that a set is constituted of multiple different merges (sounds to me like they should each have a separate set) ? What do you think? |
Yes, I think the best would be to raise an error if the set already exists so that the user first deletes it and re-merges it. |
yeah, tracking down outdated sets will be kind of hard to do |
When merging several sets together using
merge_sets
several times (see below), duplicate lines are created inannotations.csv
Duplicate lines in
annotations.csv
when runningmerge_sets
twiceThese lines should be dropped before (re)merging the sets and adding the resulting new annotation lines.
The text was updated successfully, but these errors were encountered: