[BUGFIX] fail when passed fastas with duplicate sequence ids #555

Merged: skothenhill-nv merged 2 commits into main from hillst/nvfaidx-restrictions on Dec 25, 2024

Conversation

@skothenhill-nv
Collaborator

When NvFaidx was used on FASTA files containing duplicate sequence ids, which violates the FASTA spec, it would not report an error. Instead, it silently used the last-seen sequence as the entry for each duplicated id.

With this PR, NvFaidx fails by default on such files and exposes a parameter to ignore sequence_ids and use integer indexing instead.
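The behavior described above can be sketched in plain Python. This is only an illustration under stated assumptions, not the NvFaidx implementation: `read_fasta_records`, `build_index`, and the `allow_duplicate_seqids` parameter name are hypothetical and chosen for this example. It shows the two modes: fail on duplicate ids by default, or ignore the ids and key each record by its integer position in the file.

```python
from collections import Counter
from typing import Dict, List, Tuple, Union


def read_fasta_records(text: str) -> List[Tuple[str, str]]:
    """Parse FASTA text into (sequence_id, sequence) pairs in file order."""
    records: List[Tuple[str, str]] = []
    seq_id = None
    chunks: List[str] = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Flush the previous record before starting a new one.
            if seq_id is not None:
                records.append((seq_id, "".join(chunks)))
            # The id is the first whitespace-delimited token after '>'.
            header = line[1:].split(maxsplit=1)
            seq_id = header[0] if header else ""
            chunks = []
        elif seq_id is not None:
            chunks.append(line)
    if seq_id is not None:
        records.append((seq_id, "".join(chunks)))
    return records


def build_index(
    text: str, allow_duplicate_seqids: bool = False
) -> Dict[Union[str, int], str]:
    """Build a lookup table from FASTA text.

    Hypothetical helper: by default, duplicate ids raise an error instead of
    letting a later record overwrite an earlier one. When
    allow_duplicate_seqids is True, ids are ignored and records are keyed by
    their integer position.
    """
    records = read_fasta_records(text)
    if allow_duplicate_seqids:
        return {i: seq for i, (_, seq) in enumerate(records)}
    counts = Counter(seq_id for seq_id, _ in records)
    duplicates = sorted(name for name, n in counts.items() if n > 1)
    if duplicates:
        raise ValueError(f"Duplicate sequence ids in FASTA: {duplicates}")
    return dict(records)
```

With a file containing two `chr1` records, `build_index(text)` raises `ValueError`, while `build_index(text, allow_duplicate_seqids=True)` returns every record keyed by `0, 1, 2, ...`. Building a plain `dict` from the records without the check reproduces the old failure mode, since later keys overwrite earlier ones.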

@skothenhill-nv
Collaborator Author

/build-ci

@skothenhill-nv skothenhill-nv enabled auto-merge (squash) December 24, 2024 21:35
Contributor

@edawson edawson left a comment

Looks good, and appreciate the tests - approved.

@skothenhill-nv skothenhill-nv merged commit f8db8a2 into main Dec 25, 2024
@skothenhill-nv skothenhill-nv deleted the hillst/nvfaidx-restrictions branch December 25, 2024 03:00
nvdreidenbach pushed a commit to nvdreidenbach/bionemo-framework that referenced this pull request Jan 2, 2025
…ioNeMo#555)

When NvFaidx was used on Fasta files containing duplicate sequence ids,
which violates the FASTA spec, it would silently fail and use the
last-seen sequence as an entry.

This PR fails by default and exposes a parameter to ignore sequence_ids
and use integer indexing instead.

Signed-off-by: Danny <dreidenbach@nvidia.com>