Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

MRG: write manysketch --singleton sketches incrementally to reduce memory usage #375

Merged
merged 2 commits into from
Jul 1, 2024

Conversation

bluegenes
Copy link
Contributor

@bluegenes bluegenes commented Jul 1, 2024

I'm running into memory issues with manysketch --singleton, likely brought on by the fact that we're storing all the singleton sigs in a list until the end of the file, rather than streaming + sending them to be written. This PR fixes that by sending singletons for writing right after signature generation (before going on to the next read/contig). We then avoid writing singletons at the end of the loop, where they would just be empty template sigs.

@bluegenes bluegenes changed the title WIP: update manysketch --singleton MRG: update manysketch --singleton Jul 1, 2024
@bluegenes
Copy link
Contributor Author

@ctb ready for review

src/manysketch.rs Outdated Show resolved Hide resolved
Co-authored-by: C. Titus Brown <titus@idyll.org>
@bluegenes bluegenes enabled auto-merge (squash) July 1, 2024 18:26
@bluegenes bluegenes changed the title MRG: update manysketch --singleton MRG: write manysketch --singleton sketches incrementally to reduce memory usage Jul 1, 2024
@bluegenes bluegenes merged commit f6c7c30 into main Jul 1, 2024
1 check passed
@bluegenes bluegenes deleted the upd-singleton-ms branch July 1, 2024 18:28
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants