Use iterator() when rehashing to save memory #4139
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Any background context you want to provide?
The
0200_rehash
migration was failing due to memory pressure when migrating the production database with over 14E6 states.What's this PR do?
Convert the
.all()
call to.all().iterator()
to prevent all records from returing at one time, rather force a database iterator.This migration still takes a long time to run--around 10 hours. We might need to parallalize this in the future.
How should this be manually tested?
If CI passes, then there are no syntax errors. The PR does not change the core functionality.
What are the relevant tickets?
n/a
Screenshots (if appropriate)