
EXPERIMENT: Simplify mirror sync #3844

Closed
mdellweg wants to merge 1 commit


mdellweg
Member

Just start with a clean repository version for a mirror sync. This way, there is no need to keep a record of all existing content in memory at any time. This should reduce the memory pressure on the task worker (yet to be confirmed).

[noissue]
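To illustrate the idea, here is a minimal sketch that contrasts a diff-based mirror with a clean-start mirror. It is not the code from this PR and does not use the pulpcore API: `diff_based_mirror`, `clean_start_mirror`, `write_batch`, and the use of plain string keys for content units are all hypothetical names and simplifications chosen for illustration.

```python
from itertools import islice
from typing import Callable, Iterable, List, Set


def diff_based_mirror(existing: Iterable[str], remote: Iterable[str]) -> Set[str]:
    """Hypothetical diff-based mirror.

    Every existing key is held in memory so that additions and removals
    can be computed against the remote content.
    """
    existing_keys = set(existing)  # memory grows with the repository size
    remote_keys = set(remote)
    to_add = remote_keys - existing_keys
    to_remove = existing_keys - remote_keys
    return (existing_keys | to_add) - to_remove


def clean_start_mirror(
    remote: Iterable[str],
    write_batch: Callable[[List[str]], None],
    batch_size: int = 1000,
) -> None:
    """Hypothetical clean-start mirror.

    The new version starts empty, so each remote unit is simply added.
    Only one batch is held in memory at a time; `write_batch` stands in
    for whatever persists the units (an assumption, e.g. a bulk insert).
    """
    iterator = iter(remote)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            break
        write_batch(batch)
```

Both approaches end with the same content in the new version. The clean-start variant trades the in-memory record for re-adding every unit, which may explain extra run time on large repositories.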

@mdellweg
Member Author

It does decrease the memory usage a bit (though I suspect what we are seeing is that garbage collection never even started doing any work), but at the expense of taking almost twice as long as the original sync. This is clearly not the best idea in its current form.

Tested with 100 000 content units and stream policy.
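A comparison like this can be sketched with the standard library alone. The helper below is an assumption for illustration, not the tooling used for the plot: `measure` is a hypothetical name, and `tracemalloc` only tracks allocations made by Python, so its peak will differ from the process memory a worker actually uses. Running `gc.collect()` first is one way to reduce the garbage collection timing effect mentioned above.

```python
import gc
import time
import tracemalloc


def measure(func, *args, **kwargs):
    """Run func and return (result, elapsed_seconds, peak_bytes).

    Hypothetical helper: peak_bytes covers Python-level allocations only.
    """
    gc.collect()  # start from a collected heap so leftover garbage is not counted
    tracemalloc.start()
    start = time.perf_counter()
    try:
        result = func(*args, **kwargs)
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()  # returns (current, peak)
    finally:
        tracemalloc.stop()
    return result, elapsed, peak
```

Calling `measure` once per sync variant on the same input gives a time and peak-memory pair for each, which is enough for a rough side-by-side comparison.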

plot

@dralley
Contributor

dralley commented Jun 13, 2023

Also, I think when you scale this up to something like PyPI or Docker Hub mirroring, it would be much, much worse.

@dralley dralley closed this Jul 25, 2023