-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enable repodata.json.zst by default #13256
Comments
Let's first conduct a review of the current flagged feature and review how JLAP adoption in mamba and rattler project has been going before we commit to shipping it by default. E.g. in case there were learnings that we should apply to the CEP before rollout. |
@jezdez I guess this issue is about whether to use We have been using zstd compressed repodata in rattler (and thus pixi) since the start. I think mamba is also doing this? (@AntoinePrv ) This has drastically reduced the wait time for repodata. I think the biggest file is the conda-forge linux-64 zstd file which is about +-28MB. For reference, on a fairly typical 100mbit connection this takes about 2.5-3 seconds to download and decompress, which I think is fairly fast. I have a 500mbit internet connection so downloading and decompressing takes 0.5-1 second! From what I understood the on the fly gzip compressed file is rate limited which is why it takes longer to download. I think it makes total sense to enable this by default! |
In libmamba we have |
Yes, this is a separate issue from "enable jlap by default" because The community learned that repodata.json (uncompressed) is faster than repodata.json (Content-Encoding: gzip) if your bandwidth is > ~300Mbps. We may have to consider whether anaconda.org's on-the-fly zstd compression on non-CDN-mirrored channels loads that server. It will be much less server load than the old on-the-fly bzip2 compression however. |
|
Checklist
What is the idea?
repodata.json.zst is faster to download than repodata.json and will always be a win over the latter. Depending on your network versus disk bandwidth it may be faster than jlap, and it is usable on the first request e.g. in a CI system when repodata.json is not yet cached. Enable repodata.json.zst by default.
We could use the jlap network code, that already knows how to fetch from alternate URL's depending on whether
.zst
andjlap
is available or not, and add a separate "zst but not jlap" flag.Why is this needed?
The text was updated successfully, but these errors were encountered: