-
Notifications
You must be signed in to change notification settings - Fork 104
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
BOSH Stemcells are very slow to download #94
Comments
We have created an issue in Pivotal Tracker to manage this: https://www.pivotaltracker.com/story/show/166018258 The labels on this github issue will be updated when the story is started. |
I've got some actual numbers from GCP. I uploaded a stemcell to GCS and then measured downloading it from there and then from the original endpoint. GCP
AWS
8 seconds vs. nearly 180 seconds makes it seem like there's something fishy with the bucket setup. |
It just took me nearly 2 hours to download a BOSH-lite stemcell. 😭 Can I help you with any work to improve this? I expect we're wasting a tonne of time across the org. waiting for stemcell downloads (on workstations and in pipelines). |
At one point some downloads were being fronted by CloudFront CDN; not sure if/when/why that may have changed. I think there was some code to recognize and convert supported buckets to the CDN. Let me know if I can help. |
We enabled transfer acceleration from S3 a while ago, but it seems that you have to do some more leg work to actually use it. We are looking into that today. Update: It looks like we will have to change all the URLs to the accelerated endpoint. |
I also did a little experiment to mirror all of the stemcells into a GCS bucket using their storage transfer service. It was able to mirror everything (500GB) in around two minutes. This makes it seem like it might be an issue with our office (the peering between AWS and our ISP?) rather than the bucket itself? |
Interesting. That might be another issue. We were able to assert that turning on the acceleration brings the downloads closer to GCP levels. Here are the links: Fast
Slow
So for now, you can use the "fast" link as a template for future downloads as we figure out how to change all the URLs. |
Just tried it out and it works great - thank you! Want me to leave this open until the https://bosh.io change is made or should I close it? |
We can leave it open. |
Closing - stemcells were moved to accelerated buckets in https://www.pivotaltracker.com/story/show/168994545 |
When downloading stemcells from https://bosh.io/stemcells I'm getting download speeds a few orders of magnitude slower than what my connection is capable of. I'm on wired office internet and I can download files from Google Cloud Storage at around 70 megabytes per second (~500 megabits per second).
I've graphed the download speed of a typical stemcell download session speed below.
The specific stemcell I downloaded was: https://s3.amazonaws.com/bosh-core-stemcells/315.13/bosh-stemcell-315.13-google-kvm-ubuntu-xenial-go_agent.tgz
As you can see this is always an order of magnitude slower than my connection and sometimes drops to 2 orders of magnitude slower. This is really frustrating as these files are so large.
Do you have any idea what could be causing this?
The text was updated successfully, but these errors were encountered: