Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Uploading large files seems super slow #87

Closed
harshitha-bt opened this issue Aug 21, 2017 · 4 comments
Closed

Uploading large files seems super slow #87

harshitha-bt opened this issue Aug 21, 2017 · 4 comments

Comments

@harshitha-bt
Copy link

I am trying to upload a ~500MB file to HDFS running on another node. This operation seems to take more than 10-15 minutes. Is this expected?

@colinmarc
Copy link
Owner

Hi @harshitha-bt,

This is probably due to #84, a really unfortunate performance regression. A fix is on the way.

@colinmarc
Copy link
Owner

(Should be fixed now, in cdda132 and v1.0.4)

@harshitha-bt
Copy link
Author

My use case is to download the a multipart message of 700 MB from a rest endpoint and storing it locally and then did a client.CopyToRemote() which took 1 hour for ~700MB of data.
So, i tried using a fileWriter.Write(buf), but this is what i see from file_writer.go ( Write() ).

2017/08/27 20:25:38 offset and len of buffer = 0 706694149
2017/08/27 20:37:01 offset and len of buffer = 134217728 706694149

Even after the fix you mentioned here, it takes 10 mins to upload 1/7th of the file.
What is the best way to upload a file? CopyToRemote() or fileWriter.Write()?

@yetanotherbot
Copy link

Any update on this issue? I am seeing performance degradation from cross-zone uploads (also using fileWriter.Write(buf).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants