Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Sync command issue with many files. #447

Closed
eminegit opened this issue Jun 20, 2022 · 4 comments · Fixed by #483
Closed

Sync command issue with many files. #447

eminegit opened this issue Jun 20, 2022 · 4 comments · Fixed by #483
Assignees
Milestone

Comments

@eminegit
Copy link

Hi, the sync command works great with a few files. I am trying to sync 2 million files into the s3 bucket which has more than 15 million files but it stops after 5 minutes with a "killed" error message. I use AWS s3 sync command; it works fine but it is just slower than the s5cmd command.

@sonmezonur
Copy link
Member

@eminegit Hello! Could you share your s5cmd version please? We have released a new version to address this issue (v2.0.0-beta.2)

@eminegit
Copy link
Author

Hi, the version I have is v2.0.0-beta.2-c978e4a. I used 's5cmd_2.0.0-beta.2_Linux-64bit.tar.gz' file to install s5cmd.

@misuto
Copy link

misuto commented Jul 6, 2022

Sounds like this could be the same issue as in #441

@kucukaslan
Copy link
Contributor

Sounds like this could be the same issue as in #441

I agree that this is related to it.

I've tried to reproduce error. I've tried to sync from a directory with huge number of files. I received
fatal error: runtime: out of memory not a "killed" error in Linux. The errors probably means the same thing.

I've also outlined some of the possible solutions at #441 (comment).

@kucukaslan kucukaslan moved this from Todo to In Progress in s5cmd Aug 16, 2022
igungor pushed a commit that referenced this issue Jun 16, 2023
It uses external sort instead of in-memory sort to dramatically reduce memory usage in the expense of the speed.
It uses encoding/gob format to store to disk and restore from there.

Fixes #441
Fixes #447
@github-project-automation github-project-automation bot moved this from In Progress to Done in s5cmd Jun 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Archived in project
Development

Successfully merging a pull request may close this issue.

7 participants