Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

In CPU mode, smaller chunk_size does not reduce the memory use. #13

Closed
dawnmy opened this issue Apr 8, 2022 · 3 comments
Closed

In CPU mode, smaller chunk_size does not reduce the memory use. #13

dawnmy opened this issue Apr 8, 2022 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@dawnmy
Copy link
Member

dawnmy commented Apr 8, 2022

The memory use seems to be not related to the chunk_size setting but the total number of input sequence bases. Need to find a way to reduce the memory use for large input files.

@dawnmy dawnmy self-assigned this Apr 8, 2022
@dawnmy dawnmy added the enhancement New feature or request label Apr 8, 2022
@HongxiangXu
Copy link

I seems met similar problem, as I used 4 threads and 256 chunk size, which should use a little memory right? as 20 thread with 1024 chunk size will only use about 20G memory as it described. However, my PC was stuck and my task manager showed nearly all memory(16G) were consumed, please figured out this problems. Thanks very much!

@dawnmy
Copy link
Member Author

dawnmy commented Apr 20, 2022

I seems met similar problem, as I used 4 threads and 256 chunk size, which should use a little memory right? as 20 thread with 1024 chunk size will only use about 20G memory as it described. However, my PC was stuck and my task manager showed nearly all memory(16G) were consumed, please figured out this problems. Thanks very much!

Thank you for your interest in RiboDetector. I am working on the new version. Next release will solve this issue. BTW, how large is your input fastq file (number of nucleotides)?

@dawnmy dawnmy closed this as completed in e2207e5 Apr 21, 2022
dawnmy added a commit that referenced this issue Apr 21, 2022
Fixed issue #13 and updated help message by merging
@dawnmy
Copy link
Member Author

dawnmy commented Apr 21, 2022

@xhxlilium This issue has been solved in the latest version v0.2.6. It can be updated with pip. When running on your large input files, you can use smaller threads and chunk_size (e.g. --threads 10 chunk_size 400, which will use about 10GB RAM but may be up to 20GB). I suggest to run RiboDetector on a PC with at least 32GB RAM.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants