pbmm2 using too much memory #3

zhoudreames · 2021-09-02T01:58:00Z

my machine is down when i using pbmm2 aligning,my machine memory is 500G,and the hifi data 130G.how to solve it ?

armintoepfer · 2021-09-02T03:43:39Z

You can't use a full chip worth of Hifi reads as references. DeepConsensus is in the proof of concept stage, not meant for production yet. You can try this new tool, but there's no official support for it yet: https://github.com/PacificBiosciences/align-clr-to-ccs

AndrewCarroll · 2021-09-02T06:09:46Z

Hi @zhoudreames

As @armintoepfer indicates, DeepConsensus isn't yet scalable to run a full SMRT cell on an external machine. The v0.1 release still uses many systems that are efficient within Google's distributed systems, but are very inefficient when run on a single machine. We plan to improve this in future releases, but for now, it will be too slow for a full SMRT cell. If there is something you can use a targeted part of a sequence run, that might be better.

For the error you are encountering, I assume that this is in the pre-processing step? Thank you for pointing this out. This is one of the areas we will improve in the next release (potentially by using the tool that Armin linked). For now, to map the reads, you would need to use fewer reads for your input.

Thank you,
Andrew

zhoudreames · 2021-09-02T06:29:03Z

You can't use a full chip worth of Hifi reads as references. DeepConsensus is in the proof of concept stage, not meant for production yet. You can try this new tool, but there's no official support for it yet: https://github.com/PacificBiosciences/align-clr-to-ccs

thanks for your help

zhoudreames · 2021-09-02T06:33:52Z

Hi @zhoudreames

As @armintoepfer indicates, DeepConsensus isn't yet scalable to run a full SMRT cell on an external machine. The v0.1 release still uses many systems that are efficient within Google's distributed systems, but are very inefficient when run on a single machine. We plan to improve this in future releases, but for now, it will be too slow for a full SMRT cell. If there is something you can use a targeted part of a sequence run, that might be better.

For the error you are encountering, I assume that this is in the pre-processing step? Thank you for pointing this out. This is one of the areas we will improve in the next release (potentially by using the tool that Armin linked). For now, to map the reads, you would need to use fewer reads for your input.

Thank you,
Andrew
when will the next version releases ? How much do I need to split reads for the 500G menmory ？ thanks~

AndrewCarroll · 2021-09-11T08:46:34Z

Hi @zhoudreames

I cannot give you an estimate for when the next version will be ready. With a fair amount of certainty, I will say likely more than 1 month from now and less than 6 months.

The changes in the next release will probably change the memory use, and I cannot now give you an estimate of memory at that time.

AndrewCarroll closed this as completed Sep 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pbmm2 using too much memory #3

pbmm2 using too much memory #3

zhoudreames commented Sep 2, 2021

armintoepfer commented Sep 2, 2021

AndrewCarroll commented Sep 2, 2021

zhoudreames commented Sep 2, 2021

zhoudreames commented Sep 2, 2021

AndrewCarroll commented Sep 11, 2021

pbmm2 using too much memory #3

pbmm2 using too much memory #3

Comments

zhoudreames commented Sep 2, 2021

armintoepfer commented Sep 2, 2021

AndrewCarroll commented Sep 2, 2021

zhoudreames commented Sep 2, 2021

zhoudreames commented Sep 2, 2021

AndrewCarroll commented Sep 11, 2021