ERROR: Caught unhandled exception: std::bad_alloc in both 2.3.2 and 2.3.3 #46
Comments
Hi, bad_alloc usually means that the system ran out of memory. How much RAM does your machine have? It seems that you have raw reads at 100x coverage. You might try downsampling them to, say, 40x (taking the longest ones), which should reduce memory requirements. You should also be able to rerun the repeat resolution step with the entire set of reads afterwards. Note that the Canu reads have 30x coverage, which is why less memory was required. If you are running with error-corrected reads (which is also an option), make sure you are using the 'pacbio-corr' option, not 'pacbio-raw'.
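For anyone wanting to try the "keep the longest reads up to a target coverage" idea, here is a minimal Python sketch. It is not part of Flye; the function name `select_longest_reads` and its arguments are hypothetical, and it assumes you have already collected read names and lengths (for example from a FASTA/FASTQ index) and have a rough genome size estimate.

```python
def select_longest_reads(read_lengths, genome_size, target_coverage):
    """Pick the longest reads until their total length reaches
    target_coverage * genome_size.

    read_lengths: dict mapping read name -> read length in bases
                  (hypothetical input, built by the caller).
    Returns (list of selected read names, total selected bases).
    """
    target_bases = genome_size * target_coverage
    selected = []
    total = 0
    # Walk through the reads from longest to shortest.
    for name, length in sorted(read_lengths.items(),
                               key=lambda item: item[1], reverse=True):
        if total >= target_bases:
            break
        selected.append(name)
        total += length
    return selected, total


# Toy example: a "genome" of 10 bases downsampled to 12x (120 bases).
names, bases = select_longest_reads(
    {"a": 100, "b": 50, "c": 30, "d": 20}, genome_size=10, target_coverage=12
)
```

The selected names can then be used to extract the matching records with whatever sequence toolkit you already use. If the input has less than the target coverage, every read is kept.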
Thank you for your reply! The compute node I submitted the job to has 2 TB of RAM. I don't know if that's enough. I'll try to downsample the raw data. A basic question: apart from the memory required, what's the difference between using all the raw data (say 100X) and using downsampled data (say the longest 50X)?
You might have extra connectivity information in these 100x reads (you can resolve more repeats, for example). But some studies (the Canu paper, for example) suggest that you don't really need more than 40x in general (though this, of course, also depends on genome complexity, ploidy, etc.). Plus, extra coverage helps to get a good final consensus.
I see. Thank you!
Hi, I have run into the same issue: "ERROR: Caught unhandled exception: std::bad_alloc". Here is the error message. Thank you
Hi, this looks strange: for a genome of ~130 Mb and 30x coverage it should not use more than 50G. Does the node that you are using to run Flye have 300G of RAM (or are you referring to the total memory of all nodes)? Could you send me the file.log file? It would also be helpful if you could watch the memory consumption to make sure that it indeed ran out of memory. You can either watch top/htop manually, or use this script: https://github.com/jhclark/memusg.
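If neither top/htop nor memusg is convenient, one rough alternative is to launch the command from a small Python wrapper and read the peak resident set size afterwards. This is only a sketch using the standard `resource` module (Unix only); the helper name `run_and_report_peak_rss` is hypothetical, and the value covers child processes that have been waited for, so treat it as an approximation.

```python
import resource
import subprocess


def run_and_report_peak_rss(cmd):
    """Run cmd (a list of arguments) and return (exit_code, peak_rss).

    peak_rss is ru_maxrss for waited-for child processes: kilobytes on
    Linux, bytes on macOS. It reflects the largest child seen so far in
    this Python process, so use a fresh interpreter per measurement.
    """
    completed = subprocess.run(cmd)
    usage = resource.getrusage(resource.RUSAGE_CHILDREN)
    return completed.returncode, usage.ru_maxrss


# Usage (replace the list with the actual command line you want to measure):
# code, peak = run_and_report_peak_rss(["your_command", "arg1", "arg2"])
# print(f"exit code {code}, peak RSS {peak}")
```

A peak close to the node's physical memory (or to the scheduler's memory limit for the job) would support the out-of-memory explanation.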
Hi, I got this error message when using both version 2.3.2 and version 2.3.3.
The genome is about 2 Gb, and default parameters were used.
version 2.3.2
version 2.3.3
BTW, I also ran Flye 2.3.3 on the Canu-corrected reads, and it ran successfully. Here are the logs in case they're useful.
[2018-04-28 05:15:35] INFO: Running Flye 2.3.3-g47cdd0b
[2018-04-28 05:15:35] INFO: Assembling reads
[2018-04-28 05:15:36] INFO: Running with k-mer size: 17
[2018-04-28 05:15:36] INFO: Reading sequences
[2018-04-28 05:49:20] INFO: Reads N50/90: 22994 / 18323
[2018-04-28 05:49:22] INFO: Selected minimum overlap 5000
[2018-04-28 05:49:24] INFO: Expected read coverage: 34
[2018-04-28 05:49:24] INFO: Generating solid k-mer index
[2018-04-28 05:49:47] INFO: Counting kmers (1/2):
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-04-28 05:55:09] INFO: Counting kmers (2/2):
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-04-28 08:55:36] INFO: Filling index table
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-04-28 17:32:09] INFO: Extending reads
[2018-04-28 18:19:00] INFO: Overlap-based coverage: 20
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-05-03 00:13:25] INFO: Assembled 6725 draft contigs
[2018-05-03 00:13:57] INFO: Generating contig sequences
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-05-03 01:03:32] INFO: Running Minimap2
[2018-05-03 10:15:46] INFO: Computing consensus
[2018-05-03 11:18:10] INFO: Alignment error rate: 0.0299390805236
[2018-05-03 11:18:34] INFO: Performing repeat analysis
[2018-05-03 11:18:35] INFO: Reading sequences
[2018-05-03 11:50:14] INFO: Building repeat graph
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-05-03 18:02:10] INFO: Aligning reads to the graph
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-05-04 02:18:05] INFO: Aligned sequence: 137844577112 / 149133911062 (0.924301)
[2018-05-04 02:18:34] INFO: Mean edge coverage: 38
[2018-05-04 02:20:09] INFO: Resolving repeats
[2018-05-04 11:02:04] INFO: Generating contigs
[2018-05-04 12:05:35] INFO: Generated 17311 contigs
[2018-05-04 14:08:03] INFO: Polishing genome (1/1)
[2018-05-04 14:08:03] INFO: Running Minimap2
[2018-05-04 21:32:38] INFO: Separating alignment into bubbles
[2018-05-05 03:50:13] INFO: Alignment error rate: 0.0230640593152
[2018-05-05 03:50:14] INFO: Correcting bubbles
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
[2018-05-05 08:18:15] INFO: Assembly statistics:
Total length: 1886554189
Contigs: 13177
Scaffolds: 13049
Scaffolds N50: 315687
Largest scf: 2690265
Mean coverage: 34
[2018-05-05 08:18:15] INFO: Final assembly: /parastor300/niuyw/Project/Goqi_genome_180207/flye/run2/scaffolds.fasta
Do you know what could have caused it? Thanks in advance!
Best,
Yiwei Niu