FastANI still not fast enough? #10

biofuture · 2018-04-26T04:08:06Z

Dear Sir

I tried your fastANI to generate ANI to about 2000 genomes; the speed is quite slow. I run the program on a super node with 64 cores and 500 Gb memory. The software can only run in one single thread.

I know that you have already supplied a script to split genomes into smaller parts. But in one node, the speed is limited by the IO transfer if I run it parallel in one hard disk.

How did you generate the ANI among 80000 genomes? Can you give me some hint?

I tried to run it on our HPCF; however, for every single run, the memory requirements exceed 96 Gbs which is the configuration in most of our node.

I can only submit limited jobs (10) at one time, so I can just split the total jobs into less than 100 jobs rather than 1000 of jobs.

Thank you very much!

Xiaotao

cjain7 · 2018-04-30T13:54:15Z

Generating ANI for 2000 genomes should be pretty quick and should take less than 96G. In my latest run with 8000 genomes, FastANI used about 60G memory.

Could you double check your scripts that split the reference DB and call FastANI? Also see #6 .

cjain7 closed this as completed Aug 2, 2018

rainjy mentioned this issue Sep 6, 2019

Output is still empty after two days running #52

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FastANI still not fast enough? #10

FastANI still not fast enough? #10

biofuture commented Apr 26, 2018 •

edited

cjain7 commented Apr 30, 2018

FastANI still not fast enough? #10

FastANI still not fast enough? #10

Comments

biofuture commented Apr 26, 2018 • edited

cjain7 commented Apr 30, 2018

biofuture commented Apr 26, 2018 •

edited