run_all.py taking forever #14

nitinra · 2023-02-08T00:37:04Z

Hello,

I am running SwiftOrtho on 276 insect species, using the following command:

python run_all.py -i allprotein.fa -a 20

I started the run on Jan 30th and it is still on the first step (all-vs-all homology search). Is there any way I can make it run faster?

Regards,
Nitin


Rinoahu · 2023-03-07T20:44:06Z

How many protein sequences are in the file? Are they all from the same species?
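
For readers who need to answer the same question about their own input, here is a minimal Python sketch that counts the records in a FASTA file. It assumes every record starts with a `>` header line; `count_fasta_records` is a hypothetical helper written for illustration and is not part of SwiftOrtho.

```python
def count_fasta_records(path):
    """Count FASTA records by counting header lines that start with '>'."""
    count = 0
    # Stream the file line by line so large inputs never sit fully in memory.
    with open(path) as handle:
        for line in handle:
            if line.startswith(">"):
                count += 1
    return count
```

Calling `count_fasta_records("allprotein.fa")` would return the number of sequences in the combined file used in this thread.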

nitinra · 2023-03-07T21:36:51Z

Hello,

The species span the entire insect clade, so there are 276 different species. The combined FASTA file contains ~6839337 sequences.

How do I make it run faster? Thank you!

Regards,
Nitin

Rinoahu · 2023-03-08T03:50:26Z

Delete the old version.
git clone the latest one.
Install it according to the instructions.
Then run the command:
python run_all.py -i allprotein.fa -a 20 -s 11111111 -v 500
If the protein sequences have a lot of redundancy, you can also try the new tool:
python run_all_fast.py -i allprotein.fa -a 20 -s 11111111 -v 500

Generally, increasing the seed length or reducing the number of hits in the homology search can make it run faster.
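
One rough way to judge whether the input has much redundancy is to count exact duplicate sequences. The sketch below is only an illustration under stated assumptions: `exact_duplicate_stats` is a hypothetical helper, not part of SwiftOrtho, and it detects only identical sequences (ignoring case). That gives a lower bound on redundancy and may not match whatever notion of redundancy run_all_fast.py relies on.

```python
import hashlib


def exact_duplicate_stats(path):
    """Return (total_records, unique_sequences) for a FASTA file.

    Hypothetical helper: only exact, case-insensitive duplicates are detected.
    """
    seen = set()          # fixed-size digests of the sequences seen so far
    total = 0
    chunks = []           # sequence lines of the record being read
    have_record = False

    def flush():
        # Hash the joined sequence so the set holds 16-byte digests
        # rather than the full sequences.
        seq = "".join(chunks).upper()
        seen.add(hashlib.blake2b(seq.encode(), digest_size=16).digest())

    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if line.startswith(">"):
                if have_record:
                    flush()
                chunks.clear()
                have_record = True
                total += 1
            elif have_record and line:
                chunks.append(line)
    if have_record:
        flush()  # the last record has no following header to trigger a flush
    return total, len(seen)
```

If the second number is much smaller than the first, the file contains many identical proteins, and removing them before the all-vs-all search would shrink the input.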
