Starting Captus-assembly: EXTRACT (2024-08-13 05:40:27)  Welcome to the marker extraction step of Captus-assembly. In this step, Captus will use Scipio to search within your FASTA assemblies and recover any set of reference proteins provided through '--nuc_refs', '--ptd_refs', and/or ' --mit_refs'. Captus includes some reference protein sets to work with Plants like the 'Angiosperms353' protein set for nuclear genes, and two organellar protein reference sets 'SeedPlantsPTD' for plastids, and 'SeedPlantsMIT' for mitochondria.  If you have references that are not proteins (e.g. non-coding regions, gene sequences including introns, full mRNAs, etc.) You can provide these miscellaneous DNA references with '--dna_refs' and Captus will use BLAT to find and extract matches in your assemblies.  Finally, if you want to explore the usefulness of the contigs that were not hit by any protein or other DNA references after the Captus' extraction process, or simply if you do not have reference sets to test, you can try the option '--cluster_leftovers' to attempt sequence clustering across samples in order to discover homologous markers.  For more information, please see https://github.com/edgardomortiz/Captus Captus version: v1.0.1 Command: /dmz/home/vferreira/.conda/envs/captus/bin/captus_assembly extract -a 02_assemblies -n probe.fasta Max. RAM: 747.7GB (out of 755.3GB) Max. Threads: 72 (out of 72) Dependencies: Scipio : v1.4.1 OK ├─BioPerl: v1.7.8 OK └─YAML : v1.30 OK BLAT : v37x1 OK MMseqs2 : not used MAFFT : not used Python libraries: numpy: v2.0.0 OK pandas: v2.2.2 OK plotly: v5.22.0 OK Captus assemblies dir: /dmz/home/vferreira/captus_lycidae_working2/02_assemblies  VALID Captus assemblies directory with 27 samples and 27 'assembly.fasta' files Total assemblies found: 27 Output directory: /dmz/home/vferreira/captus_lycidae_working2/03_extractions Output directory already exists and files may be overwritten Protein and DNA Markers Extraction from FASTA assemblies (2024-08-13 05:40:39)  Now Captus will search within your FASTA assemblies for the set of reference proteins provided by '--nuc_refs', '--ptd_refs', and/or '--mit_refs' and extract the sequences both in nucleotide and translated to aminoacid using the specified '--nuc_transtable', '--ptd_transtable' and '--mit_transtable' respectively.  Additionally, miscellaneous DNA references provided with '--dna_refs' will also be extracted. Verifying protein reference sets... Translated 'probe.fasta' using Genetic Code: 1 [8.103s]  PROTEIN OPTIONS: Max. loci for Scipio x2: 2000 Predict dubious introns: False Concurrent extractions: 27 RAM per extraction: 27.7GB Threads per extraction: 2  Nuclear proteins: reference: /dmz/home/vferreira/captus_lycidae_working2/probe.captus.faa (Translated from probe.fasta) reference info: 2,260 loci, 40,190 sequences (loci names found, detected multiple sequences per locus) min_score: 0.13 min_identity: 65 min_coverage: 20 translation table: 1  Plastidial proteins: reference: not used  Mitochondrial proteins: reference: not used MISCELLANEOUS DNA OPTIONS: reference: not used  OUTPUT OPTIONS: Max. paralogs: -1 (Keep all paralogs) Max. separate loci files: 0 (Do not write separate loci files per sample) Overwrite files: False Keep all files: False Samples to process: 27 Extractions to process: 27 protein and 0 nucleotide Nuclear proteins: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/[Sample_name]__captus-ext/01_coding_NUC Annotated assemblies: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/[Sample_name]__captus-ext/06_assembly_annotated (These output directories will be created for each sample) Paths to references used: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/captus-assembly_extract.refs.json Extracting protein-coding markers with Scipio: 0%| | 0/27 [00:00 successfully completed [1h 17m 23.4s (4643.393s)] (2024-08-13 06:57:51) Starting Captus-assembly: EXTRACT (2024-08-13 09:22:17)  Welcome to the marker extraction step of Captus-assembly. In this step, Captus will use Scipio to search within your FASTA assemblies and recover any set of reference proteins provided through '--nuc_refs', '--ptd_refs', and/or ' --mit_refs'. Captus includes some reference protein sets to work with Plants like the 'Angiosperms353' protein set for nuclear genes, and two organellar protein reference sets 'SeedPlantsPTD' for plastids, and 'SeedPlantsMIT' for mitochondria.  If you have references that are not proteins (e.g. non-coding regions, gene sequences including introns, full mRNAs, etc.) You can provide these miscellaneous DNA references with '--dna_refs' and Captus will use BLAT to find and extract matches in your assemblies.  Finally, if you want to explore the usefulness of the contigs that were not hit by any protein or other DNA references after the Captus' extraction process, or simply if you do not have reference sets to test, you can try the option '--cluster_leftovers' to attempt sequence clustering across samples in order to discover homologous markers.  For more information, please see https://github.com/edgardomortiz/Captus Captus version: v1.0.1 Command: /dmz/home/vferreira/.conda/envs/captus/bin/captus_assembly extract -a 02_assemblies -n probe.fasta Max. RAM: 747.7GB (out of 755.3GB) Max. Threads: 72 (out of 72) Dependencies: Scipio : v1.4.1 OK ├─BioPerl: v1.7.8 OK └─YAML : v1.30 OK BLAT : v37x1 OK MMseqs2 : not used MAFFT : not used Python libraries: numpy: v2.0.0 OK pandas: v2.2.2 OK plotly: v5.22.0 OK Captus assemblies dir: /dmz/home/vferreira/captus_lycidae_working2/02_assemblies  VALID Captus assemblies directory with 27 samples and 27 'assembly.fasta' files Total assemblies found: 27 Output directory: /dmz/home/vferreira/captus_lycidae_working2/03_extractions Output directory already exists and files may be overwritten Protein and DNA Markers Extraction from FASTA assemblies (2024-08-13 09:22:18)  Now Captus will search within your FASTA assemblies for the set of reference proteins provided by '--nuc_refs', '--ptd_refs', and/or '--mit_refs' and extract the sequences both in nucleotide and translated to aminoacid using the specified '--nuc_transtable', '--ptd_transtable' and '--mit_transtable' respectively.  Additionally, miscellaneous DNA references provided with '--dna_refs' will also be extracted. Verifying protein reference sets... Translated 'probe.fasta' using Genetic Code: 1 [7.951s]  PROTEIN OPTIONS: Max. loci for Scipio x2: 2000 Predict dubious introns: False Concurrent extractions: 27 RAM per extraction: 27.7GB Threads per extraction: 2  Nuclear proteins: reference: /dmz/home/vferreira/captus_lycidae_working2/probe.captus.faa (Translated from probe.fasta) reference info: 2,260 loci, 40,190 sequences (loci names found, detected multiple sequences per locus) min_score: 0.13 min_identity: 65 min_coverage: 20 translation table: 1  Plastidial proteins: reference: not used  Mitochondrial proteins: reference: not used MISCELLANEOUS DNA OPTIONS: reference: not used  OUTPUT OPTIONS: Max. paralogs: -1 (Keep all paralogs) Max. separate loci files: 0 (Do not write separate loci files per sample) Overwrite files: False Keep all files: False Samples to process: 27 Extractions to process: 27 protein and 0 nucleotide Nuclear proteins: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/[Sample_name]__captus-ext/01_coding_NUC Annotated assemblies: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/[Sample_name]__captus-ext/06_assembly_annotated (These output directories will be created for each sample) Paths to references used: /dmz/home/vferreira/captus_lycidae_working2/03_extractions/captus-assembly_extract.refs.json Extracting protein-coding markers with Scipio: 0%| | 0/27 [00:00 sys.exit(main()) ^^^^^^ File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/site-packages/captus/captus_assembly.py", line 1424, in main CaptusAssembly() File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/site-packages/captus/captus_assembly.py", line 90, in __init__ getattr(self, args.command)() File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/site-packages/captus/captus_assembly.py", line 1074, in extract extract(full_command, args) File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/site-packages/captus/extract.py", line 505, in extract tqdm_parallel_nested_run(scipio_coding, scipio_params, d_msg, f_msg, File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/site-packages/captus/misc.py", line 158, in tqdm_parallel_nested_run result = future.result() ^^^^^^^^^^^^^^^ File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/concurrent/futures/_base.py", line 449, in result return self.__get_result() ^^^^^^^^^^^^^^^^^^^ File "/dmz/home/vferreira/.conda/envs/captus/lib/python3.12/concurrent/futures/_base.py", line 401, in __get_result raise self._exception ZeroDivisionError: division by zero 96%|██████████████████████████████▊ | 26/27 [3:30:15<08:05, 485.20s/extraction]