-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HPC job specifications #49
Comments
Hi @marymcelroy Your question depends on the available partitions/resources in your HPC. For example, in the Zorbas HPC, I would normally use the batch partition (1 node and 20 cores). Μy shell script would be something like #!/bin/bash -l #SBATCH --partition=batch module purge # unloads all previous loads module load singularity/3.7.1 #loads singularity singularity run -B /home1/christina/the_directory_where_the_mydata_folder_and_the_parameters_are/:/mnt/analysis /home1/christina/pema_v.2.1.4.sif module unload singularity/3.7.1 #unloads singularity |
Hi @marymcelroy. @cpavloud is right. Just a few more comments from my side:
In any case, if I had to guess, I would say that you wouldn't need more than 3 days time for any of your analysis no matter what parameters you ask for. A step that can really take some time if you have a great number of otus/asvs, is the Good luck! 💯 |
Thank you both very much for the suggestions! |
@marymcelroy I now close this issue and in case you d like to give us any feedback, please feel free to open a "new discussion". Thanks again for your interest on PEMA. |
Hi Haris! I'd like to use PEMA on some metabarcoding data for my graduate work. I successfully installed the Singularity image on my university's HPC environment, but I was hoping for some advice about how to estimate the HPC resources I would need to run PEMA in my job script (we use Slurm). Specifically, do you have any guidance for the #SBATCH specifications and values I should use?
For context, I have 96 samples that were PE sequenced for COI, 18S, and 16S amplicons (euk eDNA metabarcoding) on an Illumina MiSeq, so I have 576 fastq files as my raw sequencing data. I would like to use a custom ref db for COI, so I will follow your instructions about training the RDP classifier (I know this will likely affect computational load). Thank you!
The text was updated successfully, but these errors were encountered: