HTCondor files to run MrBayes on all files in a directory
Navigate to your working directory on the HTCondor submit node and type the following command:
git clone https://github.com/BotanyHunter/MrBayes.git
which will create a new directory called MrBayes and download:
- five python scripts
- this README file
- the MrBayes executable, mb, version 3.2.4 (1.815.501 bytes)
Create a new directory inside MrBayes and give it a name such as data.
Place all files on which you would like to run MrBayes (in nexus format) into the directory.
The file extensions must be ".nex". If necessary, see repository PhylogeneticFileConversions to batch convert
files from phylip format to nexus.
To only run MrBayes on a particular subset of taxa, create a file in the MrBayes directory with the format shown below. Of course, the names being those specimens to be included and the number being from 1 to the number to be included.
translate
1 taxa_name_1
2 taxa_name_2
:
:
37 taxa_name_last
;
Currently, six of MrBayes parameters can be set. All others will be set to their defaults. The six are
| parameter | meaning | default |
| -n | Number of generations | 1,000,000 |
| -f | Sampling frequency | 1,000 |
| -u | Proportion of samples to treat as burnin | 0.25 |
| -s | Number of substitution types | 2 |
| -r | Rates | gamma |
| -g | Amino acid model | None |
There are two additional parameters
| parameter | meaning | default |
| -i | Name of file with specimens to include | Include all specimens |
| -C | 1 if to only include genes found on all taxa | 0 |
Once these parameters are considered, run the setup program to create the HTCondor submit and dag file. Replace data_directory with the name of the directory with the data files.
python setup_mrbayes.py data_directory -i toInclude.txt -C 1 -n 5000000 -f 50000 -s 6
To run the job, simply enter the following command from the HTCondor submit node:
condor_submit_dag run_mrbayes.dag
The tree files, *.t, are gathered into a tarball, run_mrbayes.tar. This file should be renamed so as not to be overwritten. The details of each MrBayes run are placed in the /log directory.