-
Notifications
You must be signed in to change notification settings - Fork 2
ABGD
The Automatic Barcode Gap Discovery (ABGD, Puillandre et al., 2012) is a delimitation method based on genetic distances of single barconding genes. The main assumption of the method is that the divergence among organisms of the same species is smaller than the divergence among organisms of different species. This is the well-known "barcode gap" assumption which was initially introduced by Hebert et al., 2003.
The method performs the following steps in an attempt to automatically identify the barcoding gap:
-
Sorts the pairwise distances of the aligned sequences (there is the option for using substitution models for corrected distances).
-
Based on a prior belief on the intraspecific divergence (Pmin - Pmax) and a proxy (X) for the minimum width of the gap, the method identifies the largest gap and assumes it is corresponds to barcoding gap.
-
After identifying the barcoding gap, it divides the sequences into candidate species and repeats the above steps (1-2) to each of them.
ABGD is available as a website and as a standalone software implemented in C.
Input: Aligned fasta file or distance matrix
$ wget http://wwwabi.snv.jussieu.fr/public/abgd/last.tgz
$ tar -xvzf last.tgz
$ cd Abgd
$ make$ mafft BR_cob_57ind_no_out.fasta > BR_cob_57ind_no_out_mafft.fastaBR_cob_57ind_no_out_mafft.fasta
$ abgd -p 0.03 -P 0.03 -n 1 BR_cob_57ind_no_out_mafft.fasta$ ../abgd -p 0.001 -P 0.1 BR_cob_57ind_no_out_mafft.fastaoutput files:
BR_cob_57ind_no_out_mafft.abgd.svg
BR_cob_57ind_no_out_mafft.disthist.svg
BR_cob_57ind_no_out_mafft.rank.svg
$ ../abgd -p 0.001 -P 0.1 BR_cob_57ind_no_out_mafft.fasta -X 1
$ ../abgd -p 0.001 -P 0.1 BR_cob_57ind_no_out_mafft.fasta -X 2
$ ../abgd -p 0.001 -P 0.1 BR_cob_57ind_no_out_mafft.fasta -X 3