Skip to content

Commit

Permalink
Update tests for extended phylip format
Browse files Browse the repository at this point in the history
* Update expected output for test_AlignIO, test_SeqIO
* Apply most of the same exceptions affecting the phylip format to
  extended phylip
* Add a sample extended phylip format-file (primates.phyx, from the
  Bodega Phylogenetics Wiki:
  http://bodegaphylo.wikispot.org/RAxML_Tutorial)
  • Loading branch information
Connor McCoy authored and peterjc committed Jun 18, 2011
1 parent f4d2001 commit 4d5e57a
Show file tree
Hide file tree
Showing 6 changed files with 442 additions and 5 deletions.
234 changes: 234 additions & 0 deletions Tests/ExtendedPhylip/primates.phyx
@@ -0,0 +1,234 @@
12 898
Tarsius_syrichta AAGTTTCATT GGAGCCACCA CTCTTATAAT TGCCCATGGC CTCACCTCCT
Lemur_catta AAGCTTCATA GGAGCAACCA TTCTAATAAT CGCACATGGC CTTACATCAT
Homo_sapiens AAGCTTCACC GGCGCAGTCA TTCTCATAAT CGCCCACGGG CTTACATCCT
Pan AAGCTTCACC GGCGCAATTA TCCTCATAAT CGCCCACGGA CTTACATCCT
Gorilla AAGCTTCACC GGCGCAGTTG TTCTTATAAT TGCCCACGGA CTTACATCAT
Pongo AAGCTTCACC GGCGCAACCA CCCTCATGAT TGCCCATGGA CTCACATCCT
Hylobates AAGCTTTACA GGTGCAACCG TCCTCATAAT CGCCCACGGA CTAACCTCTT
Macaca_fuscus AAGCTTTTCC GGCGCAACCA TCCTTATGAT CGCTCACGGA CTCACCTCTT
Macaca_mulatta AAGCTTTTCT GGCGCAACCA TCCTCATGAT TGCTCACGGA CTCACCTCTT
Macaca_fascicularis AAGCTTCTCC GGCGCAACCA CCCTTATAAT CGCCCACGGG CTCACCTCTT
Macaca_sylvanus AAGCTTCTCC GGTGCAACTA TCCTTATAGT TGCCCATGGA CTCACCTCTT
Saimiri_sciureus AAGCTTCACC GGCGCAATGA TCCTAATAAT CGCTCACGGG TTTACTTCGT

CCCTATTATT TTGCCTAGCA AATACAAACT ACGAACGAGT CCACAGTCGA
CCATATTATT CTGTCTAGCC AACTCTAACT ACGAACGAAT CCATAGCCGT
CATTACTATT CTGCCTAGCA AACTCAAACT ACGAACGCAC TCACAGTCGC
CATTATTATT CTGCCTAGCA AACTCAAATT ATGAACGCAC CCACAGTCGC
CATTATTATT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC
CCCTACTGTT CTGCCTAGCA AACTCAAACT ACGAACGAAC CCACAGCCGC
CCCTGCTATT CTGCCTTGCA AACTCAAACT ACGAACGAAC TCACAGCCGC
CCATATATTT CTGCCTAGCC AATTCAAACT ATGAACGCAC TCACAACCGT
CCATATATTT CTGCCTAGCC AATTCAAACT ATGAACGCAC TCACAACCGT
CCATGTATTT CTGCTTGGCC AATTCAAACT ATGAGCGCAC TCATAACCGT
CCATATACTT CTGCTTGGCC AACTCAAACT ACGAACGCAC CCACAGCCGC
CTATGCTATT CTGCCTAGCA AACTCAAATT ACGAACGAAT TCACAGCCGA

ACAATAGCAC TAGCCCGTGG CCTTCAAACC CTATTACCTC TTGCAGCAAC
ACAATACTAC TAGCACGAGG GATCCAAACC ATTCTCCCTC TTATAGCCAC
ATCATAATCC TCTCTCAAGG ACTTCAAACT CTACTCCCAC TAATAGCTTT
ATCATAATTC TCTCCCAAGG ACTTCAAACT CTACTCCCAC TAATAGCCTT
ATCATAATTC TCTCTCAAGG ACTCCAAACC CTACTCCCAC TAATAGCCCT
ATCATAATCC TCTCTCAAGG CCTTCAAACT CTACTCCCCC TAATAGCCCT
ATCATAATCC TATCTCGAGG GCTCCAAGCC TTACTCCCAC TGATAGCCTT
ACCATACTAC TGTCCCGAGG ACTTCAAATC CTACTTCCAC TAACAGCCTT
ACCATACTAC TGTCCCGGGG ACTTCAAATC CTACTTCCAC TAACAGCTTT
ACCATACTAC TATCCCGAGG ACTTCAAATT CTACTTCCAT TGACAGCCTT
ATCATACTAC TATCCCGAGG ACTCCAAATC CTACTCCCAC TAACAGCCTT
ACAATAACAT TTACTCGAGG GCTCCAAACA CTATTCCCGC TTATAGGCCT

ATGATGACTC CTCGCCAGCT TAACCAACCT GGCCCTTCCC CCAACAATTA
CTGATGACTA CTCGCCAGCC TAACTAACCT AGCCCTACCC ACCTCTATCA
TTGATGACTT CTAGCAAGCC TCGCTAACCT CGCCTTACCC CCCACTATTA
TTGATGACTC CTAGCAAGCC TCGCTAACCT CGCCCTACCC CCTACCATTA
TTGATGACTT CTGGCAAGCC TCGCCAACCT CGCCTTACCC CCCACCATTA
CTGATGACTT CTAGCAAGCC TCACTAACCT TGCCCTACCA CCCACCATCA
CTGATGACTC GCAGCAAGCC TCGCTAACCT CGCCCTACCC CCCACTATTA
TTGATGATTA ACAGCAAGCC TTACTAACCT TGCCCTACCC CCCACTATCA
CTGATGATTA ACAGCAAGCC TTACTAACCT TGCCCTACCC CCCACTATCA
CTGATGACTC ACAGCAAGCC TTACTAACCT TGCCCTACCC CCCACTATTA
CTGATGATTC ACAGCAAGCC TTACTAATCT TGCTCTACCC TCCACTATTA
CTGATGACTC CTAGCAAATC TCGCTAACCT CGCCCTACCC ACAGCTATTA

ATTTAATCGG TGAACTGTCC GTAATAATAG CAGCATTTTC ATGGTCACAC
ATTTAATTGG CGAACTATTC GTCACTATAG CATCCTTCTC ATGATCAAAC
ACCTACTGGG AGAACTCTCT GTGCTAGTAA CCACGTTCTC CTGATCAAAT
ATCTCCTAGG GGAACTCTCC GTGCTAGTAA CCTCATTCTC CTGATCAAAT
ACCTACTAGG AGAGCTCTCC GTACTAGTAA CCACATTCTC CTGATCAAAC
ACCTTCTAGG AGAACTCTCC GTACTAATAG CCATATTCTC TTGATCTAAC
ACCTCCTAGG TGAACTCTTC GTACTAATGG CCTCCTTCTC CTGGGCAAAC
ATCTACTAGG TGAACTCTTT GTAATCGCAA CCTCATTCTC CTGATCCCAT
ACCTACTAGG TGAACTCTTT GTAATCGCGA CCTCATTCTC CTGGTCCCAT
ATCTACTAGG CGAACTCTTT GTAATCACAA CTTCATTTTC CTGATCCCAT
ATCTACTGGG CGAACTCTTC GTAATCGCAA CCTCATTTTC CTGATCCCAC
ATCTAGTAGG AGAATTACTC ACAATCGTAT CTTCCTTCTC TTGATCCAAC

CTAACTATTA TCTTAGTAGG CCTTAACACC CTTATCACCG CCCTATATTC
ATTACAATTA TCTTAATAGG CTTAAATATG CTCATCACCG CTCTCTATTC
ATCACTCTCC TACTTACAGG ACTCAACATA CTAGTCACAG CCCTATACTC
ACCACTCTCC TACTCACAGG ATTCAACATA CTAATCACAG CCCTGTACTC
ACCACCCTTT TACTTACAGG ATCTAACATA CTAATTACAG CCCTGTACTC
ATCACCATCC TACTAACAGG ACTCAACATA CTAATCACAA CCCTATACTC
ACTACTATTA CACTCACCGG GCTCAACGTA CTAATCACGG CCCTATACTC
ATCACCATTA TGCTAACAGG ACTTAACATA TTAATTACGG CCCTCTACTC
ATCACCATTA TATTAACAGG ATTTAACATA CTAATTACGG CCCTCTACTC
ATCACCATTG TGTTAACGGG CCTTAATATA CTAATCACAG CCCTCTACTC
ATCACCATCA TACTAACAGG ACTGAACATA CTAATTACAG CCCTCTACTC
TTTACTATTA TATTCACAGG ACTTAATATA CTAATTACAG CACTCTACTC

CCTATATATA CTAATCATAA CTCAACGAGG AAAATACACA TATCATATCA
CCTCTATATA TTAACTACTA CACAACGAGG AAAACTCACA TATCATTCGC
CCTCTACATA TTTACCACAA CACAATGGGG CTCACTCACC CACCACATTA
CCTCTACATG TTTACCACAA CACAATGAGG CTCACTCACC CACCACATTA
CCTTTATATA TTTACCACAA CACAATGAGG CCCACTCACA CACCACATCA
TCTCTATATA TTCACCACAA CACAACGAGG TACACCCACA CACCACATCA
CCTTTACATA TTTATCATAA CACAACGAGG CACACTTACA CACCACATTA
TCTCCACATA TTCACTACAA CACAACGAGG AACACTCACA CATCACATAA
CCTCCACATA TTCACCACAA CACAACGAGG AGCACTCACA CATCACATAA
TCTCCACATG TTCATTACAG TACAACGAGG AACACTCACA CACCACATAA
TCTTCACATA TTCACCACAA CACAACGAGG AGCGCTCACA CACCACATAA
ACTTCATATG TATGCCTCTA CACAGCGAGG TCCACTTACA TACAGCACCA

ACAATATCAT GCCCCCTTTC ACCCGAGAAA ATACATTAAT AATCATACAC
ACAACCTAAA CCCATCCTTT ACACGAGAAA ACACCCTTAT ATCCATACAC
ACAACATAAA ACCCTCATTC ACACGAGAAA ACACCCTCAT GTTCATACAC
ATAACATAAA GCCCTCATTC ACACGAGAAA ATACTCTCAT ATTTTTACAC
CCAACATAAA ACCCTCATTT ACACGAGAAA ACATCCTCAT ATTCATGCAC
ACAACATAAA ACCTTCTTTC ACACGCGAAA ATACCCTCAT GCTCATACAC
AAAACATAAA ACCCTCACTC ACACGAGAAA ACATATTAAT ACTTATGCAC
TCAACATAAA GCCCCCCTTC ACACGAGAAA ACACATTAAT ATTCATACAC
TCAACATAAA ACCCCCCTTC ACACGAGAAA ACATATTAAT ATTCATACAC
TCAATATAAA ACCCCCCTTC ACACGAGAAA ACATATTAAT ATTCATACAC
TTAACATAAA ACCACCTTTC ACACGAGAAA ACATATTAAT ACTCATACAC
GCAATATAAA ACCAATATTT ACACGAGAAA ATACGCTAAT ATTTATACAT

CTATTTCCCT TAATCCTACT ATCTACCAAC CCCAAAGTAA TTATAGGAAC
ATACTCCCCC TTCTCCTATT TACCTTAAAC CCCAAAATTA TTCTAGGACC
CTATCCCCCA TTCTCCTCCT ATCCCTCAAC CCCGACATCA TTACCGGGTT
CTATCCCCCA TCCTCCTTCT ATCCCTCAAT CCTGATATCA TCACTGGATT
CTATCCCCCA TCCTCCTCCT ATCCCTCAAC CCCGATATTA TCACCGGGTT
CTATCCCCCA TCCTCCTCTT ATCCCTCAAC CCCAGCATCA TCGCTGGGTT
CTCTTCCCCC TCCTCCTCCT AACCCTCAAC CCTAACATCA TTACTGGCTT
CTCGCTCCAA TTATCCTTCT ATCCCTCAAC CCCAACATCA TCCTGGGGTT
CTCGCTCCAA TCATCCTCCT ATCTCTCAAC CCCAACATCA TCCTGGGGTT
CTCGCTCCAA TTATCCTTCT ATCTCTCAAC CCCAACATCA TCCTGGGGTT
CTCGCTCCAA TTATTCTTCT ATCTCTTAAC CCCAACATCA TTCTAGGATT
ATAACACCAA TCCTCCTCCT TACCTTGAGC CCCAAGGTAA TTATAGGACC

CATGTACTGT AAATATAGTT TAAACAAAAC ATTAGATTGT GAGTCTAATA
CACGTACTGT AAATATAGTT TAAA-AAAAC ACTAGATTGT GAATCCAGAA
TTCCTCTTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGACA
CACCTCCTGT AAATATAGTT TAACCAAAAC ATCAGATTGT GAATCTGATA
CGCCTACTGT AAATATAGTT TAACCAAAAC ATTAGATTGT GAATCTAATA
TACTCCCTGT AAACATAGTT TAATCAAAAC ATTAGATTGT GAATCTAACA
TACCTCCTGT AGATATAGTT TAACTAAAAC ACTAGATTGT GAATCTAACC
TACTTCCTGT AGATATAGTT TAACTAAAAC ATTAGATTGT GAATCTAACC
TACCTCCTGT AAATATAGTT TAACTAAAAC ATTAGATTGT GAATCTAACT
TACTTCCTGT AAATATAGTT TAATTAAAAC ATTAGACTGT GAATCTAACT
CTCACCTTGT AATTATAGTT TAGCTAAAAC ATTAGATTGT GAATCTAATA

ATAGAAGCCC AAAGATTTCT TATTTACCAA GAAAGTA-TG CAAGAACTGC
ATAGAAGCTC AAAC-CTTCT TATTTACCGA GAAAGTAATG TATGAACTGC
ACAGAGGCTT A-CGACCCCT TATTTACCGA GAAAGCT-CA CAAGAACTGC
ACAGAGGCTC A-CGACCCCT TATTTACCGA GAAAGCT-TA TAAGAACTGC
ACAGAGGCTC A-CAACCCCT TATTTACCGA GAAAGCT-CG TAAGAGCTGC
ATAGGGCCCC A-CAACCCCT TATTTACCGA GAAAGCT-CA CAAGAACTGC
ATAGAGGCTC G-AAACCTCT TGCTTACCGA GAAAGCC-CA CAAGAACTGC
ATAGAGACTC A-CCACCTCT TATTTACCGA GAAAACT-CG CAAGGACTGC
ATAGAGACTT A-CCACCTCT TATTTACCGA GAAAACT-CG CGAGGACTGC
ATAGAGGCCT A-CCACTTCT TATTTACCGA GAAAACT-CG CAAGGACTGC
ATAGAAGCTT A-CCACTTCT TATTTACCGA GAAAACT-TG CAAGGACCGC
ATAGAAGAAT A-TAACTTCT TAATTACCGA GAAAGTG-CG CAAGAACTGC

TAACTCATGC CTCCATATAT AACAATGTGG CTTTCTT-AC TTTTAAAGGA
TAACTCTGCA CTCCGTATAT AAAAATACGG CTATCTCAAC TTTTAAAGGA
TAACTCATGC CCCCATGTCT AACAACATGG CTTTCTCAAC TTTTAAAGGA
TAATTCATAT CCCCATGCCT GACAACATGG CTTTCTCAAC TTTTAAAGGA
TAACTCATAC CCCCGTGCTT GACAACATGG CTTTCTCAAC TTTTAAAGGA
TAACTCTCAC T-CCATGTGT GACAACATGG CTTTCTCAGC TTTTAAAGGA
TAACTCACTA TCCCATGTAT GACAACATGG CTTTCTCAAC TTTTAAAGGA
TAACCCATGT ACCCGTACCT AAAATTACGG TTTTCTCAAC TTTTAAAGGA
TAACCCATGT ATCCGTACCT AAAATTACGG TTTTCTCAAC TTTTAAAGGA
TAATCCATGC CTCCGTACTT AAAACTACGG TTTCCTCAAC TTTTAAAGGA
TAATCCACAC CTCCGTACTT AAAACTACGG TTTTCTCAAC TTTTAAAGGA
TAATTCATGC TCCCAAGACT AACAACTTGG CTTCCTCAAC TTTTAAAGGA

TAGAAGTAAT CCATCGGTCT TAGGAACCGA AAA-ATTGGT GCAACTCCAA
TAGAAGTAAT CCATTGGCCT TAGGAGCCAA AAA-ATTGGT GCAACTCCAA
TAACAGCTAT CCATTGGTCT TAGGCCCCAA AAATTTTGGT GCAACTCCAA
TAACAGCCAT CCGTTGGTCT TAGGCCCCAA AAATTTTGGT GCAACTCCAA
TAACAGCTAT CCATTGGTCT TAGGACCCAA AAATTTTGGT GCAACTCCAA
TAACAGCTAT CCCTTGGTCT TAGGATCCAA AAATTTTGGT GCAACTCCAA
TAACAGCTAT CCATTGGTCT TAGGACCCAA AAATTTTGGT GCAACTCCAA
TAACAGCTAT CCATTGACCT TAGGAGTCAA AAACATTGGT GCAACTCCAA
TAACAGCTAT CCATTGACCT TAGGAGTCAA AAATATTGGT GCAACTCCAA
TAACAGCTAT CCATTGACCT TAGGAGTCAA AAACATTGGT GCAACTCCAA
TAACAGCTAT CCATTGGCCT TAGGAGTCAA AAATATTGGT GCAACTCCAA
TAGTAGTTAT CCATTGGTCT TAGGAGCCAA AAACATTGGT GCAACTCCAA

ATAAAAGTAA TAAATTTATT TTCATCCTCC ATTTTACTAT CACTTACACT
ATAAAAGTAA TAAATCTATT ATCCTCTTTC ACCCTTGTCA CACTGATTAT
ATAAAAGTAA TAACCATGCA CACTACTATA ACCACCCTAA CCCTGACTTC
ATAAAAGTAA TAACCATGTA TACTACCATA ACCACCTTAA CCCTAACTCC
ATAAAAGTAA TAACTATGTA CGCTACCATA ACCACCTTAG CCCTAACTTC
ATAAAAGTAA CAGCCATGTT TACCACCATA ACTGCCCTCA CCTTAACTTC
ATAAAAGTAA TAGCAATGTA CACCACCATA GCCATTCTAA CGCTAACCTC
ATAAAAGTAA TAATCATGCA CACCCCCATC ATTATAACAA CCCTTATCTC
ATAAAAGTAA TAATCATGCA CACCCCTATC ATAATAACAA CCCTTATCTC
ATAAAAGTAA TAATCATGCA CACCCCCATC ATAATAACAA CCCTCATCTC
ATAAAAGTAA TAATCATGTA TACCCCCATC ATAATAACAA CTCTCATCTC
ATAAAAGTAA TA---ATACA CTTCTCCATC ACTCTAATAA CACTAATTAG

CTTAATTACC CCATTTATTA TTACAACAAC TAAAAAATAT GAAACACATG
CCTAACTTTA CCTATCATTA TAAACGTTAC AAACATATAC AAAAACTACC
CCTAATTCCC CCCATCCTTA CCACCCTCGT TAACCCTAAC AAAAAAAACT
CTTAATTCTC CCCATCCTCA CCACCCTCAT TAACCCTAAC AAAAAAAACT
CTTAATTCCC CCTATCCTTA CCACCTTCAT CAATCCTAAC AAAAAAAGCT
CCTAATCCCC CCCATTACCG CTACCCTCAT TAACCCCAAC AAAAAAAACC
CCTAATTCCC CCCATTACAG CCACCCTTAT TAACCCCAAT AAAAAGAACT
CCTAACTCTC CCAATTTTTG CCACCCTCAT CAACCCTTAC AAAAAACGTC
CCTAACTCTC CCAATTTTTG CCACCCTCAT CAACCCTTAC AAAAAACGTC
CCTGACCCTT CCAATTTTTG CCACCCTCAC CAACCCCTAT AAAAAACGTT
CCTAACTCTT CCAATTTTCG CTACCCTTAT CAACCCCAAC AAAAAACACC
CCTACTAGCG CCAATCCTAG CTACCCTCAT TAACCCTAAC AAAAGCACAC

CATACCCTTA CTACGTAAAA AACTCTATCG CCTGCGCATT TATAACAAGC
CCTATGCACC ATACGTAAAA TCTTCTATTG CATGTGCCTT CATCACTAGC
CATACCCCCA TTATGTAAAA TCCATTGTCG CATCCACCTT TATTATCAGT
CATATCCCCA TTATGTGAAA TCCATTATCG CGTCCACCTT TATCATTAGC
CATACCCCCA TTACGTAAAA TCTATCGTCG CATCCACCTT TATCATCAGC
CATACCCCCA CTATGTAAAA ACGGCCATCG CATCCGCCTT TACTATCAGC
TATACCCGCA CTACGTAAAA ATGACCATTG CCTCTACCTT TATAATCAGC
CATACCCAGA TTACGTAAAA ACAACCGTAA TATATGCTTT CATCATCAGC
CATACCCAGA TTACGTAAAA ACAACCGTAA TATATGCTTT CATCATCAGC
CATACCCAGA CTACGTAAAA ACAACCGTAA TATATGCTTT TATTACCAGT
TATATCCAAA CTACGTAAAA ACAGCCGTAA TATATGCTTT CATTACCAGC
TATACCCGTA CTACGTAAAA CTAGCCATCA TCTACGCCCT CATTACCAGT

CTAGTCCCAA TGCTCATATT TCTATACACA AATCAAGAAA TAATCATTTC
CTCATCCCAA CTATATTATT TATCTCCTCA GGACAAGAAA CAATCATTTC
CTCTTCCCCA CAACAATATT CATGTGCCTA GACCAAGAAG TTATTATCTC
CTTTTCCCCA CAACAATATT CATATGCCTA GACCAAGAAG CTATTATCTC
CTCTTCCCCA CAACAATATT TCTATGCCTA GACCAAGAAG CTATTATCTC
CTTATCCCAA CAACAATATT TATCTGCCTA GGACAAGAAA CCATCGTCAC
CTATTTCCCA CAATAATATT CATGTGCACA GACCAAGAAA CCATTATTTC
CTCCCCTCAA CAACTTTATT CATCTTCTCA AACCAAGAAA CAACCATTTG
CTCCCCTCAA CAACTTTATT CATCTTCTCA AACCAAGAAA CAACCATTTG
CTCCCCTCAA CAACCCTATT CATCCTCTCA AACCAAGAAA CAACCATTTG
CTCTCTTCAA CAACTTTATA TATATTCTTA AACCAAGAAA CAATCATCTG
ACCTTATCTA TAATATTCTT TATCCTTACA GGCCAAGAAT CAATAATTTC

CAACTGACAT TGAATAACGA TTCATACTAT CAAATTATGC CTAAGCTT
CAACTGACAT TGAATAACAA TCCAAACCCT AAAACTATCT ATTAGCTT
GAACTGACAC TGAGCCACAA CCCAAACAAC CCAGCTCTCC CTAAGCTT
AAACTGGCAC TGAGCAACAA CCCAAACAAC CCAGCTCTCC CTAAGCTT
AAGCTGACAC TGAGCAACAA CCCAAACAAT TCAACTCTCC CTAAGCTT
AAACTGATGC TGAACAACCA CCCAGACACT ACAACTCTCA CTAAGCTT
AAACTGACAC TGAACTGCAA CCCAAACGCT AGAACTCTCC CTAAGCTT
GAGCTGACAT TGAATAATGA CCCAAACACT AGACCTAACG CTAAGCTT
AAGCTGACAT TGAATAATAA CCCAAACACT AGACCTAACA CTAAGCTT
GAGTTGACAT TGAATAACAA CCCAAACATT AGACCTAACA CTAAGCTT
AAGCTGGCAC TGAATAATAA CCCAAACACT AAGCCTAACA TTAAGCTT
AAACTGACAC TGAATAACTA TCCAAACCAT CAAACTATCC CTAAGCTT

0 comments on commit 4d5e57a

Please sign in to comment.