Deep analyses of eumetazoan opsin phylogeny
See COMMANDLINE.md for list of commands and arguments used for analyses
Shell script for formatting fastas into blastable databases
Shell script for BLASTp search against blast dbs with a fasta containing query seqs. Retrieves the sequence data back
Shell script, runs CD-HIT to remove redundancy, adds bait and anchor seqs in, and aligns using MAFFT
Shell script using the perl seqConverter.pl script to convert alignment files to phylip
Shell script making ML trees using PROTGAMMAGTR model in RAxML. Must be executed in directory containing phylip files
R script for first round of phylogenetic focusing. Filtering out distantly related sequences in a taxon specific manner
(loop under construction). Shell script for retrieving sequence data for all the genes that were included in the trees export from tree_editor.R script
Shell script to concatenate all fastas together into the "total" dataset and aligns using MAFFT
Perl script: you provide file with the names of sequences wanted, the original fasta containing all the sequence data, and it outputs a fasta file containing the sequence data only from the names you provided
Perl script for converting alignments into phylip files
Python script for changing names back to original format after the alignment program PASTA strips capital letters and underscores
Python script for pruning branches that exceed a user defined branch length in a phylogeny
Python script: given a fasta file and the text file containing the headers of unwanted seqs, this script outputs a new file lacking the unwamted seqs specified in the text file