Biotools


What is it?

A collection of bioinformatics scripts for specific tasks, including automated multiple sequence alignment and protein comparisons such as finding accepted amino acid regions. Most tools are written in Python 3.6.

Installation

To use the tools in this package, clone or download the repository and run the scripts from the command line. The examples below show the available options for each script. In addition to this documentation, each script contains a description and a usage example.

Script Docs

is_aligned.py

Checks whether a FASTA file, or every FASTA file in a directory, is aligned. The check is based on the presence of dashes (gap characters), which makes it useful for verifying large sets of alignment files before running them through an analysis.

Options

-path the directory or file to check

Example

./is_aligned.py -path /example/file.fasta
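The dash-based check described above can be sketched roughly as follows. This is a minimal illustration, not the code in is_aligned.py: the function names `read_fasta` and `looks_aligned` are hypothetical, and the extra equal-length test is an added assumption rather than something the script is documented to do.

```python
def read_fasta(text):
    """Parse FASTA text into a dict of {header: sequence}."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def looks_aligned(records):
    """Heuristic (assumption): equal lengths and at least one '-' gap."""
    seqs = list(records.values())
    if not seqs:
        return False
    same_length = len({len(s) for s in seqs}) == 1
    has_gap = any("-" in s for s in seqs)
    return same_length and has_gap


if __name__ == "__main__":
    example = ">seq1\nMK-L\n>seq2\nMKAL\n"
    print(looks_aligned(read_fasta(example)))
```

Note that a heuristic based on dashes reports a gap-free alignment as unaligned, so a negative result is a prompt to inspect the file rather than proof that it is unaligned.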

prot_comp.py

Computes comparisons between two groups of sequences in a FASTA alignment file or a directory of such files. Comparison options are specified in a control file.

Example

./prot_comp.py control.json

muscle_automate.py

Uses the MUSCLE command line program to align a FASTA file or a directory of FASTA files. (NOTE: the MUSCLE executable must be on your PATH and must be named 'muscle' so that subprocess can call it.)

Example

./muscle_automate.py fasta_files/ output_dir/
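As a rough sketch of what calling MUSCLE through subprocess can look like, the snippet below builds one command per input file and runs it. The helper names `muscle_command` and `align_all` are hypothetical and this is not the code in muscle_automate.py. It assumes the MUSCLE v3 style flags `-in` and `-out`; MUSCLE v5 uses `-align` and `-output` instead.

```python
import shutil
import subprocess
from pathlib import Path


def muscle_command(fasta_path, output_dir):
    """Build a MUSCLE v3-style command for one input file (assumed flags)."""
    fasta_path = Path(fasta_path)
    out_path = Path(output_dir) / fasta_path.name
    return ["muscle", "-in", str(fasta_path), "-out", str(out_path)]


def align_all(input_path, output_dir):
    """Run MUSCLE on a single FASTA file or on every file in a directory."""
    # The executable must be on PATH and be named 'muscle'.
    if shutil.which("muscle") is None:
        raise RuntimeError("'muscle' was not found on PATH")
    input_path = Path(input_path)
    if input_path.is_dir():
        files = sorted(p for p in input_path.iterdir() if p.is_file())
    else:
        files = [input_path]
    Path(output_dir).mkdir(parents=True, exist_ok=True)
    for fasta in files:
        # check=True raises CalledProcessError if MUSCLE exits non-zero.
        subprocess.run(muscle_command(fasta, output_dir), check=True)
```

Checking for the executable with `shutil.which` before the loop gives a clear error up front instead of a failure on the first file.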

pal2nal_automate.py

Runs pal2nal on a single file or a directory in order to automate the computation over a large data set. (pal2nal is a program that creates codon alignments from protein alignments and mRNA sequences.)

Example

./pal2nal_automate.py peptide_alignments/ mrna_alignments/ output_dir/
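One way to automate pal2nal over two directories is to pair each peptide alignment with the mRNA file of the same base name and run the program once per pair. The sketch below illustrates that idea only. The helpers `pair_by_stem`, `pal2nal_command` and `run_pair` are hypothetical, pairing by file name is an assumption about how the inputs are organized, and the executable is assumed to be installed on PATH as `pal2nal.pl`.

```python
import subprocess
from pathlib import Path


def pair_by_stem(peptide_files, mrna_files):
    """Pair files that share a base name (assumed naming convention)."""
    mrna_by_stem = {Path(p).stem: Path(p) for p in mrna_files}
    pairs = []
    for pep in map(Path, peptide_files):
        if pep.stem in mrna_by_stem:
            pairs.append((pep, mrna_by_stem[pep.stem]))
    return pairs


def pal2nal_command(pep, mrna):
    """Build a pal2nal call that writes a FASTA codon alignment to stdout."""
    return ["pal2nal.pl", str(pep), str(mrna), "-output", "fasta"]


def run_pair(pep, mrna, output_dir):
    """Run pal2nal for one pair and save its stdout under output_dir."""
    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / Path(pep).name
    with open(out_path, "w") as handle:
        subprocess.run(pal2nal_command(pep, mrna), stdout=handle, check=True)
    return out_path
```

Peptide files without a matching mRNA file are skipped silently here; a real run would probably want to report them.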

prot_comp.py

This script performs a comparative analysis, at each site in the alignment, of aligned FASTA files containing two groups of species. The control file contains the options for the program, which can then be run from the command line.

Example

./prot_comp.py path_to_control.file
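To make the idea of a site-by-site comparison between two groups concrete, here is a small sketch that collects the residues each group shows at every alignment column and flags columns where the two groups share none. It does not reproduce prot_comp.py or its control file options: the function `compare_sites`, the report fields and the "disjoint" criterion are all assumptions chosen for illustration.

```python
def compare_sites(group_a, group_b):
    """Return, per alignment column, the residues observed in each group."""
    seqs = list(group_a) + list(group_b)
    if not seqs:
        return []
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all sequences must have the same aligned length")
    report = []
    for i in range(length):
        # Gap characters are ignored when collecting residues.
        a = {s[i] for s in group_a} - {"-"}
        b = {s[i] for s in group_b} - {"-"}
        report.append({
            "site": i + 1,  # 1-based column number
            "a": a,
            "b": b,
            # Both groups have residues here and share none of them.
            "disjoint": bool(a) and bool(b) and a.isdisjoint(b),
        })
    return report


if __name__ == "__main__":
    for row in compare_sites(["MKL", "MKI"], ["MRL", "MRL"]):
        print(row["site"], sorted(row["a"]), sorted(row["b"]), row["disjoint"])
```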

Blast Databases

Tools for making local BLAST databases and running BLAST. To run BLAST locally, we must first set up a BLAST database from a FASTA file of protein sequences. We can then take a FASTA file for some species and BLAST each of its protein sequences against the protein sequences in the database.

Example

Demonstrated in the overview file in the directory.
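For orientation, the two steps described above (build a protein database, then search it) map onto the NCBI BLAST+ programs `makeblastdb` and `blastp`. The sketch below only shows how those calls could be assembled from Python; it is not taken from the overview file. The helper names and file names are hypothetical, and tabular output (`-outfmt 6`) is an arbitrary choice.

```python
import subprocess


def makeblastdb_command(protein_fasta, db_name):
    """Build the command that creates a protein BLAST database."""
    return ["makeblastdb", "-in", protein_fasta, "-dbtype", "prot",
            "-out", db_name]


def blastp_command(query_fasta, db_name, out_path):
    """Build the command that searches each query protein against the db."""
    return ["blastp", "-query", query_fasta, "-db", db_name,
            "-out", out_path, "-outfmt", "6"]


def run_local_blast(protein_fasta, query_fasta, db_name, out_path):
    """Create the database, then run the search (requires BLAST+ on PATH)."""
    subprocess.run(makeblastdb_command(protein_fasta, db_name), check=True)
    subprocess.run(blastp_command(query_fasta, db_name, out_path), check=True)


if __name__ == "__main__":
    # Hypothetical file names, printed rather than executed.
    print(" ".join(makeblastdb_command("database_proteins.fasta", "local_db")))
    print(" ".join(blastp_command("species.fasta", "local_db", "hits.tsv")))
```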

Contacts

Programmer / Researcher Robby Boney
robby.boney@wsu.edu
