Skip to content

A collection of bioinformatics related scripts suited for specific taskes including automated multisequence alignment and protein comparisons such as accepted amino acid regions. Most tools are written in python 3.6.

License

master
Switch branches/tags
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 

Biotools


What is it?

A collection of bioinformatics related scripts suited for specific taskes including automated multisequence alignment and protein comparisons such as accepted amino acid regions. Most tools are written in python 3.6.

Installation

To use tools in this package, simply clone/download the repository and run the scripts on the commandline. Examples for each script are listed below demonstrating what options are available. In addition to this documentation scripts contain a description and usage example for aid.

Script Docs

is_aligned.py

Checks a directory or file of alignments in fasta format for alignment. Based on checking for dashes, this is a useful tool for verifying files before running them through analysis if there are large sets of alignment files.

Options

-path the directory or file to check

Example

./is_aligned.py -path /example/file.fasta

prot_comp.py

Computes comparisions on directory or file of alignments in fasta format between 2 groups of sequences. comparison options are specified in a control file.

Example

./prot_comp.py control.json

muscle_automate.py

Uses the MUSCLE command line program to perform alignment on a fasta file or directory of fasta files. (NOTE: the muscle program must be in your path for subprocess to call it properly, and it must be named 'muscle')

Example

./muscle_automate.py fasta_files/ output_dir/

pal2nal_automate.py

This script allows for running pal2nal on a directory or single file in order to automate the computation on a large set of data (REASON: pal2nal is a program used to create codon alignments from protein alignments and mrna sequences.)

Example

./pal2nal_automate.py peptide_alignments/ mrna_alignments/ output_dir/

prot_comp.py

This script allows for the comparative analysis of aligned fasta containing two groups of species at each site in the alignment. The control file contains options for the program which can the be simply run on the command line.

Example

./prot_comp.py path_to_control.file

Blast Databases

Tools for making local blast databases and running blast. In order to run blast locally we first must setup a blast database given a fasta file of protein sequences. After that we can then use a fasta file for some species and compute a blast with each protein sequence against the proteins sequences in the database.

Example

demonstrated in the overview file in the directory.

Contacts

Programmer / Researcher Robby Boney
robby.boney@wsu.edu

About

A collection of bioinformatics related scripts suited for specific taskes including automated multisequence alignment and protein comparisons such as accepted amino acid regions. Most tools are written in python 3.6.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published