fastanalizer

Installation

You can install fastanalize like any other Python package, using pip to download it from PyPI_:

$ pip install fastanalize

or using setup.py if you have downloaded the source package locally:

$ python setup.py build
$ sudo python setup.py install

Usage

After installation, to use `` fastanalizer`` just invoke it in the folder where the multifasta is located:

$ fastanalizer myfasta.fa

You may also define a title to your job:

$ fastanalizer --title "Example" myfasta.fa

It is possible to define the type of image output. Default is HTML.

$ fastanalizer --output SVG myfasta.fa

Example

Fastanalizer performs protein sequence analysis in 5 steps:

Fasta sequences analysis:

The first step analyzes basic metrics of the delivered multifast, such as quantity and size of sequences, shortest and longest sequences, standard deviation between sequences size and the amino acid distribution of the sequences. The results are saved in the general folder.

Functional domains analysis:

The second step is the search for functional domains in the delivered proteins. The analysis is done using the NCBI Batch Cd-Search. The results are saved in the domainsearch folder. Each file inside the folder has a maximum information of 2000 sequences. There is no limit to the total number of sequences and the execution time of this step varies according to the number of sequences provided.
Sequence trimming

The search results from step 2 are used to select the function domain by parsimony: the domain with the highest amount of specific hits is used to cut the given sequences. Sequences with incomplete domain are discarded.

Sequence alignment

The trimmed sequences are align. Fastanalizer utilizes MAFFT with automatic settings to align. The results are saved in the align folder. The sequences used for the alignment are in job.fasta. The supplied multifasta is the base.fasta and the align sequences is align.fasta. Sequences are rename for better presentation at the phylogenetic analysis. The file rename.txt has the from-to table. MAFFT output can be found in align-stderr.fasta
Phylogenetic analysis

A Neighboor joining tree created using BioPython Phylo module.

Changelog

Under construction

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
dist		dist
example		example
fastanalizer.egg-info		fastanalizer.egg-info
fastanalizer		fastanalizer
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.rst		README.rst
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dist

dist

example

example

fastanalizer.egg-info

fastanalizer.egg-info

fastanalizer

LICENSE

LICENSE

MANIFEST.in

MANIFEST.in

README.rst

README.rst

setup.cfg

setup.cfg

setup.py

setup.py

Repository files navigation

fastanalizer - Fast protein domain analysis tool

Installation

Usage

Example

Changelog

About

Releases

Packages

Languages

License

alezanatta/fastanalizer

Folders and files

Latest commit

History

Repository files navigation

fastanalizer - Fast protein domain analysis tool

Installation

Usage

Example

Changelog

About

Resources

License

Stars

Watchers

Forks

Languages