You can install fastanalize like any other Python package,
using pip
to download it from PyPI_:
$ pip install fastanalize
or using setup.py
if you have downloaded the source package locally:
$ python setup.py build $ sudo python setup.py install
After installation, to use `` fastanalizer`` just invoke it in the folder where the multifasta is located:
$ fastanalizer myfasta.fa
You may also define a title to your job:
$ fastanalizer --title "Example" myfasta.fa
It is possible to define the type of image output. Default is HTML.
$ fastanalizer --output SVG myfasta.fa
Fastanalizer performs protein sequence analysis in 5 steps:
Fasta sequences analysis:
The first step analyzes basic metrics of the delivered multifast, such as quantity and size of sequences, shortest and longest sequences, standard deviation between sequences size and the amino acid distribution of the sequences. The results are saved in the
general
folder.
Functional domains analysis:
The second step is the search for functional domains in the delivered proteins. The analysis is done using the NCBI Batch Cd-Search. The results are saved in the
domainsearch
folder. Each file inside the folder has a maximum information of 2000 sequences. There is no limit to the total number of sequences and the execution time of this step varies according to the number of sequences provided.Sequence trimming
The search results from step 2 are used to select the function domain by parsimony: the domain with the highest amount of specific hits is used to cut the given sequences. Sequences with incomplete domain are discarded.
Sequence alignment
The trimmed sequences are align. Fastanalizer utilizes MAFFT with automatic settings to align. The results are saved in the
align
folder. The sequences used for the alignment are injob.fasta
. The supplied multifasta is thebase.fasta
and the align sequences isalign.fasta
. Sequences are rename for better presentation at the phylogenetic analysis. The filerename.txt
has the from-to table. MAFFT output can be found inalign-stderr.fasta
Phylogenetic analysis
A Neighboor joining tree created using BioPython Phylo module.
Under construction