Annotate DNA sequences for Gene Ontology terms
EMBOSS and HMMER version 3+ must be installed to use HMMER2GO. See the installing dependencies wiki page for instructions how to install these programs (or see below for Linux installation).
Perl must be installed to use HMMER2GO, and there are a couple of external modules required. Please download and install the latest HMMER executables manually. The system versions available from the package manager are incompatible with the latest model formats.
The installation can be done with the following command (note that this requires git):
For Ubuntu/Debian as the OS:
apt-get install -y emboss zlib1g-dev libxml2-dev curl -L cpanmin.us | perl - git://github.com/sestaton/HMMER2GO.git
yum install -y EMBOSS zlib-devel libxml2-devel curl -L cpanmin.us | perl - git://github.com/sestaton/HMMER2GO.git
Alternatively, download the latest release and run the following command in the top directory:
If any Perl dependencies are listed after running this command, install them through the CPAN shell (or any method you like). Then build and install the package.
perl Makefile.PL make make test make install
Starting with a file of DNA sequences, we first want to get the longest open reading frame (ORF) for each gene and translate those sequences.
hmmer2go getorf -i genes.fasta -o genes_orfs.faa
Next, we search our ORFs for coding domains.
hmmer2go run -i genes_orfs.faa -d Pfam-A.hmm -o genes_orf_Pfam-A.tblout
Now we can map the protein domain matches to GO terms.
hmmer2go mapterms -i genes_orfs_Pfam-A.tblout -o genes_orfs_Pfam-A_GO.tsv --map
If we want to perform a statistical analysis on the GO mappings, it may be necessary to create a GAF file.
hmmer2go map2gaf -i genes_orfs_Pfam-A_GO_GOterm_mapping.tsv -o genes_orfs_Pfam-A_GO_GOterm_mapping.gaf -s 'Helianthus annuus'
For a full explanation of these commands, see the HMMER2GO wiki. In particular, see the tutorial page for a walk-through of all the commands. There is also an example script on the demonstration page to fetch data for Arabidopsis thaliana and run the full analysis.
Each subcommand can be executed with no arguments to generate a help menu. Alternatively, you may specify help message explicitly. For example,
hmmer2go help run
More information about each command is available by accessing the full documentation at the command line. For example,
hmmer2go run --man
Also, the HMMER2GO wiki is a source of online documentation.
Report any issues at the HMMER2GO issue tracker: https://github.com/sestaton/HMMER2GO/issues
LICENSE AND COPYRIGHT
Copyright (C) 2014-2018 S. Evan Staton
This program is distributed under the MIT (X11) License, which should be distributed with the package. If not, it can be found here: http://www.opensource.org/licenses/mit-license.php