-
Download Prot2003-2014 fasta (ftp://ftp.ncbi.nih.gov/pub/COG/COG2014/data/prot2003-2014.fa.gz) to a COG_DB_directory
-
Gunzip prot (gunzip prot2003-2014.fa.gz)
-
Create Blast DB: (makeblastdb -dbtype prot -out db2003-2014 -title db2003-2014 -max_file_sz 300000000 -in prot2003-2014.fa -parse_seqids)
-
Download cog2003-2014.csv (ftp://ftp.ncbi.nih.gov/pub/COG/COG2014/data/cog2003-2014.csv) in the COG_DB_directory
-
Download cognames2003-2014.tab (ftp://ftp.ncbi.nih.gov/pub/COG/COG2014/data/cognames2003-2014.tab) in COG_DB_directory
-
Download fun2003-20014.tab (ftp://ftp.ncbi.nih.gov/pub/COG/COG2014/data/fun2003-2014.tab) in COG_DB_directory
-
Git clone in a workspace dir: (git clone https://github.com/aquacen/blast_cog.git)
-
Enter in the workspace dir (cd $HOME/blast_cog)
-
Run cog shell script with parameters:
./cog FASTA_WITH_AMINOACIDS COG_DB_directory
Example:
./cog fno.faa ../cogdb
-
NCBI-Blast+ Stand alone (makeblastdb and blastp): ftp://ftp.ncbi.nlm.nih.gov/blast/executables/blast+/LATEST/
-
Perl: https://www.perl.org/
-
GNU Parallel - The Command-Line Power Tool O. Tange (2011): GNU Parallel - The Command-Line Power Tool, login: The USENIX Magazine, February 2011:42-47.
-
To increase number of threads in BlastP update cog shell script THREADS variable
-
To remove parallel dependence see comments on cog shell script