Skip to content

Transforms a BLASTX output from UNIPROT database to any format, such as GI, RefSeq, GO, etc.

Notifications You must be signed in to change notification settings

estebanpw/uniprot2any

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 

Repository files navigation

uniprot2any

This is a simple C++ program to convert the IDs of matches found with BLASTX on a UNIPROT database search to any other ID provided by official mapping files.

Usage is very simple, download the mapping file from:

ftp://ftp.uniprot.org/pub/databases/uniprot/current_release/knowledgebase/idmapping

(At time of writing, it is the file named "idmapping.dat.gz")

Unzip the file and then you can run the program as follows from within the src folder:

make all

../bin/uniprot2any idmapping.dat input.blastx GI output.blastx

where idmapping.dat corresponds to the file you just unzipped and input.blastx is your blastx file with uniprot IDs. The 3rd field, "GI" is used to specify to which type you wish to convert, e.g. you could also run:

../bin/uniprot2any idmapping.dat input.blastx KEGG output.blastx

or any other. The last file is the output file with the converted IDs. Those IDs that are not found in the mapping file are left untouched.

ATTENTION IF YOU ARE USING MEGAN

It is most convenient to use the GI mapping (first command line above) and then utilize the file mappings provided by MEGAN, since these will take care of the tags accordingly. You can download the mappings from GI to any database here:

http://ab.inf.uni-tuebingen.de/data/software/megan6/download/welcome.html

Happy converting!

About

Transforms a BLASTX output from UNIPROT database to any format, such as GI, RefSeq, GO, etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published