Converts Ensembl, Uniprot, and HGNC IDs to Entrez Gene Id
Switch branches/tags
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
PyEntrezId
dist
tests
.gitignore
.travis.yml
CONTRIBUTING.rst
LICENSE
MANIFEST
README.rst
index.html
requirements.txt
setup.cfg
setup.py

README.rst

PyEntrezId

Python package to convert between various gene or protein IDs
https://travis-ci.org/lwgray/pyEntrezId.svg?branch=master https://coveralls.io/repos/github/lwgray/pyEntrezId/badge.svg?branch=master

Summary

This is the first package I am releasing into the wild. Any feedback would be greatly appreciated! I hope this project can be used as an example for beginners on how to structure/code a simple bioinformatics tool.

I created this package because I needed a simple tool to convert between the multitude of IDs used to identify genes, proteins, etc... This is important because databases hosted by various scientific institutions (NIH, EMBL, etc) sometimes have different nomenclature to describe the same exact thing(gene, protein, dna, rna, etc).

Links to Discussion

Reddit: <https://www.reddit.com/r/bioinformatics/comments/4icres/new_bioinformatics_python_package_to_easily/>

Quick Start

$ pip install --upgrade pyEntrezId

Examples

Convert Ensemble Transcript Gene Id to Entrez Gene Id

from PyEntrezId import Conversion

EnsemblId = 'ENST00000407559'
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
EntrezId = Id.convert_ensembl_to_entrez(EnsemblId)
# Returns a string
print(EntrezId)

Convert HGNC ID to Entrez Gene Id

from PyEntrezId import Conversion

# HGNCID can be just the number or 'HGNC:9425'
HGNCID = 9245
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
EntrezId = Id.convert_hgnc_to_entrez(HGNCID)
# Returns a dictionary containing Symbol and Entrez Id
print EntrezID

Convert Entrez Gene Id to Uniprot ID

from PyEntrezId import Conversion

EntrezID = 39
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
UniProtId = Id.convert_entrez_to_uniprot(EntrezID)
# Returns a string
print UniProtId

Convert Uniprot Id to Entrez Gene Id

from PyEntrezId import Conversion

UniProtId = 'Q9BWD1'
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
EntrezID = Id.convert_uniprot_to_entrez(UniProtId)
# Returns a string
print EntrezID

Convert Accession Id to Taxonomy Id

from PyEntrezId import Conversion

AccessionId = 'AC131209'
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
TaxID = Id.convert_accesion_to_taxid(AccesionId)
# Returns a string
print TaxID

Convert Gene Symbol to Entrez Gene Id

from PyEntrezId import Conversion

# HGNCID can be just the number or 'HGNC:9425'
Symbol = 'CDK2'
# include your email address
Id = Conversion('dummyemail@dummybunny.info')
EntrezId = Id.convert_symbol_to_entrezid(Symbol)
# Returns a dictionary containing Symbol and Entrez Id
print EntrezID

Contributing

Contributions to this library are always welcome and highly encouraged.

See CONTRIBUTING for more information on how to get started.

License

The MIT License (MIT) - See LICENSE for more information.