HistoneDB

A database for all histone proteins in NR organized by their known non-allelic protein isoforms, called variants. This resource can be used to understand how changes in histone variants affect structure, complex formation, and nucleosome function. For more information, please read our paper (citation below).

The database can be accessed at http://www.ncbi.nlm.nih.gov/projects/HistoneDB2.0/

Requirements

Python 3.8
Required python packages are specified in requirements/ (use "pip install -r requirements/full.txt"). They include:
HMMER 3.1b2
BLAST+ v 2.2.26
EMBOSS v6.5.7
MUSCLE v3.8.31
ClustalW2 v2.1 All executables must be present in the bin dir of virual environment.

Setup

If you want to test the server on your own machine, you must make sure have all of the dependencies listed above and follow these steps.

Create MySQL database, and store the login information in the file HistoneDB/NCBI_databse_info.txt, which is formatted in the following way (key = value):

name = DB_NAME
user = DB_USER
password = DB_PASS
host = DB_HOST
port = DB_PORT
SECRET_KEY = DJANGO_SECRET_KEY

#Reference another file with same paramters, useful if it needs be hidden
file = /path/to/other/file.txt

#Root of site when accessed from a browser
STATIC_URL = /static/

If running on the mweb, these values will already be set. Finally, make sure the database has the correct charset:

ALTER DATABASE DB_NAME CHARACTER SET utf8 COLLATE utf8_general_ci

Migrate Django models into database

python manage.py migrate

Build NCBI Taxonomy with djangophylocore

python manage.py buildncbi
python manage.py loadtaxonomy
python manage.py buildtaxonomytoc

WARNING: This will download the entire NCBI taxonomy database and load into the database, which can take a long time.

Classify sequences in NR

python manage.py buildvariants

WARNING: This will by default download the entire nr database and classify all sequences in the nr. If you want to build the HistoneDB using a smaller database of proteins in FASTA format using the NR formatted header (">gi|UNIQUE_GI|anything description [TAXONOMY_NAME]"), run the following command:

python manage.py buildvariants --db small_database.fasta

Build trees from seed sequences

python manage.py buildtrees

Build organism sunbursts for each variant

python manage.py buildsunburst

Build Blast database for custom sequence analysis

python manage.py buildblastdb

Build variantinfo

python manage.py buildvariantinfo

Build GFF sequence features for variants

python manage.py buildseedinfo

Update

If youe need to update or rebuild the database, e.g. if a new variant is discovered, you must rerun steps 4-8 adding the --force parameter after each command to make sure everything gets updated.

Adding new variants

Collect representative sequences of the new variant and create seed alignments using any method you wish. Please read our paper for more info on how we collected the sequences and aligned them.
Place seed alignments in appropriate static directory:

static/browse/seeds/[HISTONE_TYPE]/[VARIANT].fasta

Follow update instructions
If this is a new variant, please let us know by creating a pull request, a new issue (enhancement), or emailing us.

Run

You have several options to run the Django server. The easiest way is to run it through manage.py, specifying a port (we use port 8080 in the example):

python manage.py runserver 8080

For deployment, we use FastCGI on the NCBI webservers. While this will be deprecated in the next version of Django, it is what NCBI allows. For more info, please read https://docs.djangoproject.com/en/1.8/howto/deployment/fastcgi/

Cite

Coming soon.

Acknowledgements

Eli Draizen
Alexey K. Shaytan
Anna Panchenko
Leonardo Marino-Ramirez
David Landsman
Paul Talbert

Name		Name	Last commit message	Last commit date
Latest commit History 605 Commits
HistoneDB		HistoneDB
browse		browse
cur_aln_tools		cur_aln_tools
djangophylocore		djangophylocore
paper		paper
requirements		requirements
static		static
system_setup		system_setup
templates		templates
tools		tools
.Rhistory		.Rhistory
.gitignore		.gitignore
HistoneDB_schema.pdf		HistoneDB_schema.pdf
README.md		README.md
manage.py		manage.py
reinit_histdb.sh		reinit_histdb.sh
reinit_histdb_swissprot.sh		reinit_histdb_swissprot.sh
update_variants.sh		update_variants.sh
update_variants_swissprot.sh		update_variants_swissprot.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HistoneDB

Requirements

Setup

Update

Adding new variants

Run

Cite

Acknowledgements

About

Releases

Packages

Contributors 8

Languages

ncbi/histonedb

Folders and files

Latest commit

History

Repository files navigation

HistoneDB

Requirements

Setup

Update

Adding new variants

Run

Cite

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 8

Languages

Packages