GitHub - zeehio/cll-chromatin: Data analysis scripts for Rendeiro et al., 2016 (doi:10.1038/ncomms11938)

Chromatin accessibility maps of chronic lymphocytic leukemia identify subtype-specific epigenome signatures and transcription regulatory networks

André F. Rendeiro^*, Christian Schmidl^*, Jonathan C. Strefford^*, Renata Walewska, Zadie Davis, Matthias Farlik, David Oscier, Christoph Bock. Chromatin accessibility maps of chronic lymphocytic leukemia identify subtype-specific epigenome signatures and transcription regulatory networks. Nat. Commun. 7:11938 doi: 10.1038/ncomms11938 (2016).

^*Shared first authors

Paper: http://dx.doi.org/10.1038/ncomms11938

Website: cll-chromatin.computational-epigenetics.org

This repository contains scripts used in the analysis of the data in the paper.

Manuscript

The manuscript is written in Scholarly Markdown, so you need Scholdoc to render it as a PDF, reStructuredText, Word or HTML document.

A rendered version is available here.

You can see the raw manuscript here along with the figures.

To render the pdf version of the manuscript, run:

make manuscript

This generally requires a full LaTeX installation.

Analysis

On the paper website you can find most of the output of the whole analysis.

Here are a few steps needed to reproduce it (more than I'd want to, I admit):

Clone the repository: git clone git@github.com:epigen/cll-chromatin.git
Install required software for the analysis: make requirements or pip install -r requirements.txt

If you wish to reproduce the processing of the raw data (access has to be requested through EGA), run these steps:

Apply for access to the raw data from EGA.
Download the data locally.
Prepare Looper configuration files similar to these that fit your local system.
Run samples through the pipeline: make preprocessing or looper -c metadata/project_config_file.yaml
Get external files (genome annotations mostly): make external_files or use the files in the paper website (external folder).
Run the analysis: make analysis
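
The make targets named in the steps above can be strung together in a small driver script. The following is only a sketch, not part of the repository: the step labels and the `plan` and `run` helpers are hypothetical names, and the commands are copied from this README. It defaults to a dry run that prints the commands without executing them.

```python
import shlex
import subprocess

# Commands copied from the steps in this README, in the order they are listed.
# The labels on the left are illustrative names, not part of the repository.
STEPS = [
    ("requirements", "make requirements"),
    ("preprocessing", "make preprocessing"),
    ("external_files", "make external_files"),
    ("analysis", "make analysis"),
]


def plan(skip=()):
    """Return (label, argv) pairs in order, leaving out any skipped labels."""
    return [(label, shlex.split(cmd)) for label, cmd in STEPS if label not in skip]


def run(skip=(), dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    for label, argv in plan(skip):
        print(f"[{label}] {' '.join(argv)}")
        if not dry_run:
            # check=True raises CalledProcessError at the first failing step.
            subprocess.run(argv, check=True)
```

Calling `run(skip=("preprocessing",))` from the repository root lists the remaining steps, which is a plausible starting point if you do not have EGA access; pass `dry_run=False` to actually execute them.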

Additionally, processed data (bigWig and narrowPeak files, together with a gene expression matrix) are available from GEO under accession number GSE81274.
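
To locate the GEO record programmatically, the accession number can be turned into URLs. This is a sketch that assumes GEO's usual public layout (an accession landing page, plus a series directory in which the last three digits of the accession are replaced by "nnn"). The helper name `geo_series_urls` is hypothetical, and this README does not state which files sit in the supplementary directory.

```python
def geo_series_urls(accession):
    """Build GEO URLs for a series accession such as 'GSE81274'.

    Assumes GEO's conventional URL layout; verify against the GEO site.
    """
    if not (accession.startswith("GSE") and accession[3:].isdigit()):
        raise ValueError(f"not a GEO series accession: {accession!r}")
    number = accession[3:]
    # Series are grouped by replacing the last three digits with 'nnn',
    # e.g. GSE81274 falls under GSE81nnn.
    stub = "GSE" + number[:-3] + "nnn"
    return {
        "landing_page": f"https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc={accession}",
        "supplementary_files": f"https://ftp.ncbi.nlm.nih.gov/geo/series/{stub}/{accession}/suppl/",
    }


urls = geo_series_urls("GSE81274")
print(urls["landing_page"])
print(urls["supplementary_files"])
```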

If you wish to reproduce the plots from the analysis you can, in principle:

run python src/analysis.py

Not all parts of the analysis can be run as is, though. The TF network inference is based on an R package (PIQ) whose runs are really hard to script in a system-independent way.

