Skip to content

A method to classify different cell types from their epigenomic profiles and detect genomic regions determining this classification

License

Notifications You must be signed in to change notification settings

david-juan/ChromDet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

89 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChromDet

A method to classify different cell types from their epigenomic profiles and detect genomic regions determining this classification. This repository contains a modified version of S3det (Rausell et al. PNAS 2010) adapted to perform epigenomic analyses of sample segregation into different cell types. These modifications include an update to the boost library v1.64 (http://www.boost.org/), a more informative verbose output, and minor bug fixing. It also contains several scripts aimed to facilitate this analysis by providing an interface with S3det and retrieving easy to use output files containing the obtained Chromatin Sample Space and the Chromatin Determining Regions (CDRs).

Installation

ChromDet has been tested on UNIX/Linux and MacOS systems.

Directories

Steps

2) Uncompress ChromDet

  • ./unzip ChromDet-master.zip

3) Install S3det

  • cd ChromDet/S3Det_modified/
  • make

Running S3det

Files

Output Files

Main output files
  • Chromatin space file. A tsv file containing the coordinates of the samples in the Chromatin Space, as well as to what cluster tehy belong
  • Chromatin Determinants Regions (CDRs) File. A tsv file containing the genomic coordinates of the CDRs, as well as to the split of which clusters they contribute and their pattern of collapsed chromatin states in the different samples.
Other output files
  • Whole genome patterns of original chromatin states. A tsv file containing the genomic coordinates of every region of the genome characterized by a different pattern of collapsed chromatin states along different samples than their neighbouring regions.
  • Filtered regions with patterns of collapsed chromatin states. A tsv containing the genomic coordinates of every region of the genome with a pattern of collapsed chromatin states along different samples considered informative.
  • Fasta file of the filtered regions with patterns of collapsed chromatin states. A fasta file, where regions with patterns of collapsed chromatin states along different samples are represented as gap-free aligned sequences of chromatin states for every sample.
  • S3det raw resuls. File with the raw output for S3det

External software requirements

Steps

1) Go to scripts dir

  • cd ChromDet/scripts

2) Run the complete analysis (minimal)

  • ./run_S3det_analysis.pl -d <dir containing the bedfiles>

3) Run the complete analysis (recommended)

  • ./run_S3det_analysis.pl -d <dir containing the bedfiles> -c <file with the equivalences between chromatin states and their collapse> -a <file with human meaningful names for the samples> -s <path to the directory containining S3det> -v

Toy Example

  • ./run_S3det_analysis.pl -d ../test/ -c ../test/States_collapse.txt -a ../test/Samples_beds.tsv -s ../S3Det_modified/ -v

How to cite

Automatic identification of informative regions with epigenomic changes associated to hematopoiesis Enrique Carrillo de Santa Pau, David Juan, Vera Pancaldi, Felipe Were, Jose Ignacio Martin-Subero, Daniel Rico, Alfonso Valencia bioRxiv 082917; doi: https://doi.org/10.1101/082917

About

A method to classify different cell types from their epigenomic profiles and detect genomic regions determining this classification

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages