CAWS: CUT&Tag Analysis Workflow System

A Snakemake-based pipeline for comprehensive CUT&Tag data analysis



Overview

CAWS (CUT&Tag Analysis Workflow System) is a comprehensive Snakemake pipeline for analyzing CUT&Tag (Cleavage Under Targets and Tagmentation) sequencing data. CUT&Tag is an advanced epigenomic profiling method that maps protein–DNA interactions and chromatin modifications with greater sensitivity and specificity than traditional ChIP-seq approaches.


Why Use CAWS?

  • CUT&Tag Optimized – Purpose-built for CUT&Tag data with specialized QC and peak-calling strategies
  • Flexible Input – Supports single-end, paired-end, and mixed datasets in the same analysis
  • Comprehensive Analysis – End-to-end workflow from raw reads to interactive visualizations
  • Dual Peak Calling – Calls peaks with both SEACR and MACS3 for more robust detection
  • Interactive Reports – HTML dashboards with Plotly-based visualizations
  • Reproducible – Conda-based environment management for consistent results

Target Audience

CAWS is designed for:

  • Bioinformaticians analyzing CUT&Tag datasets
  • Researchers studying chromatin biology or epigenomics
  • Laboratories seeking standardized workflows
  • Users requiring comprehensive QC and visualization

Prerequisites

Required Software

  • Snakemake ≥9.0.0 (tested with 9.8.1)
  • Conda ≥24.7.1 or Mamba ≥1.5.0 (recommended)
  • Python ≥3.7
  • R ≥4.0

Important Notes:

  • Snakemake 9.0+ requires Conda ≥24.7.1 for environment management
  • Mamba is recommended as the conda frontend for faster environment creation (already configured in the profile)
  • If you encounter conda version errors, see Troubleshooting section below
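
To confirm these minimums up front, a quick version check is (mamba only if you use it):

conda --version        # expect ≥24.7.1
mamba --version        # optional; recommended frontend
snakemake --version    # expect ≥9.0.0
python --version       # expect ≥3.7
R --version | head -1  # expect ≥4.0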

System Requirements

  • Storage: ~50 GB free space (typical dataset)
  • Memory: ≥8 GB RAM (16 GB+ recommended)
  • CPU: ≥4 cores
  • OS: Linux or macOS (cluster execution recommended)

Input Data Requirements

  • FASTQ files (gzipped) — supports both single-end and paired-end sequencing data
  • Reference genome (FASTA with Bowtie2 index)
  • Annotation file (GTF, optional for TSS enrichment)
  • Blacklist regions (BED)
  • Sample metadata (TSV)
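
As a quick pre-flight check (file names below are illustrative), confirm the required inputs exist and are readable:

# Sanity-check input files before launching; paths are examples
for f in samplesheet.tsv /path/to/hg38.fa /path/to/hg38.fa.fai \
         /path/to/hg38-blacklist.v2.bed /path/to/hg38.chrom.sizes; do
  [ -r "$f" ] && echo "OK: $f" || echo "MISSING: $f"
done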

Quick Start

1. Clone and Set Up

# Clone the repository
git clone https://github.com/ratan-lab/CAWS.git
cd CAWS

Note: CAWS uses conda environments managed automatically by Snakemake. No need to create a master environment.


2. Configure SLURM Profile (for HPC execution)

Edit profiles/slurm/config.yaml to set your SLURM allocation and partition:

default-resources:
  - runtime=180
  - mem_mb=5000
  - threads=1
  - slurm_account=your-allocation-name  # e.g., ratan
  - slurm_partition=your-partition      # e.g., standard

Skip this step if running locally (not on a cluster).


3. Prepare Analysis Configuration Files

Sample Sheet (samplesheet.tsv)

The pipeline supports paired-end, single-end, and mixed datasets. Specify single-end samples by using -, NA, or similar values in the read2 column:

sampleID	condition	group	read1	read2
sample1_PE	Treatment	group1	/path/to/sample1_R1.fastq.gz	/path/to/sample1_R2.fastq.gz
sample2_SE	Treatment	group2	/path/to/sample2.fastq.gz	-
sample3_PE	Control	group1	/path/to/sample3_R1.fastq.gz	/path/to/sample3_R2.fastq.gz
sample4_SE	Control	group2	/path/to/sample4.fastq.gz	NA

Single-end indicators: Use any of these values in the read2 column to mark a sample as single-end: -, NA, na, n/a, null, none, nan, nil, empty, 0, false, missing, absent, single, or se.
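
As a quick sanity check of the sample sheet (a sketch assuming the five-column layout above), you can flag malformed rows and list which samples will be treated as single-end:

# Verify every row has exactly 5 tab-separated columns
awk -F'\t' 'NF != 5 {print "Bad row " NR ": " $0}' samplesheet.tsv

# List samples whose read2 marks them as single-end (common indicators only)
awk -F'\t' 'NR > 1 && ($5 == "-" || tolower($5) ~ /^(na|n\/a|none|null|se)$/) {print $1}' samplesheet.tsv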

Configuration File (config.json)

Edit config.json with your file paths and parameters. See the Configuration Details section below for a complete description of all parameters.


4. Run the Analysis

Local Execution (small datasets)

snakemake --cores 8 --use-conda --configfile config.json
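
Before committing to a full run, a dry run previews the jobs Snakemake would schedule without executing anything:

# Preview the execution plan only
snakemake --cores 8 --use-conda --configfile config.json --dryrun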

SLURM Cluster Execution (recommended for large datasets)

Option 1: Simple Execution with Profile

Submit as a SLURM job:

# Create a submission script
cat > run_caws.sh << 'EOF'
#!/bin/bash
#SBATCH -A your-allocation      # Change to your allocation name
#SBATCH -p standard             # Partition for controller job
#SBATCH -c 1                    # Controller only needs 1 core
#SBATCH -t 48:00:00             # Max runtime for controller
#SBATCH --mem=4G                # Memory for controller
#SBATCH -o caws_%j.out          # Output log
#SBATCH -e caws_%j.err          # Error log
#SBATCH -J caws_pipeline        # Job name

# Load required modules
module load miniforge/24.11.3-py3.12 snakemake/9.8.1
conda env create -f envs/snakemake-runner.yaml
conda activate caws-snakemake

# Run pipeline with profile
snakemake \
  --profile profiles/slurm \
  --configfile config.json
EOF

# Submit the job
sbatch run_caws.sh

Option 2: Advanced Execution with Custom Settings

For more control over execution and conda environment caching:

#!/bin/bash
#SBATCH -A your-allocation
#SBATCH -p standard
#SBATCH -c 1
#SBATCH -t 48:00:00
#SBATCH --mem=4G
#SBATCH -o caws_%j.out
#SBATCH -e caws_%j.err
#SBATCH -J caws_pipeline

# Load required modules
module load miniforge/24.11.3-py3.12 snakemake/9.8.1
conda env create -f envs/snakemake-runner.yaml
conda activate caws-snakemake

# Run pipeline with explicit parameters
snakemake \
  --profile /path/to/CAWS/profiles/slurm \
  --snakefile /path/to/CAWS/workflow/Snakefile \
  --configfile config.json \
  --cores 1 \
  --use-conda \
  --conda-prefix /scratch/your-user/conda-envs \
  --latency-wait 60 \
  --rerun-incomplete \
  --keep-going \
  --jobs 100

Interactive Execution:

# Load modules
module load miniforge/24.11.3-py3.12 snakemake/9.8.1
conda env create -f envs/snakemake-runner.yaml
conda activate caws-snakemake

# Run in the current session (screen or tmux recommended)
snakemake --profile profiles/slurm --configfile config.json

Notes:

  • --conda-prefix caches conda environments for faster subsequent runs
  • --jobs 100 overrides the profile setting (default: 50)
  • --notemp can be added to preserve temporary files for debugging
  • Use absolute paths if running from a different directory

5. View Results

The final interactive report is written to reports/cutntag_analysis_report.html inside your configured output directory (outdir).

Example Analysis with Public Data

This example demonstrates a complete CAWS analysis using publicly available CUT&Tag data. We'll analyze H3K4me3 (active promoter mark) samples from a published study.

Dataset Information

  • Study: CUT&Tag profiling of H3K4me3 in human K562 cells
  • Data Source: NCBI GEO (Gene Expression Omnibus)
  • Sample Size: 4 samples (2 H3K4me3 + 2 IgG controls)
  • Sequencing: Paired-end Illumina
  • Reference: Human (hg38)

Step 1: Download Example Data

# Create working directory
mkdir -p cutntag_example && cd cutntag_example

# Download FASTQ files from SRA using SRA Toolkit
# Install SRA Toolkit if needed: conda install -c bioconda sra-tools

# H3K4me3 treatment samples (example accessions)
fastq-dump --split-files --gzip SRR13577661
fastq-dump --split-files --gzip SRR13577662

# IgG control samples
fastq-dump --split-files --gzip SRR13577663
fastq-dump --split-files --gzip SRR13577664

# This will create files like:
# SRR13577661_1.fastq.gz, SRR13577661_2.fastq.gz (treatment replicate 1)
# SRR13577662_1.fastq.gz, SRR13577662_2.fastq.gz (treatment replicate 2)
# etc.

Note: Replace the SRR accessions above with actual accessions from your chosen GEO dataset. To find CUT&Tag datasets, visit NCBI GEO and search for "CUT&Tag".
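
If fastq-dump is slow, newer SRA Toolkit releases pair prefetch with fasterq-dump as a faster alternative; note that fasterq-dump does not compress its output, so gzip the files afterwards:

# Faster download path (same accessions as above)
prefetch SRR13577661
fasterq-dump --split-files SRR13577661
gzip SRR13577661_*.fastq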

Step 2: Prepare Reference Files

# Download human hg38 reference genome
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.fa.gz
gunzip hg38.fa.gz

# Build Bowtie2 index
bowtie2-build hg38.fa hg38_index

# Index FASTA file
samtools faidx hg38.fa

# Download blacklist regions
wget https://github.com/Boyle-Lab/Blacklist/raw/master/lists/hg38-blacklist.v2.bed.gz
gunzip hg38-blacklist.v2.bed.gz

# Download chromosome sizes
wget https://hgdownload.soe.ucsc.edu/goldenPath/hg38/bigZips/hg38.chrom.sizes

# Download GTF annotation for TSS enrichment (optional)
wget https://ftp.ensembl.org/pub/release-110/gtf/homo_sapiens/Homo_sapiens.GRCh38.110.gtf.gz
gunzip Homo_sapiens.GRCh38.110.gtf.gz

# E. coli reference (for spike-in normalization)
wget ftp://ftp.ncbi.nlm.nih.gov/genomes/all/GCF/000/005/845/GCF_000005845.2_ASM584v2/GCF_000005845.2_ASM584v2_genomic.fna.gz
gunzip GCF_000005845.2_ASM584v2_genomic.fna.gz
bowtie2-build GCF_000005845.2_ASM584v2_genomic.fna ecoli_index
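
A successful bowtie2-build leaves six .bt2 files per index prefix (very large genomes produce .bt2l instead); listing them is a quick way to catch a failed build:

# Confirm both indexes were built
ls hg38_index*.bt2
ls ecoli_index*.bt2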

Step 3: Create Sample Sheet

Create samplesheet.tsv:

sampleID	condition	group	read1	read2
H3K4me3_rep1	Treatment	H3K4me3	/path/to/SRR13577661_1.fastq.gz	/path/to/SRR13577661_2.fastq.gz
H3K4me3_rep2	Treatment	H3K4me3	/path/to/SRR13577662_1.fastq.gz	/path/to/SRR13577662_2.fastq.gz
IgG_rep1	Control	H3K4me3	/path/to/SRR13577663_1.fastq.gz	/path/to/SRR13577663_2.fastq.gz
IgG_rep2	Control	H3K4me3	/path/to/SRR13577664_1.fastq.gz	/path/to/SRR13577664_2.fastq.gz

Important: Replace /path/to/ with absolute paths to your downloaded FASTQ files.
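
If the FASTQ files sit in your working directory, readlink -f prints the absolute paths to paste into the sheet (GNU coreutils; on macOS, realpath works similarly):

# Print absolute paths for the downloaded FASTQ files
for f in SRR*_[12].fastq.gz; do readlink -f "$f"; done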

Step 4: Create Configuration File

Create config.json:

{
    "samplesheet": "samplesheet.tsv",
    "reference_fa": "/path/to/hg38.fa",
    "bt2_idx": "/path/to/hg38_index",
    "reference_fai": "/path/to/hg38.fa.fai",
    "chromosome_file": "/path/to/hg38.chrom.sizes",
    "blacklist": "/path/to/hg38-blacklist.v2.bed",
    "ecoli_reference": "/path/to/GCF_000005845.2_ASM584v2_genomic.fna",
    "ecoli_bt2_idx": "/path/to/ecoli_index",
    "gtf_file": "/path/to/Homo_sapiens.GRCh38.110.gtf",
    "trim_adapters": true,
    "igg_control": true,
    "mt_chrom": "chrM",
    "dedup": true,
    "minquality": 10,
    "fragment_min": 0,
    "fragment_max": 700,
    "genome_size": "hs",
    "seacr_qvalue": 0.01,
    "macs3_qvalue_with_control": 0.05,
    "macs3_qvalue_no_control": 0.01,
    "heatmap_window": 3000,
    "outdir": "results"
}
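
JSON is strict about quoting and trailing commas, so it is worth validating the file before launch:

# Validate JSON syntax (prints nothing from the file itself on success)
python -m json.tool config.json > /dev/null && echo "config.json OK"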

Step 5: Run the Pipeline

Local execution (for testing with small datasets):

# Clone CAWS
git clone https://github.com/ratan-lab/CAWS.git

# Run pipeline
snakemake \
  --snakefile CAWS/workflow/Snakefile \
  --configfile config.json \
  --cores 8 \
  --use-conda

SLURM cluster execution (recommended):

# Edit CAWS/profiles/slurm/config.yaml to set your account and partition

# Submit as batch job
cat > run_example.sh << 'EOF'
#!/bin/bash
#SBATCH -A your-allocation
#SBATCH -p standard
#SBATCH -c 1
#SBATCH -t 12:00:00
#SBATCH --mem=4G
#SBATCH -o example_%j.out
#SBATCH -J caws_example

module load miniforge
module load snakemake/9.8.1

snakemake \
  --snakefile CAWS/workflow/Snakefile \
  --profile CAWS/profiles/slurm \
  --configfile config.json
EOF

sbatch run_example.sh

Step 6: Expected Results

Runtime: Approximately 2-4 hours on an HPC cluster with 8 cores, depending on read depth

Output Structure:

results/
├── fastqc/                          # Quality control reports
├── trimmed/                         # Adapter-trimmed reads (if enabled)
├── aligned/                         # Bowtie2 alignments (BAM files)
├── bedgraph/                        # Coverage tracks
├── peaks/
│   ├── seacr/                      # SEACR peak calls
│   └── macs3/                      # MACS3 peak calls
├── frip/                            # Fraction of Reads in Peaks
├── heatmaps/                        # Peak visualization heatmaps
└── reports/
    └── cutntag_analysis_report.html # Interactive HTML report

Key Outputs:

  1. Peak files: results/peaks/seacr/H3K4me3_rep1.stringent.bed
  2. Coverage tracks: results/bedgraph/H3K4me3_rep1.bedgraph
  3. QC metrics: Alignment rates, FRiP scores, TSS enrichment
  4. Final report: results/reports/cutntag_analysis_report.html

Expected QC Metrics for H3K4me3:

  • Alignment rate: >80% (high-quality CUT&Tag)
  • FRiP score: 30-60% (active histone marks typically have high FRiP)
  • TSS enrichment: >5 (H3K4me3 is enriched at promoters)
  • Fragment size: Peak at ~150-200 bp (nucleosomal)
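
To spot-check the alignment rate outside the report, samtools flagstat on a finished BAM reports the mapped fraction directly (directory per the layout above; the exact BAM name may differ):

samtools flagstat results/aligned/H3K4me3_rep1.bam | grep 'mapped ('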

Step 7: Explore Results

Open the HTML report in a web browser:

# On local machine
open results/reports/cutntag_analysis_report.html

# Or copy from HPC to local
scp user@hpc:~/cutntag_example/results/reports/cutntag_analysis_report.html .

The interactive report includes:

  • Sample QC summary table
  • Alignment statistics
  • Fragment size distributions
  • Peak calling summary
  • Heatmaps of signal at peaks
  • TSS enrichment plots

Configuration Details

Click to expand configuration documentation
  • samplesheet — Path to TSV file describing samples (columns: sampleID, condition, group, read1, read2)
  • reference_fa — Reference genome FASTA file
  • bt2_idx — Bowtie2 index prefix for the reference genome
  • reference_fai — FASTA index file
  • chromosome_file — List of chromosomes to include
  • blacklist — BED file of genomic regions to exclude
  • ecoli_reference — E. coli reference genome (for contamination assessment)
  • ecoli_bt2_idx — Bowtie2 index prefix for E. coli
  • trim_adapters — true/false; enable adapter trimming via Trim Galore
  • igg_control — true/false; use IgG controls as background for peak calling
  • mt_chrom — Name of mitochondrial chromosome (e.g., "chrM")
  • dedup — true/false; remove PCR duplicates before peak calling
  • minquality — Minimum mapping quality threshold
  • fragment_min, fragment_max — Fragment size range for alignments
  • genome_size — Genome size for MACS3 peak calling: "hs" (human), "mm" (mouse), "ce" (C. elegans), "dm" (Drosophila), or a numeric value (e.g., 2.7e9)
  • seacr_qvalue, macs3_qvalue_with_control, macs3_qvalue_no_control — Statistical thresholds for SEACR/MACS3
  • heatmap_window — Window size (bp) around peaks for heatmaps
  • gtf_file — Path to GTF file for TSS enrichment (optional)
  • outdir — Output directory for analysis results

Example config.json:

{
    "samplesheet": "samplesheet.tsv",
    "reference_fa": "/share/gatk_bundle/hg38/hs38DH.fa",
    "bt2_idx": "/share/gatk_bundle/hg38/hs38DH",
    "reference_fai": "/share/gatk_bundle/hg38/hs38DH.fa.fai",
    "chromosome_file": "/share/gatk_bundle/hg38/hs38DH.primary.txt",
    "blacklist": "/share/gatk_bundle/hg38/blacklist.bed.gz",
    "ecoli_reference": "/share/gatk_bundle/ecoli/MG1655.fa",
    "ecoli_bt2_idx": "/share/gatk_bundle/ecoli/MG1655",
    "trim_adapters": true,
    "igg_control": false,
    "mt_chrom": "chrM",
    "dedup": true,
    "minquality": 3,
    "fragment_min": 10,
    "fragment_max": 700,
    "genome_size": "hs",
    "seacr_qvalue": 0.01,
    "macs3_qvalue_with_control": 0.01,
    "macs3_qvalue_no_control": 0.001,
    "heatmap_window": 3000,
    "gtf_file": "",
    "outdir": "/data/CAWS/20230324"
}

Running on SLURM

Click to view SLURM execution details

Snakemake 9.x (Profile-based execution)

Step 1: Configure the SLURM profile

Edit profiles/slurm/config.yaml to set your allocation and partition in the default-resources section:

default-resources:
  - runtime=180
  - mem_mb=5000
  - threads=1
  - slurm_account=your-allocation-name  # e.g., ratan
  - slurm_partition=your-partition      # e.g., standard

Step 2: Load required modules

module load miniforge
module load snakemake/9.8.1

Step 3: Execute the pipeline

Simple execution:

snakemake --profile profiles/slurm --configfile config.json

Advanced execution with conda caching:

snakemake \
  --profile profiles/slurm \
  --configfile config.json \
  --conda-prefix /scratch/$USER/conda-envs \
  --jobs 100

As a SLURM job:

#!/bin/bash
#SBATCH -A your-allocation
#SBATCH -p standard
#SBATCH -c 1
#SBATCH -t 48:00:00
#SBATCH --mem=4G
#SBATCH -o caws_%j.out
#SBATCH -e caws_%j.err

module load miniforge
module load snakemake/9.8.1

snakemake \
  --profile profiles/slurm \
  --configfile config.json \
  --conda-prefix /scratch/$USER/conda-envs

Profile Configuration Details

The profile (profiles/slurm/config.yaml) handles:

  • Job submission to SLURM with executor plugin
  • Resource allocation per rule (memory, CPU, runtime)
  • Job monitoring and resubmission on failure
  • Concurrent job limits (default: 50, override with --jobs)

Useful Options

  • --conda-prefix /path/to/envs: Cache conda environments (recommended for repeated runs)
  • --jobs N: Override maximum concurrent SLURM jobs (default: 50)
  • --notemp: Keep temporary files for debugging
  • --rerun-incomplete: Rerun incomplete jobs (useful after interruptions)
  • --dryrun or -n: Preview execution without running
  • --printshellcmds: Show shell commands (already enabled in profile)

Monitoring

Monitor jobs with:

squeue -u $USER           # Check job status
tail -f caws_*.out        # Watch controller output
tail -f logs/rulename/*   # Watch specific rule logs
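
For jobs that have already left the queue (and no longer show in squeue), sacct reports their final state and resource usage:

# Review completed or failed jobs
sacct -u $USER --format=JobID,JobName%20,State,Elapsed,MaxRSS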

Features

Data Input Flexibility

  • Single-End & Paired-End Support – Process SE, PE, or mixed datasets seamlessly
  • Automatic Detection – The pipeline identifies each sample's sequencing type from the samplesheet
  • SE/PE Compatibility Validation – Ensures experimental samples and controls have matching sequencing types
  • Optimized Processing – Separate workflows for SE and PE data, each tuned to its read layout

Core Pipeline

  • Quality control (FastQC)
  • Adapter trimming (Trim Galore)
  • Alignment (Bowtie2)
  • Duplicate removal (Picard)
  • Peak calling (SEACR & MACS3)
  • Contamination detection (mtDNA, E. coli)
  • Fragment size & correlation analysis (PE only)
  • FRiP score and reproducibility metrics

Enhanced Analysis (v2.0)

  • TSS enrichment profiling
  • Peak width and shape distribution
  • Interactive HTML dashboards (Plotly)
  • SEACR vs MACS3 method comparison

Output Structure

Click to expand output directory layout
output_directory/
├── qc/                  # FastQC reports
├── trimmed/             # Adapter-trimmed reads
├── alignments/          # BAM alignments (deleted on successful completion)
├── dedupalignments/     # Deduplicated BAMs (quality-filtered versions preserved)
├── bedalignments/       # BED files for peak calling
├── peaks/
│   ├── seacr/           # SEACR peaks
│   └── macs3/           # MACS3 peaks
├── stats/
│   ├── seacr/           # SEACR metrics
│   └── macs3/           # MACS3 metrics
├── reports/             # Interactive HTML reports
├── annotations/         # GTF-based TSS data
└── logs/                # Log files

Note on alignment cleanup: Upon successful pipeline completion, the alignments/ directory is automatically deleted to save disk space. Quality-filtered BAM files used for peak calling are preserved in dedupalignments/ (if deduplication is enabled). If you need to retain the original alignment files, interrupt the pipeline before completion or modify the cleanup behavior in the Snakefile (lines 965-966).


Interactive HTML Dashboard

Report: reports/cutntag_analysis_report.html

Dashboard Sections

  • Overview: Sample summaries, alignment metrics, and FRiP scores
  • Peak Analysis: Width distributions, peak counts, and statistics
  • Quality Control: Fragment size profiles, reproducibility metrics, TSS enrichment
  • Method Comparison: SEACR vs MACS3 correlation and overlap

Interactive Features:

  • Plotly-based zoom/pan/hover
  • Sortable, searchable data tables
  • Conditional TSS panels (shown only if GTF is provided)

TSS Enrichment Analysis

Click to expand details

Enable by specifying a GTF file in your configuration:

"gtf_file": "/path/to/genes.gtf"

Features:

  • Extracts TSS sites from GTF (.gtf or .gtf.gz) — see the sketch after this list
  • Calculates enrichment ±2 kb around TSS
  • Generates TSS enrichment plots and scores
  • Integrates results into the dashboard
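
For intuition, TSS extraction from a GTF looks roughly like the following (a minimal sketch, not the pipeline's exact code; GTF coordinates are 1-based while BED is 0-based, and the TSS is the gene start on the + strand and the gene end on the - strand):

# Sketch: derive TSS coordinates from a GTF (plain or gzipped)
zcat -f genes.gtf | awk -F'\t' '$3 == "gene" {
  if ($7 == "+") print $1 "\t" $4-1 "\t" $4;
  else           print $1 "\t" $5-1 "\t" $5;
}' > tss.bed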

Outputs:

  • annotations/tss.bed — Extracted TSS coordinates
  • stats/tss_enrichment.txt — Per-sample enrichment scores

Peak Width Analysis

Click to expand details

Metrics:

  • Summary statistics (mean, median, quartiles)
  • Peak categories (reproducible with the one-liner after this list):
    • Narrow ≤ 500 bp (high specificity)
    • Broad > 2000 bp (open chromatin regions)
  • Comparison between SEACR and MACS3
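
These categories can be reproduced from any peak BED file with a short awk pass, since column 3 minus column 2 is the peak width (file name per the earlier example):

# Count narrow (<=500 bp), intermediate, and broad (>2000 bp) peaks
awk '{ w = $3 - $2;
       if (w <= 500) narrow++; else if (w > 2000) broad++; else mid++ }
     END { print "narrow:", narrow+0, "mid:", mid+0, "broad:", broad+0 }' \
  results/peaks/seacr/H3K4me3_rep1.stringent.bed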

Visualizations:

  • Histograms and category proportions
  • Sample-level statistics tables
  • Interactive Plotly charts

Key Dependencies

  • Snakemake — Workflow management
  • Bowtie2 — Alignment
  • SEACR — CUT&RUN/CUT&Tag peak calling
  • MACS3 — Model-based ChIP-seq analysis
  • FastQC, Trim Galore, Picard, deepTools
  • R packages — ggplot2, plotly, flexdashboard

Troubleshooting

Click to view common issues and solutions

Conda Version Error

Error: CreateCondaEnvironmentException: Conda must be version 24.7.1 or later, found version X.X.X

Solution 1: Update Conda (Recommended)

# Update conda in your base environment
conda update -n base conda

# Verify version
conda --version  # Should be ≥24.7.1

Solution 2: Install Conda in Snakemake Environment

# Activate your snakemake environment
conda activate caws-snakemake

# Install updated conda
conda install -c conda-forge "conda>=24.7.1"

# Verify
conda --version

Module Load Errors

Error: module: command not found or modules don't load

Solution: Ensure you're on a system with environment modules. If running locally without modules:

# Skip module loading, just activate conda directly
conda activate your-snakemake-env
snakemake --cores 8 --use-conda --configfile config.json

SLURM Job Failures

Error: Jobs fail with memory or time limits

Solution: Adjust resource allocations in profiles/slurm/config.yaml:

set-resources:
  rule_name:
    mem_mb: 40000      # Increase memory
    runtime: 600       # Increase time (minutes)

Empty Peak Files

Issue: MACS3 produces empty peak files for some samples

Cause: MACS3 may fail to build a model for low-quality or single-end samples

Expected Behavior: The pipeline handles this gracefully by creating empty placeholder files. Check the logs:

cat logs/call_macs_peaks_se/sample_name.log

If this affects many samples, consider adjusting macs3_qvalue_no_control in config.json to a less stringent value (e.g., 0.01 instead of 0.001).
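
To see which samples were affected, list empty files under the peaks directory (output extensions vary by caller):

find results/peaks -type f -empty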

Conda Environment Creation Hangs

Issue: Environment creation takes very long or appears stuck

Solution: Use mamba instead of conda

# The profile already uses mamba, but ensure it's installed:
conda install -c conda-forge mamba

Permission Denied Errors

Issue: Cannot write to output directory or conda-prefix

Solution: Ensure you have write permissions:

# Check output directory
ls -ld /path/to/outdir

# Use a directory in your scratch space for conda-prefix:
snakemake --conda-prefix /scratch/$USER/conda-envs --configfile config.json

File Not Found Errors

Issue: Input files (FASTQ, reference, etc.) not found

Solution:

  1. Check all paths in config.json are absolute paths
  2. Verify files exist and are readable:

# Test file access
ls -lh /path/to/file

  3. Ensure paths don't contain spaces or special characters
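
To test every string value in config.json against the filesystem in one pass (a sketch assuming jq is installed; non-path strings such as "chrM" will be reported too and can be ignored):

# Flag config entries that do not exist on disk
jq -r '.. | strings' config.json | while read -r p; do
  [ -e "$p" ] || echo "Not found: $p"
done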

License

This project is licensed under the MIT License.
See the LICENSE file for details.


Acknowledgements

We thank the developers of all tools integrated into CAWS and the wider bioinformatics community for providing the foundational software that makes this workflow possible.
