This contains the code and processed data for the Chevrier-lab combos project.
It's components are:
-
intermediate data: the processed data used by code in the analysis section
-
software:
-
preprocessing: the scripts for taking raw fastq files into read count tables
- pipeline.sh - the preprocessing and alignment portion of the ATACseq pipeline. Only handles one set of fastq files at a time and has to be run manually.
- Data structure: The pipeline is assuming the existence of a project folder where the data is and will be stored. It has a fastq/ folder with the fastq files, which are named [stim]_[replicate#]_R[1/2].fastq, and it has an out/ folder. pipe_line.sh will populate the out/ folder with the results of preprocessing, with the results from each pair of fastq files (each processed by their own call to pipeline.sh) being placed in their own folder. The downstream analysis files assume the existence of such folders in out/
- peak_merge.sh - a bash script using bedtools and samtools to make a unified peakset
- merge_and_count.r - contains an r function that calls peak_merge.r and then makes a table of read counts using the unified peaks
- pipeline.sh - the preprocessing and alignment portion of the ATACseq pipeline. Only handles one set of fastq files at a time and has to be run manually.
-
Analysis: the scripts for running our final analysis, as well as functions they depend on
-
The final folder: each script in this folder runs a portion of our final analysis and produces some number of plots for the paper. Some of the scripts are dependent on having others running for them (and will source the script it depends on when you run it)
- final_PI.r - analysis of cell proliferation data
- Figures: 1d, 2a, 2b, 2c, 2d, 2e
- Supplemental Figures: S4a, S4b, S4c, S5a, S5b, S5c, S6a, S6b, S7, S8a, S8b, S9a, S9b
- final_ligand_dilution.r - plots of ligand dilution curves
- Supplemental Figures: S2b
- final_SP7.r - analysis of mouse RNA-seq data
- Figures: 3b, 3c, 3d
- Supplemental Figures: S10, S11
- final_human.r - analysis of human RNA-seq data
- Figures: 3e, 3f
- Supplmental Figures: S12a, S12b, S12c, S12d
- final_atac.r - analysis of mouse ATAC-seq data
- Dependencies: final_SP7.r
- Supplemental Figures: S13g, S13h
- final_secretome.r - analysis of mouse secretome mass-spectrometry data and cytokine concentration assays
- Dependencies: final_atac.r
- Supplemental Figures: S13a, S13b, S13c, S14
- final_cyto_restim.R - analysis of restimulation assays
- Figures: 5c, 5d
- Supplemental Figures: S16a, S17a, S17b
- final_PI.r - analysis of cell proliferation data
-
diff_peaks.r: functions for the differential expression analysis for ATAC-seq data
-
rna_de.r: functions for the differential expression analysis for RNA-seq data
-
combo_analysis.r: Contains a variety of functions usefull for doing analysis of combinations of treatments
compare_combos_contrasts()
runs the bulk of the ATAC-seq data processingmake_comparison_binaries()
calculates which genes are new- various other functions for smaller tasks
-
index_functions.r: Contains various functions for Isserlis-formula related analyses
-
-