This repo contains the code used for the P11636 project.
The RNA-seq analysis is performed with CQSperl via the performRNASeq_gencode_mm10 function.
The main pipeline configuration file is:
20240801_11636_RNAseq_mouse.rnaseq.pl
Executing this file generates analysis scripts, which can then be submitted to the SLURM system. All step-specific scripts are collected in the pipeline_codes folder.
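As a rough illustration of the submission step, the sketch below builds one `sbatch` command per generated script and prints them in dry-run mode. The `*.sh` pattern, the helper names `sbatch_commands` and `submit`, and the assumption that scripts can be submitted independently in name order are all hypothetical; check the scripts actually written by CQSperl (and any dependencies between steps) before submitting.

```python
import subprocess
from pathlib import Path


def sbatch_commands(script_dir, pattern="*.sh"):
    """Return one ['sbatch', path] command per matching script, sorted by name.

    The '*.sh' default is an assumption about how generated scripts are named.
    """
    return [["sbatch", str(p)] for p in sorted(Path(script_dir).glob(pattern))]


def submit(commands, dry_run=True):
    """Print each command (dry run) or hand it to SLURM via sbatch."""
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))
        else:
            subprocess.run(cmd, check=True)
```

Calling `submit(sbatch_commands("pipeline_codes"))` only prints the commands; pass `dry_run=False` on a machine where `sbatch` is available to actually submit them.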
The pipeline consists of the following steps:
- Paired-end validation – Verifies paired-end sequencing data for each sample.
- FastQC (raw) – Quality control of raw FASTQ files.
- Cutadapt – Trims adapter sequences from reads.
- Cutadapt validation – Checks that trimming completed successfully.
- FastQC (post-trimming) – Quality control after adapter trimming.
- FastQC summary – Generates a QC report for all samples.
- STAR + featureCounts – Maps reads to the reference genome and quantifies gene expression.
- Gene table generation – Creates the gene count table.
- DESeq2 – Performs differential expression analysis.
- WebGestalt ORA – Over-representation pathway enrichment analysis.
- GSEA – Gene set enrichment analysis using all genes from the DESeq2 results.
- Final report – Compiles results into a summary report.
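For readers unfamiliar with the ORA step listed above, over-representation is commonly scored with a one-sided hypergeometric test. The sketch below illustrates that idea only; it is not the WebGestalt implementation, the function name `ora_p_value` is made up for this example, and the numbers in the usage line are hypothetical rather than results from this project.

```python
from math import comb


def ora_p_value(universe, set_size, n_selected, overlap):
    """One-sided hypergeometric p-value, P(X >= overlap).

    universe   : number of background genes
    set_size   : number of background genes that belong to the pathway
    n_selected : number of genes of interest (e.g. significant DE genes)
    overlap    : number of genes of interest that belong to the pathway
    """
    total = comb(universe, n_selected)
    upper = min(set_size, n_selected)
    tail = sum(
        comb(set_size, k) * comb(universe - set_size, n_selected - k)
        for k in range(overlap, upper + 1)
    )
    return tail / total


# Hypothetical numbers, for illustration only.
print(ora_p_value(universe=20000, set_size=150, n_selected=500, overlap=12))
```

In practice the p-values from many pathways are then adjusted for multiple testing, which this sketch leaves out.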
Beyond the default ORA and GSEA pathway enrichment analyses, we implemented additional enhancements. The corresponding code and dependency files are stored in the enrichment_plots folder.
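As background for the GSEA results mentioned above, the enrichment score in the standard GSEA formulation is the largest deviation of a running sum computed over the ranked gene list. The sketch below is a minimal version of that statistic under stated assumptions: the function name `enrichment_score` and the toy gene names are hypothetical, the ranking is assumed to be already sorted from most up-regulated to most down-regulated, and permutation-based significance is omitted. It is not the code used in this repo.

```python
def enrichment_score(ranked, gene_set, weight=1.0):
    """Running-sum enrichment score for one gene set.

    ranked   : list of (gene, score) pairs, already sorted by score
    gene_set : set of gene names
    weight   : exponent applied to |score| for genes in the set
    """
    in_set = [gene in gene_set for gene, _ in ranked]
    n_miss = in_set.count(False)
    hit_weights = [
        abs(score) ** weight if hit else 0.0
        for (_, score), hit in zip(ranked, in_set)
    ]
    hit_total = sum(hit_weights)
    if hit_total == 0 or n_miss == 0:
        return 0.0  # score is undefined without both hits and misses

    running = 0.0
    best = 0.0
    for w, hit in zip(hit_weights, in_set):
        # Step up at genes in the set, step down at all other genes.
        running += (w / hit_total) if hit else (-1.0 / n_miss)
        if abs(running) > abs(best):
            best = running
    return best


# Toy ranking with placeholder gene names (not project data).
toy_ranking = [("GeneA", 3.0), ("GeneB", 2.0), ("GeneC", -1.0), ("GeneD", -2.0)]
print(enrichment_score(toy_ranking, {"GeneA", "GeneB"}))
```

A positive score indicates that the set is concentrated at the top of the ranking, a negative score that it is concentrated at the bottom.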
Additional gene sets for pathway enrichment analysis are available in enrichment_plots/geneset/:
- Custom gene sets:
  - Actin_Cell_Mobility
  - EMT
  - PI3K
  - Integrin
  - Wnt_signaling
  - P53_signaling
  - Innate_immune_response
- Publicly available gene sets:
  - Hallmark Genes (MSigDB)
  - Curated Gene Sets (MSigDB)
  - Regulatory Target Gene Sets (MSigDB)
  - Ontology Gene Sets (MSigDB)
  - Cell Type Signature Gene Sets (MSigDB)
  - GO_BP, GO_CC, GO_MF (WebGestaltR)
  - KEGG (WebGestaltR)
  - Reactome (WebGestaltR)
  - WikiPathways (WebGestaltR)
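Gene sets such as those listed above are often exchanged as GMT files, the tab-separated format MSigDB uses for its downloads (set name, description, then one gene per column). Whether the files in enrichment_plots/geneset/ use this format is an assumption, so treat the following as a sketch of how such a file could be read; `parse_gmt` and the demo set and gene names are hypothetical.

```python
def parse_gmt(text):
    """Parse GMT-formatted text into {set_name: set_of_genes}.

    Each non-empty line is: name <TAB> description <TAB> gene1 <TAB> gene2 ...
    Lines with fewer than three fields are skipped.
    """
    gene_sets = {}
    for line in text.splitlines():
        fields = line.split("\t")
        if len(fields) < 3:
            continue
        name, _description, *genes = fields
        gene_sets[name] = {g for g in genes if g}
    return gene_sets


# Placeholder content, not the actual gene sets from this repo.
demo = "DEMO_SET_1\tdemo\tGeneA\tGeneB\tGeneC\nDEMO_SET_2\tdemo\tGeneB\tGeneD\n"
for set_name, genes in parse_gmt(demo).items():
    print(set_name, sorted(genes))
```

To read a file from disk, pass its contents, for example `parse_gmt(open(path).read())`, where `path` points to one of the gene set files.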
Figures have been optimized for clarity and presentation.
Pathway-specific visualizations are generated for:
- mmu04310 – Wnt signaling pathway
- mmu04810 – Regulation of actin cytoskeleton
- mmu04151 – PI3K-Akt signaling pathway
- mmu04115 – p53 signaling pathway

To reproduce the enrichment results:
- Run run_webgestaltr_and_GSEA.r to generate ORA and GSEA results.
- Render enrichment_plots/heather_report.rmd to generate an HTML report.