Analysis workflow for all data present in our PDAC Atlas Manuscript titled, "Transfer learning associates CAFs with EMT and inflammation in tumor cells in human tumors and organoid co-culture in pancreatic ductal adenocarcinoma"
Scripts for the creation of the atlas, and its downstream analyses can be found under the /Atlas Directory.
- PDAC atlas engineering_051622_BK.Rmd:
- RAW Counts from all manuscripts
- PDAC atlas analysis_051622_BK.Rmd:
- cds_combined_8_SHARED_GENES_QC_filtered_harmonized_preprocessed_aligned_manuscript_UMAP_learn_graph.rds
- PDAC_MHCII_DifferentialExprs.Rmd:
- cds_duct_SHARED_GENES_QC_filtered_harmonized_preprocessed_aligned_manuscript_UMAP_learn_graph.rds
- PDAC_CAF_MHCII_DifferentialExprs.Rmd:
- cds_combined_8_SHARED_GENES_QC_filtered_harmonized_preprocessed_aligned_manuscript_UMAP_learn_graph.rds
- 06222022_STEELE_PYSCENIC_HG38only.sh & Domino_PDAC_EpCAF_STEELE_HG38only.Rmd:
- cds_combined_8_SHARED_GENES_QC_filtered_harmonized_preprocessed_aligned_manuscript_UMAP_learn_graph.rds
- 06222022_PENG_PYSCENIC_HG38only.sh & Domino_PDAC_EpCAF_PENG_HG38only.Rmd:
- cds_combined_8_SHARED_GENES_QC_filtered_harmonized_preprocessed_aligned_manuscript_UMAP_learn_graph.rds
Scripts for the analysis of the 12HR PDO-CAF Coculture scRNA-seq data can be found under the /Multiseq Directory.
- Preprocess_fromCore:
- RAW FASTQ
- 12H_MultiSeq_01_ProjR_Inflammation.Rmd:
- dgeData.rda
- epiMat-e74511f6-8a1d-4929-ab42-1cb67c5a4fb6-result-8pattern.rds
- 12H_MultiSeq_02_Moffit.Rmd:
- 12H_MultiSeq_ProjR_InflammationData.rda
Scripts for the analysis of the 3 untreated PDO-CAF bulk RNA-seq data can be found under the /Bulkseq Directory.
- 00_Extract_Sample_Names.Rmd:
- Raw Data Folder names
- 01_salmonQuant.sh & 01_SG_Novogene_12202022_SalmonSetup.R:
- RAW FASTQ files from Novogene
- 02_SalmonQuant_to_txi.Rmd:
- Output of Salmon.sh (Salmon_Quants/)
- tgMap.tsv
- 03_PDACAtlas_CoGAPS_Projection.Rmd:
- txi_data_all.rds
- SG_Novogene_Annot.csv
- epiMat-e74511f6-8a1d-4929-ab42-1cb67c5a4fb6-result-8pattern.rds