Skip to content

UMCUGenetics/primary-met-wgs-comparison

Repository files navigation

'Pan-cancer whole genome comparison of primary and metastatic solid tumors'

This repository contains all the data and analysis related to the manuscript: https://www.nature.com/articles/s41586-023-06054-z

Preprint: https://www.biorxiv.org/content/10.1101/2022.06.17.496528v1

Project Structure

This repository is structured as follows:

Analysis-type
└── data/            # directory where all external agloblata data is stored
    
└── code/            # directory where all the code is stored
   
└── results/
    ├── data/        # supplementary tables and results
    ├── figures/     # raw figures
    
├── README.md        # this file
└── environment_analysisA.yaml # Anaconda environment YAML file for specific analysis "karyotype", "driver_enrichment_and_actionability" & "hartwig_pipeline_validation"
└── environment_analysisB.yaml # Anaconda environment YAML file for specific analysis "SBS1_mutrate" and "TEDs"
└── environment_snakemake.yaml # Anaconda environment YAML file for snakemake scripts

Data access

access PCAWG data

Somatic variant calls, gene driver lists, copy number profiles and other core data of the PCAWG cohort generated by the Hartwig analytical pipeline are available for download at https://dcc.icgc.org/releases/PCAWG/Hartwig. Researchers will need to apply to the ICGC data access compliance office (https://daco.icgc-argo.org) for the ICGC portion of the dataset. Similarly, users with authorized access can download the TCGA portion of the PCAWG dataset at https://icgc.bionimbus.org/files/5310a3ac-0344-458a-88ce-d55445540120. Additional information on accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/.

access Hartwig data

Metastatic WGS data and metadata from the Hartwig Medical Foundation are freely available for academic use through standardized procedures. Request forms can be found at https://www.hartwigmedicalfoundation.nl/en/data/data-acces-request/

Supplementary Information

Supplementary Tables from the manuscript include relevant information to reproduce the analysis displayed in the manuscript.

The mutational profile and the pairwise similarity of all extracted mutational signatures can be found in https:///zenodo.org/record/7396538 and Supp. Data 1.

About

Code repo for all analysis done for the 'Pan-cancer whole genome comparison of primary and metastatic solid tumors' study.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published