Skip to content
Arnold Kuzniar edited this page Sep 4, 2019 · 1 revision
  • chr17 of the hg19 Human genome assembly is used as the test chromosome
  • SURVIVOR is used to introduce SVs only on one of the two homologous chromosomes
  • The ART read simulator is used to generate paired-end reads
  • bwa-mem is used to map the reads on the hg19 assembly

Output of the workflow are two categories of Tumor/Normal pairs of BAM alignment files: Somatic and Germline.

The Somatic category contains a Tumor BAM file with 50% tumor purity and a Normal BAM file which does not contain SVs. The model is trained to recognize SVs that are present in the Tumor but not in the Normal BAM files as 'somatic'.

The Germline category contains two BAM files with the same set of SVs, so the model is trained to recognize SVs that are present in both the Tumor and Normal BAM files as 'germline'.

Clone this wiki locally