Skip to content

Ambigram experiment materials: rundown, data, scripts...

Notifications You must be signed in to change notification settings

deepomicslab/Ambigram_paper

Repository files navigation

Introduction

DOI

This is a repository for experiments on the Nature Communication paper Deciphering complex breakage-fusion-bridge genome rearrangements with Ambigram. Here, we aim to benchmark the efficacy of Ambigram to decipher various BFB events on Illumina pair-end (PE) reads, Oxford Nanopore (ONT) long reads, Pacific Biosciences (PB) long reads, 10x Genomics linked reads with varying tumor purity and sequencing depth.

Steps of in silico experiments

  1. Benchmark based on simulated data: We simulated 4 sets of data to test the efficacy of Ambigram in deciphering different BFB paths, including fold-back inversions, deletions, and translocations.
  2. Construct a BFB path from a set of test data (sv and seg files) with Ambigram.
  3. Generate a BFB fasta file (base sequence) with reference to hg38.fa.
  4. Simulate sequencing reads on the BFB fasta with different simulators for PE, PB, ONT, and 10x, respectively.
  5. Align the simulated reads with the Homo sapiens (human) genome reference (e.g., hg38.fa).
  6. Extract SV (structural variant) information from the BAM file derived from alignment.
  7. Reconstruct the BFB path with Ambigram and new SV information.

**Note: There is a .sh file in each directory for the specific experiment. **

Steps of reconstructing BFB path using curated SV

  1. Extract partial reads from the whole genome bam file.
  2. (optional) Test the effect of tumor purity and sequencing depth.
    1. Merge the normal sample with the tumor (with SVs) sample in a ratio.
    2. Subsample the clipped bam file to generate bam files with different depths.
  3. Call SVs from the clipped bam file with some tools, e.g., svaba or sniffles.
  4. Convert the vcf file into a sv file.
  5. Generate a lh file with the sv file and seg file.
  6. Reconstruct the BFB path with Ambigram and new SV information.

Prerequisites

Citation

@article{li2023deciphering,
  title={Deciphering complex breakage-fusion-bridge genome rearrangements with Ambigram},
  author={Li, Chaohui and Chen, Lingxi and Pan, Guangze and Zhang, Wenqian and Li, Shuai Cheng},
  journal={Nature Communications},
  volume={14},
  number={1},
  pages={5528},
  year={2023},
  publisher={Nature Publishing Group UK London}
}