Skip to content
/ SARC Public

Systems Analysis of Regions with Copy Number Variations (CNVs). A small R package which can be used to flag which variants detected from WES/ WGS based CNV detection pipelines/ tools are unlikely to be false positives. This can aid in reducing the amount of variants for clinicians/ diagnostic scientists to interpret.

License

Notifications You must be signed in to change notification settings

Krutik6/SARC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SARC

Statistical Analysis on Regions with CNVs

SARC applies several statistical tests of CNVs detected from WES (whole exome sequencing)/ WGS (whole genome sequencing) tools or pipelines.

Most CNV detection strategies will lead to false positives, and SARC aims to 1) reduce the number of false positives by applying simple statisical tests on regions where a CNV was detected, and 2) provide a quicker and automated alternative to viewing CNVs in the Interactive Genome Browser (IGV).

Input data

SARC needs two file types:

  1. BED FILE - list of all CNVs detected from pipeline. Should contain columns which are named: CHROM, START, END, VALUE. BED files are the most common form of output from CNV detection tools/ annotation tools. https://en.wikipedia.org/wiki/BED_(file_format)

  2. COV FILE - matrix of genomic regions and reads per sample. This can be created from bedtools Multicov or samtools coverage. https://bedtools.readthedocs.io/en/latest/content/tools/multicov.html \n http://www.htslib.org/doc/samtools-coverage.html \n

Install package

library(devtools)
install_github("Krutik6/SARC")

Package has been submitted to bioconductor, until this is resolved, the tool is available from github.

Usage

Please read the vignette locted in the vignettes folder for explanations of the functions.

Citation

The tool will be used in several WES cohorts and citations will be upcoming.

Liscence

GPL-3.

About

Systems Analysis of Regions with Copy Number Variations (CNVs). A small R package which can be used to flag which variants detected from WES/ WGS based CNV detection pipelines/ tools are unlikely to be false positives. This can aid in reducing the amount of variants for clinicians/ diagnostic scientists to interpret.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages