"autobin": determine bin sizes from given BAM(s) #170

Closed
etal opened this issue Dec 23, 2016 · 0 comments

etal commented Dec 23, 2016

This new command would dynamically determine on- and off-target bin sizes, given a reference genome FASTA and one or more (?) BAM files from normal samples. In effect it would run the access, target, antitarget and coverage steps simultaneously, emitting target and antitarget .cnn files and perhaps BED files.

Possible implementations:

  1. Use a fast estimate of overall coverage depth(s), e.g. from samtools idxstats read counts together with the target (and optionally access) BED files to compute and compare the sizes of the sequenced areas, and simply print the recommended average bin sizes to use with target and antitarget.
  2. Stream through one or more BAMs using samtools depth or pysam count() or count_coverage(), and emit target and antitarget bin coordinates and coverage depths designed to place approximately a constant number of reads in each bin. Two passes or some backtracking might be needed to avoid "remainder" reads/bins, unless we can use (1) for hints. Some more tuning options may be necessary.
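To make option 2 concrete, here is a minimal single-pass sketch of equal-count binning. It is only an illustration, not code from this project: the function name `equal_count_bins`, the `reads_per_bin` parameter, and the rule of folding a small trailing remainder into the previous bin are all assumptions, and the read start positions are assumed to have been extracted from the BAM already.

```python
from typing import List, Tuple


def equal_count_bins(read_starts: List[int], region_start: int, region_end: int,
                     reads_per_bin: int) -> List[Tuple[int, int, int]]:
    """Split [region_start, region_end) into bins holding roughly equal read counts.

    read_starts: sorted 0-based read start positions, all inside the region.
    Returns (bin_start, bin_end, n_reads) tuples that tile the whole region.
    (Hypothetical helper for illustration only.)
    """
    if reads_per_bin < 1:
        raise ValueError("reads_per_bin must be at least 1")
    bins: List[Tuple[int, int, int]] = []
    bin_start = region_start
    count = 0
    for i, pos in enumerate(read_starts):
        count += 1
        # Never cut between reads that share a start position.
        last_at_pos = i + 1 == len(read_starts) or read_starts[i + 1] != pos
        if count >= reads_per_bin and last_at_pos:
            bins.append((bin_start, pos + 1, count))
            bin_start = pos + 1
            count = 0
    if bin_start < region_end:
        if bins and 2 * count < reads_per_bin:
            # Small "remainder": extend the previous bin instead of emitting a thin one.
            prev_start, _, prev_count = bins.pop()
            bins.append((prev_start, region_end, prev_count + count))
        else:
            bins.append((bin_start, region_end, count))
    return bins


if __name__ == "__main__":
    starts = [0, 10, 20, 30, 40, 50, 60]
    for b in equal_count_bins(starts, 0, 100, reads_per_bin=3):
        print(b)
```

Folding the remainder into the last bin avoids a second pass, at the cost of one oversized bin per region; a hint from option 1 could instead be used to pick `reads_per_bin` so that remainders are rare.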

See also: bcbio/bcbio-nextgen#1582

etal added a commit that referenced this issue Dec 31, 2016
Quickly estimate on- and off-target average coverage depth and reasonable
average bin sizes -- for all 3 supported sequencing methods (WGS, hybrid
capture, targeted amplicon sequencing).

The .bai trick doesn't seem to work as well as I thought it would for target
coverages; might need to refactor cnvlib.coverage.bedcov to make it reusable
here.
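
As a rough illustration of the kind of quick estimate described in this commit message, the sketch below turns read counts into an average depth and then into a suggested bin size. It is not the committed code: the helper names, the `bp_per_bin` parameter (desired sequenced bases per bin), and the example numbers are assumptions. The on-/off-target split of reads is taken as a given input here, since the note above says the index-based shortcut was unreliable for target coverage.

```python
def parse_idxstats(text: str) -> int:
    """Sum the mapped-read column of `samtools idxstats` output.

    Each line is tab-separated: reference name, length, mapped, unmapped.
    """
    total = 0
    for line in text.splitlines():
        if not line.strip():
            continue
        _name, _length, mapped, _unmapped = line.split("\t")
        total += int(mapped)
    return total


def average_depth(n_reads: int, read_length: int, region_bp: int) -> float:
    """Approximate mean depth: sequenced bases divided by region size."""
    return n_reads * read_length / region_bp


def bin_size_for_depth(depth: float, bp_per_bin: int) -> int:
    """Bin width (bp) so that each bin holds about bp_per_bin sequenced bases.

    bp_per_bin is a hypothetical tuning parameter for this sketch.
    """
    if depth <= 0:
        raise ValueError("depth must be positive")
    return max(1, round(bp_per_bin / depth))


if __name__ == "__main__":
    # Made-up example values, for illustration only.
    idxstats = "chr1\t1000\t600000\t0\nchr2\t500\t400000\t2\n*\t0\t0\t10\n"
    print("mapped reads:", parse_idxstats(idxstats))
    on_depth = average_depth(600_000, 100, 1_000_000)
    off_depth = average_depth(400_000, 100, 2_000_000_000)
    print("target bin size:", bin_size_for_depth(on_depth, 100_000))
    print("antitarget bin size:", bin_size_for_depth(off_depth, 100_000))
```

Dividing a fixed per-bin base budget by the depth gives small bins where coverage is deep (on-target) and very large bins where it is sparse (off-target), which is the behaviour the issue asks for.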

etal closed this as completed in 29db4ab on Feb 25, 2017