Permalink
Commits on Aug 3, 2017
  1. Only run qualimap for standard RNA-seq projects.

    Thanks to @lpantano for pointing out the issue.
    roryk committed Aug 3, 2017
  2. Move transcriptome FASTA files to the final directory.

    This lets us know exactly what we quantitated against for a
    particular project.
    roryk committed Aug 3, 2017
  3. miraligner: ensure local anaconda java used

    Reworks java path detection and usage to try and ensure we use
    anaconda installed Java 8. Fixes #2029
    committed Aug 3, 2017
  4. archive: add option for cram-lossless

    Do conversion to CRAM without quality score binning. Thanks to
    Alison Meynert.
    committed Aug 3, 2017
Commits on Aug 2, 2017
  1. Use gffread instead of gtf_to_fasta to make transcriptome indexes.

    Closes #2025. Thanks to Rod Docker (@rdocking) for the great bug
    report and finding out the solution.
    roryk committed Aug 2, 2017
  2. log: attempt to retrieve IP from domain name

    Work around issues on machines with multiple IPs and no valid way
    to resolve by initially checking, then falling back to previous
    approaches if it's a local reference.
    committed Aug 2, 2017
Commits on Aug 1, 2017
  1. Add support for bioconda plink.

    bioconda plink is 1.90 but the exectuable is named plink2.
    roryk committed Aug 1, 2017
  2. Use atropos for multicore streaming read trimming

    Replaces cutadapt as default read trimmer. atropos
    (https://atropos.readthedocs.io/en/latest/) is a drop in
    replacement with multithreading and streaming outputs to bgzip. bcbio
    implementation uses the same paramters as previous cutadapt approach.
    
    - Removes extra trim and double grabix prep steps for alignment
      pre-trimming, avoiding IPython issues.
    - Removes old align_prep_method configuration parameter.
    committed Aug 1, 2017
  3. samtools multicore BAM indexing

    Replaces sambamba indexing with new native samtools multicore indexing,
    fixed in 1.5 release (samtools/samtools#687). Avoids intermittent
    system specific segfaults with sambamba.
    committed Aug 1, 2017
Commits on Jul 30, 2017
  1. GATK4: GenomicsDBImport parallelization

    Add multicore reading introduced in beta3 release.
    committed Jul 30, 2017
Commits on Jul 29, 2017
  1. Tumor-only prioritization: use vcfanno

    Swap to using vcfanno instead of GEMINI for retrieving metrics for
    tumor-only prioritization and germline estimation. Avoids needing
    a full GEMINI database for assessment.
    committed Jul 29, 2017
Commits on Jul 28, 2017
  1. GATK4 BQSR: Provide reference target for 2bit

    Parallel BQSR in GATK4 requires 2bit formatted genomes. Provide slot
    and correct lookup of these in preparation for installing them
    as part of bcbio.
    committed Jul 28, 2017
Commits on Jul 27, 2017
  1. Avoid confusing error message if aligner: true

    Avoids a resource calculation error before the code that correctly
    identifies and provides a useful error message for this error.
    Fixes #2024
    committed Jul 27, 2017
  2. merging: add GATK4 GatherVcfs and fix header issue

    - Adds GATK4 GatherVcfs, a new rapid way in GATK to combine VCFs
      similar to bcftools concat --naive. Fixes #2023
    - fixes issues with initial scattered contigs from chrM. These are
      haploid called so don't have PID phasing key/value pairs, causing
      missing header problems. Also fixes #1999 which has key errors
      due to checking for GATK variantcallers, which we no longer need
      to do.
    committed Jul 27, 2017
Commits on Jul 26, 2017
  1. Max coverage downsample: move to streaming

    Stream to maximum coverage downsampling during merging for align_split
    samples or after sorting for non split. Avoids slow runtimes for
    separate BAM downsampling. Fixes #2022
    committed Jul 26, 2017
  2. Add spike in quantification to the fast RNA-seq pipeline.

    Closes #1709. Thanks to @schelhorn for the suggestion.
    roryk committed Jul 26, 2017
  3. GATK4: support for VQSR and MuTect2 fixes

    - Document usage of GATK4 with tools on
    - Add support for VQSR, updating to latest command line
    - Avoid problematic MuTect2 BaseQualityRankSumTest annotation with
      latest GATK4 beta.2 release.
    committed Jul 26, 2017
Commits on Jul 25, 2017
  1. Tests: bump genome tarball to fix space issues

    Trying to get TravisCI working again with less space for tests
    (no STAR index for mm9).
    committed Jul 25, 2017
  2. Bump maximum number of splice junctions to 2 million for STAR.

    For super deep libraries, the default wasn't inadequate. Thanks
    to Manasa Surakala for pointing out the problem.
    roryk committed Jul 25, 2017
  3. Remove mm9 STAR index from integration test dependencies.

    Each STAR index uses ~1.5 GB, which was mucking up the ability to
    do automatic tests.
    roryk committed Jul 25, 2017
Commits on Jul 24, 2017
  1. Remove errant print statement.

    roryk committed Jul 24, 2017