-
-
Only run qualimap for standard RNA-seq projects.
Thanks to @lpantano for pointing out the issue.
-
-
Move transcriptome FASTA files to the final directory.
This lets us know exactly what we quantitated against for a particular project.
-
-
miraligner: ensure local anaconda java used
Reworks java path detection and usage to try and ensure we use anaconda installed Java 8. Fixes #2029
-
archive: add option for cram-lossless
Do conversion to CRAM without quality score binning. Thanks to Alison Meynert.
-
Use gffread instead of gtf_to_fasta to make transcriptome indexes.
-
log: attempt to retrieve IP from domain name
Work around issues on machines with multiple IPs and no valid way to resolve by initially checking, then falling back to previous approaches if it's a local reference.
-
Add support for bioconda plink.
bioconda plink is 1.90 but the exectuable is named plink2.
-
Use atropos for multicore streaming read trimming
Replaces cutadapt as default read trimmer. atropos (https://atropos.readthedocs.io/en/latest/) is a drop in replacement with multithreading and streaming outputs to bgzip. bcbio implementation uses the same paramters as previous cutadapt approach. - Removes extra trim and double grabix prep steps for alignment pre-trimming, avoiding IPython issues. - Removes old align_prep_method configuration parameter.
-
samtools multicore BAM indexing
Replaces sambamba indexing with new native samtools multicore indexing, fixed in 1.5 release (samtools/samtools#687). Avoids intermittent system specific segfaults with sambamba.
-
GATK4: GenomicsDBImport parallelization
Add multicore reading introduced in beta3 release.
-
-
Tumor-only prioritization: use vcfanno
Swap to using vcfanno instead of GEMINI for retrieving metrics for tumor-only prioritization and germline estimation. Avoids needing a full GEMINI database for assessment.
-
-
GATK4 BQSR: Provide reference target for 2bit
Parallel BQSR in GATK4 requires 2bit formatted genomes. Provide slot and correct lookup of these in preparation for installing them as part of bcbio.
-
Avoid confusing error message if aligner: true
Avoids a resource calculation error before the code that correctly identifies and provides a useful error message for this error. Fixes #2024
-
-
merging: add GATK4 GatherVcfs and fix header issue
- Adds GATK4 GatherVcfs, a new rapid way in GATK to combine VCFs similar to bcftools concat --naive. Fixes #2023 - fixes issues with initial scattered contigs from chrM. These are haploid called so don't have PID phasing key/value pairs, causing missing header problems. Also fixes #1999 which has key errors due to checking for GATK variantcallers, which we no longer need to do.
-
Max coverage downsample: move to streaming
Stream to maximum coverage downsampling during merging for align_split samples or after sorting for non split. Avoids slow runtimes for separate BAM downsampling. Fixes #2022
-
Add spike in quantification to the fast RNA-seq pipeline.
Closes #1709. Thanks to @schelhorn for the suggestion.
-
GATK4: correctly pass memscale to joint callers
Thanks to @NeillGibson. Fixes #2021
-
GATK4: support for VQSR and MuTect2 fixes
- Document usage of GATK4 with tools on - Add support for VQSR, updating to latest command line - Avoid problematic MuTect2 BaseQualityRankSumTest annotation with latest GATK4 beta.2 release.
-
Tests: bump genome tarball to fix space issues
Trying to get TravisCI working again with less space for tests (no STAR index for mm9).
-
Bump maximum number of splice junctions to 2 million for STAR.
For super deep libraries, the default wasn't inadequate. Thanks to Manasa Surakala for pointing out the problem.
-
Remove mm9 STAR index from integration test dependencies.
Each STAR index uses ~1.5 GB, which was mucking up the ability to do automatic tests.
-
-
-
-
-
-
Add function to check if a BAM file is empty.
roryk committedJul 20, 2017 -
Set comments to 80 character length.
roryk committedJul 20, 2017