STAR 2.6.1d 2018/11/16

  • Fixed the problem causing BAM sorting error with large number of threads and small ulimit -n (github.com/alexdobin/STAR/issues/512).
  • Fixed the bug causing inconsistent output for mate1/2 in the Unmapped files (github.com/alexdobin/STAR/issues/222).
  • Fixed the non-thread-safe error/exit (github.com/alexdobin/STAR/issues/514), and the non-safe file size check (github.com/alexdobin/STAR/issues/516).
  • Many thanks to Paul Menzel for helping to track and fix these problems.

STAR 2.6.1c 2018/10/17

  • Enforced the consistent choice of supplementary chimeric alignments for overlapping mates.

STAR 2.6.1b 2018/09/06

  • Fixed a problem with --outSAMfilter KeepOnlyAddedReferences option.
  • Fixed a problem with output of an empty sorted BAM.

STAR 2.6.1a 2018/08/14

  • Process substitution can now be used with zipped VCF files, e.g. --varVCFfile <(zcat vcf.gz)
  • Implemented fatal error exception if no SNPs are found in VCF files.
  • Implemented --chimOutJunctionFormat 1 option to output some metadata (command lines and basic mapping statistics) at the end of Chimeric.out.junction file.
  • The default value of --peOverlapMMp is reduced to 0.01 for less aggressive mate merging.
  • Fixed the problem with control characters (ASCII<32) in genome and input read sequences. They used to be converted to N, now they are removed.
  • Fixed a bug that caused serious problems with --sjdbInsertSave All option.
  • Fixed a bug in merging mates (--peOverlap*) algorithm that was causing rare seg-faults.
  • Fixed the GstrandBit problem.
  • Fixed a bug with multiple RG lines when inputting reads in SAM format.
  • Fixed a bug causing seg-faults with shared memory and --outStd options.
  • Fixed a bug with --outTmpDir and fifo files.

STAR 2.6.0c 2018/05/10

  • Fixed bugs in merging mates (--peOverlap*) and WASP filtering algorithms. Please see CHANGES and RELEASEnotes from 2.6.0a.

STAR 2.6.0b 2018/05/02

  • Fixed bugs introduced in 2.6.0a. Please see CHANGES and RELEASEnotes from 2.6.0a.

STAR 2.6.0a 2018/04/23

Major new features:

  • Merging and mapping of overlapping paired-end reads with new options --peOverlapNbasesMin and --peOverlapMMp. The development of this algorithm was supported by Illumina, Inc. Many thanks to June Snedecor, Xiao Chen, and Felix Schlesinger for their extensive help in developing this feature.
  • --varVCFfile option to input variant VCF file.
  • New SAM attributes in the --outSAMattributes, vG, vA, and vW to report variants overlapping alignments.
  • --waspOutputMode option for filtering allele specific alignments. This is a re-implementation of the original WASP algorithm by Bryce van de Geijn, Graham McVicker, Yoav Gilad & Jonathan K Pritchard. Please cite the original WASP paper: Nature Methods 12, 1061–1063 (2015), https://www.nature.com/articles/nmeth.3582 . Many thanks to Bryce van de Geijn for fruitful discussions.
  • Detection of multimapping chimeras, with new options --chimMultimapNmax, --chimMultimapScoreRange and --chimNonchimScoreDropMin . Many thanks to Brian Haas for testing and feedback.

Minor new features:

  • --alignInsertionFlush option which defines how to flush ambiguous insertion positions: None: old method, insertions are not flushed; Right: insertions are flushed to the right.
  • --outSAMtlen option to select the calculation method for the TLEN field in the SAM/BAM files.
  • --outBAMsortingBinsN option to control the number of sorting bins. Increasing this number reduces the amount of RAM required for sorting.

STAR 2.5.4b 2018/02/09

  • Recompiled Linux executables with the correct version tag.
  • Updated manual.

STAR 2.5.4a 2018/01/23

New features:

  • Implemented read group ID output as the last column of the Chimeric.out.junction file.
  • Implemented --readFilesPrefix option for specifying prefix (e.g. directory path) for the file names in --readFilesIn .
  • Implemented standard SAM attribute "MC" to output the mate's CIGAR. Add MC to the list of attributes in the --outSAMattributes option.
  • Implemented the ability to input the reads from unmapped SAM/BAM file: --readFilesType SAM SE[PE] for single-end [paired-end] reads to read from the SAM file specified, as usual in --readFilesIn. For BAM files, in addition, specify --readFilesCommand samtools view -h .
  • Implemented --seedSplitMin option which was previously hardcoded at 12. This will allow mapping of mates shorter than 12nt.
  • Implemented --outFilterIntronStrands None option to switch off filtering by strand consistency of junctions.
  • Added new scripts in extras/scripts/: mergeLogFinal.awk, mergeSuperContig.awk, sjMotif.m

Bug fixes:

  • Fixed a bug in chimeric detection code which sometimes led to uninitialized memory access. The chimeric output may change for a very small number of reads.
  • Fixed a problem with --alignEndsProtrude implementation which prevented the output of alignments with protruded ends.
  • Fixed a bug which set non-primary bit 0x100 in the SAM FLAG for unmapped mates.
  • Fixed a bug in liftOver command that output an extra field in the GTF file.
  • Fixed a problem that can arise for very small genomes while using --alignIntronMax 1.

STAR 2.5.3a 2017/03/17

  • Fixed occasional seg-faults after the completion of the mapping runs with shared memory.
  • Implemented --genomeFileSizes option to supply sizes of the genome index files. This allows for streaming of index files.
  • Implemented extra references input in the SAM/BAM header from user-created "extraReferences.txt" file in the genome directory.
  • Implemented --chimOutType HardClip OR SoftClip options to output hard (default) / soft clipping in the BAM CIGAR for supplementary chimeric alignments.
  • Implemented --chimMainSegmentMultNmax parameters, which may be used to prohibit chimeric alignments with multimapping main segments to reduce false positive chimeras.
  • Implemented new SAM attribute 'ch' to mark chimeric alignments in the BAM file for --chimOutType WithinBAM option.
  • Fixed a problem with RNEXT field in the Chimeric.out.sam file: RNEXT now always points to the other mate start.
  • Implemented --bamRemoveDuplicatesType UniqueIdenticalNotMulti option, which (unlike the UniqueIdentical option) will NOT mark multi-mappers as duplicates.
  • For --bamRemoveDuplicatesType UniqueIdentical, the unmapped reads are no longer marked as duplicates.

STAR 2.5.2b 2016/08/19

  • Fixed a problem with --outSAMmultNmax 1 not working for transcriptomic output.
  • Fixed a bug with chimeric BAM output for --chimOutType WithinBAM option.
  • Fixed a bug that could cause non-stable BAM sorting if the gcc qsort is unstable.
  • Fixed a bug causing seg-faults when combining --twopassMode Basic --outSAMorder PairedKeepInputOrder .
  • Fixed a problem with SAM header in cases where reference sequences are added at the mapping stage.

STAR 2.5.2a 2016/05/10

  • Fixed the "GstrandBit" problem.
  • Fixed a bug introduced in 2.5.1a that caused problems with single-end alignments output in some cases.
  • Fixed a bug that can cause STARlong seg-faults in rare cases.
  • Fixed a bug that caused output of unmapped mates for single end alignments even with --outSAMunmapped None .
  • Implemented --winReadCoverageRelativeMin and --winReadCoverageBasesMin to control coverage of the alignment windows for STARlong.
  • Implemented --outSAMfilter KeepAllAddedReferences option which will keep all alignments to the added references.
  • Implemented --alignEndsProtrude option to control output of alignments with protruding ends.
  • Implemented --outTmpKeep All option to keep the temporary files.
  • Implemented --alignEndsType Extend5pOfReads12 option for full extension of 5' ends of both mates.

STAR 2.5.1b 2016/01/22

  • Fixed a bug in signal generation with --outWigType introduced in 2.5.1a

STAR 2.5.1a 2016/01/19

  • Fixed a bug in --quantMode TranscriptomeSAM that prevented output to Aligned.toTranscriptome.out.bam of the reads mapped to the very last annotated transcript.
  • Cleaned up the code to remove compilation warnings (thanks to github.com/yhoogstrate).
  • Implemented --outSAMunmapped Within KeepPairs option to record the unmapped mate adjacent to the mapped one, in case single-end alignments are allowed. For multi-mappers, the unmapped mate will be recorded multiple times, adjacent to the mapped mate of each alignment.

STAR 2.5.0c 2015/12/23

  • Implemented --genomeSuffixLengthMax option to control max suffix length at the genome generation step.
  • Fixed a bug that caused genome generation stalling in some cases.
  • In Aligned.toTranscriptome.out.bam (--quantMode TranscriptomeSAM), the non-primary SAM flag is assigned to all but one randomly selected alignment.
  • Fixed a bug that filtered out some chimeric junctions.
  • Fixed a bug that prevented chimeric output for some of the "circular" configurations.

STAR 2.5.0b 2015/11/30

Bug-fix release:

  • Fixed a problem with non-primary alignment flags with --outSAMmultNmax option.
  • Added counting of chimeric reads into Log.final.out .
  • Fixed a bug in --outSAMfilter KeepOnlyAddedReferences.
  • Fixed a minor bug that caused rare seg-faults.
  • Fixed a minor bug in STARlong extension at the ends of the read.
  • Fixed a seg-fault that occurred when non-default value of --genomeChrBinNbits was used.
  • Fixed a seg-fault that occurred when junctions were inserted after inserting reference sequences.

STAR 2.5.0a 2015/11/06

STAR now uses essential c++11 features and requires gcc 4.7.0 or later.

Major new features:

  • Implemented on the fly insertion of the extra sequences into the genome indexes.
  • Implemented --outSAMmultNmax parameter to limit the number of output alignments for multimappers.
  • Implemented --outMultimapperOrder Random option to output multiple alignments in random order. This also randomizes the choice of the primary alignment. Parameter --runRNGseed can be used to set the random generator seed. With this option, the ordering of multi-mapping alignments of each read, and the choice of the primary alignment will vary from run to run, unless only one thread is used and the seed is kept constant.
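The reproducibility caveat for --outMultimapperOrder Random can be illustrated with a minimal Python sketch. This is not STAR's code: the function name, the use of Python's random module, and the convention that the first shuffled alignment becomes primary are all assumptions made for illustration. It only shows why a fixed seed and a single generator give the same order on every run.

```python
import random


def order_multimappers(alignments, seed):
    """Return (alignment, is_primary) pairs in a seeded random order.

    Hypothetical helper: the seed plays the role of a --runRNGseed value,
    and the first alignment after shuffling is treated as primary.
    """
    rng = random.Random(seed)    # private generator, so the seed fully determines the order
    shuffled = list(alignments)  # copy, leaving the caller's list untouched
    rng.shuffle(shuffled)
    return [(aln, index == 0) for index, aln in enumerate(shuffled)]


if __name__ == "__main__":
    for aln, is_primary in order_multimappers(["locus1", "locus2", "locus3"], seed=777):
        print(aln, "primary" if is_primary else "secondary")
```

With several threads, each drawing from its own generator at scheduling-dependent times, the sequence of draws seen by a given read is no longer fixed, which is consistent with the note that results vary unless one thread and a constant seed are used.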

Minor new features:

  • Implemented --outSAMattrIHstart parameter. Setting it to 0 may be required for compatibility with downstream software such as Cufflinks or StringTie.
  • Implemented --outSAMfilter KeepOnlyAddedReferences option.
  • Implemented --help option - thanks to @yhoogstrate for the code.
  • Implemented --alignEndsType Extend3pOfRead1 option for full extension of the 3' end of read 1.
  • Implemented --alignSJstitchMismatchNmax option to allow for mismatches around non-canonical junctions.
  • Implemented --chimSegmentReadGapMax parameter which defines the maximum gap in the read sequence between chimeric segments. By default it is set to 0 to replicate the behavior of the previous STAR versions.
  • Implemented --chimFilter banGenomicN | None options to prohibit or allow the N characters in the vicinity of the chimeric junctions. By default, they are prohibited - the same behavior as in the previous versions.

Bug fixes:

  • For STARlong, increased compilation-time max read length to 500000 and max number of exons to 1000
  • Fixed a bug which caused problems in some cases of genome generation without annotations.
  • Fixed a bug in the --alignEndsType Extend5pOfRead1 option.

Code improvements:

  • Improved compilation flags handling in Makefile - thanks to Christian Krause for the code.
  • Improved treatment of the streams and files - thanks to Alex Finkel for the code.
  • Merged pull request from Nathan S. Watson-Haigh: Makefile for manual; Travis-CI automated build; update STAR-Fusion submodule to v0.3.1.
  • Merged pull request from Alex Finkel to allow 'parameter=value' option formatting, e.g. --runThreadN=8.
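As a rough illustration of the 'parameter=value' formatting, the sketch below normalizes a command line so that --runThreadN=8 and --runThreadN 8 look the same to a downstream parser. It is a Python sketch under assumptions, not the merged C++ code: the function name is hypothetical, and splitting on the first "=" is simply one reasonable way to do it.

```python
def split_equals_options(argv):
    """Expand '--name=value' tokens into separate '--name' and 'value' tokens."""
    normalized = []
    for token in argv:
        if token.startswith("--") and "=" in token:
            # Split on the first '=' only, so values that contain '=' stay intact.
            name, value = token.split("=", 1)
            normalized.extend([name, value])
        else:
            normalized.append(token)
    return normalized


if __name__ == "__main__":
    print(split_equals_options(["--runThreadN=8", "--outSAMtype", "BAM"]))
```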

STAR 2.4.2a 2015/06/19

  • Implemented --quantMode GeneCounts option for counting number of reads per gene, similar to htseq-count.
  • STARlong: fixed --outFilterIntronMotifs and --outSAMstrandField options.
  • Yet another fix for --sjdbOverhang logic.
  • Added an error message when shared memory and on the fly junction insertion are used together.
  • Fixed a bug causing unnecessary 1 base soft-clipping in rare cases with sparse suffix array.
  • Fixed a bug that caused problems with junction motifs in rare cases. Very few alignments affected, <1 per million.

STAR 2.4.1d 2015/05/19

  • Fixed problems with --sjdbOverhang default and user-defined values.
  • Fixed problems with occasional non-adjacent output of multiple alignments into the unsorted BAM file and transcriptome BAM file.
  • Fixed a bug causing seg-faults when shared memory options in --genomeLoad are used with --outStd SAM.
  • Fixed a bug causing seg-faults for small values of --limitIObufferSize.
  • Added STARlong pre-compiled executables.
  • Fixed very minor issues with filtering into SJ.out.tab .
  • Fixed some bugs in STARlong mapping algorithm.
  • Fixed --outFilterType BySJout filtering for STARlong.
  • Fixed XS attributes in STARlong.
  • Added --runDirPerm option for permissions of run-time directories.

STAR 2.4.1c 2015/04/24

  • Added latest version of STAR-Fusion as a separate directory.
  • Fixed some compilation problems introduced in 2.4.1b.
  • Added Mac executable.

STAR 2.4.1b 2015/04/23

  • Fixed a bug introduced in 2.4.1a causing serious problems for genomes generated without annotations. If you generated a genome without annotations with 2.4.1a please re-generate it.
  • Fixed a bug causing seg-faults when generating genomes with a large (>500k) number of junctions.
  • Fixed a bug causing seg-faults with --chimOutType WithinBAM for single-end reads.
  • Fixed a bug with required --sjdbOverhang at the mapping step.

STAR 2.4.1a 2015/04/17

  • The annotations can now be included on the fly at the mapping step, without including them at the genome generation step.
  • New option to activate on the fly "per sample" 2-pass method: "--twopassMode Basic".
  • 2-pass mode can now be used with annotations, which can be included either at the run-time, or at the genome generation step.
  • Included link (submodule) to Brian Haas' STAR-Fusion code for detecting fusion transcript from STAR chimeric output: https://github.com/STAR-Fusion/STAR-Fusion
  • Included Gery Vessere's shared memory implementation for POSIX and SysV. To compile STAR with POSIX shared memory, use make POSIXSHARED
  • New option "--chimOutType WithinBAM" to include chimeric alignments together with normal alignments in the main (sorted or unsorted) BAM file(s).
  • New option "--quantTranscriptomeBan Singleend" allows insertions, deletions and soft-clips in the transcriptomic alignments, which are allowed by some expression quantification software (e.g. eXpress).
  • New option "--alignEndsType Extend5pOfRead1" to enforce full extension of the 5' end of read 1, while all other ends undergo local alignment and may be soft-clipped.

2.4.0k 03/30/2015

  • Implemented new BAM sorting algorithm that reduces significantly the required RAM.

2.4.0j 02/04/2015

  • Fixed a problem with scoring alignments for STARlong. STARlong alignments are slightly modified.
  • Fixed a bug introduced in 2.4.0i that dropped a large number of alignments for --quantMode TranscriptomeSAM. Transcriptome alignments are now the same as in version 2.4.0h.
  • Fixed a problem with lower case read sequences for --outSAMtype BAM options.
  • Fixed a bug preventing parameter value to be "-".
  • Fixed --genomeLoad LoadAndRemove option.

2.4.0i 01/14/2015

  • Fixed a bug with the _STARtmp temporary directory name for the 2-pass runs.
  • Fixed a bug causing seg-faults for genome generation.
  • Fixed a bug causing seg-faults for --quantMode TranscriptomeSAM

2.4.0h 12/09/2014

  • Fixed the problem causing Ubuntu error: "sh: 1: Syntax error: Bad fd number".
  • Added --quantTranscriptomeBAMcompression option.
  • Add newline at the end of STAR_VERSION string (contributed by Daniel Nicorici).
  • Fixed a bug with parsing the last line of paired FASTA files (contributed by Alex Rolfe).

2.4.0g 11/26/2014

  • Fixed a bug with output score (AS attribute) of some chimeric alignments.
  • Added --alignSoftClipAtReferenceEnds No option which prevents soft clipping of alignments at the reference (chromosome) ends, for compatibility with Cufflinks/Cuffmerge.
  • Fixed wrong output of chimeric junctions boundaries in the Chimeric.out.junction file.
  • Added --outSAMflagOR, --outSAMflagAND options to set specific bits of the SAM FLAG.
  • --sjdbFileChrStartEnd can now accept multiple files which will be concatenated.
  • Fixed the header of the Log.progress.out .
  • Fixed a bug that prevented output of transcriptomic alignments (--quantMode TranscriptomeSAM) with 1 base junction overhangs.
  • Added --sysShell option to specify path to bash, in cases where bash is not the default shell.
  • --outBAMcompression default changed to 1, which apparently does not change deflation ratio, but is much faster.
  • Added --outBAMsortingThreadN option to specify number of threads for BAM sorting. By default (0) it's equal to min(6,runThreadN).
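The default rule for --outBAMsortingThreadN stated above can be written out as a small Python sketch. The function and argument names are hypothetical; only the rule itself (0 means min(6, runThreadN)) comes from the entry.

```python
def bam_sorting_threads(out_bam_sorting_thread_n, run_thread_n):
    """Resolve the number of BAM sorting threads.

    A value of 0 selects the default, min(6, run_thread_n);
    any other value is used as given.
    """
    if out_bam_sorting_thread_n == 0:
        return min(6, run_thread_n)
    return out_bam_sorting_thread_n


if __name__ == "__main__":
    print(bam_sorting_threads(0, 16))  # default is capped at 6 threads
    print(bam_sorting_threads(0, 4))   # default follows the run thread count
```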

2.4.0f1 10/30/2014

  • Added read group (RG) BAM attributes to transcriptome BAM (contributed by https://github.com/godotgildor).
  • Fixed a bug with double ID field in the RG header line (contributed by https://github.com/godotgildor).
  • Fixed a bug in the 2-pass method (--twopass1readsN).
  • Fixed a problem with RAM allocation for BAM sorting.

2.4.0e 10/24/2014

  • Added manual in PDF.
  • New sub-directories: source, bin, doc.
  • Output more information about read files into Log.out.
  • Fixed some issues that may have caused dropping of multiple reads files.
  • Added more thorough error checking for genome generation.
  • Fixed a bug causing seg-faults with single-mate alignments for BAM sorting.
  • Fixed some compilation issues on Mac OS X. Note that the default Clang lacks openMP support which is required for STAR compilation.
  • Added Mac OS X executable.

2.4.0d 09/25/2014

  • Added .gitignore.
  • Fixed the problem with 2nd field in the read name shorter than 3 bases (non-Illumina fastq).
  • Added --outBAMcompression option.
  • Added --bamRemoveDuplicatesType and --bamRemoveDuplicatesMate2basesN options.
  • Added --outWigType wiggle read1_5p read2 options.
  • Added --outWigNorm option.

2.4.0c 09/07/2014

  • Automated git version.
  • Fixed a problem with overflowing SJ buffer.
  • Implemented options --twopass1readsN, --twopassSJlimit, --readMapNumber.

2.4.0b 08/29/2014

  • Fixed problems with spaces in --outFileNamePrefix.
  • Fixed version information.

2.4.0a 08/11/2014

  • Implemented --outFilterMismatchNoverReadLmax option for a more consistent control of mismatches.

2.3.1z16 08/05/2014

  • Implemented --outWigReferencesPrefix option to filter references in the signal output.
  • Implemented --runMode inputAlignmentsFromBAM --inputBAMfile

2.3.1z15

  • Implemented --outWigType bedGraph read1_5p option.
  • Fixed a problem with chimeric alignments with overlapping segments.
  • Fixed a problem with processing of fasta read input.

2.3.1z14 07/24/2014

  • Implemented 0x200 SAM flag for reads that did not pass Illumina filtering (i.e. contain “Y” as the 3rd character in the second field of the read name).
  • Implemented comma-separated lists in the --outSAMattrRGline read groups that will assign different read groups to multiple comma-separated read files in --readFilesIn.
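The 0x200 rule from 2.3.1z14 can be sketched in Python as follows. This is an illustration under assumptions, not STAR's code: the function name is hypothetical, the read name is assumed to have whitespace-separated fields as in Illumina FASTQ headers such as "@r1 1:Y:18:ATCACG", and a second field shorter than 3 characters is simply left unflagged.

```python
QC_FAIL = 0x200  # SAM FLAG bit for reads not passing filters (platform/vendor quality checks)


def apply_qc_fail_flag(read_name, flag):
    """Set bit 0x200 when the 3rd character of the second read-name field is 'Y'."""
    fields = read_name.split()
    # Guard against names with no second field or one shorter than 3 characters.
    if len(fields) >= 2 and len(fields[1]) >= 3 and fields[1][2] == "Y":
        flag |= QC_FAIL
    return flag


if __name__ == "__main__":
    print(hex(apply_qc_fail_flag("@r1 1:Y:18:ATCACG", 0)))
    print(hex(apply_qc_fail_flag("@r1 1:N:18:ATCACG", 0)))
```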

2.3.1z13 07/04/2014

  • Fixed problems with STARlong.

2.3.1z12 06/30/2014

  • Fixed problems with SAM/BAM output to stdout.

2.3.1z11 06/27/2014

  • Switched to htslib samtools library.
  • Fixed problem with indel near known splice junctions.
  • Fixed problem with FASTA reads input.

2.3.1z10 06/20/2014

  • Fixed problem with compilation, samtools/ZLIB related.

2.3.1z9 06/19/2014 2.3.1z8 2.3.1z7

  • Fixed problems with transcriptomic output.
  • Changed --sjdbFileChrStartEnd importing to allow direct import from SJ.out.tab

2.3.1z6 05/30/2014 2.3.1z5 05/30/2014

  • Fixed a bug causing problems with multiple zipped input files.
  • Preliminary release of BAM sorting and wiggle output.

2.3.1z4 05/06/2014

  • Preliminary release with transcriptome output.

2.3.1z2 04/29/2014

  • Fixed a bug causing problems in some chimeric alignments.
  • Fixed a bug causing overflowing of SAM ISIZE.
  • Fixed chimeric output problems with --outFilterType BySJout option.
  • Added extra Log.out messages for multi-threaded jobs.

2.3.1z1 03/13/2014

  • SAM header changes: removed "cl:" attribute from the @PG line, output it as a separate comment line; added --outSAMheaderHD, --outSAMheaderPG, --outSAMheaderCommentFile options.
  • Added --outTmpDir, which sets the path for the STAR temporary directory independent of --outFileNamePrefix.

2.3.1z 02/05/2014

  • Fixed the incorrect behavior of --genomeLoad LoadAndRemove option.

2.3.1y 01/24/2014

  • Added read group SAM attribute via --outSAMattrRGline parameter.
  • Fixed gcc 4.7.0 compilation problem.
  • Corrected reverse complementarity of all IUPAC nucleotide codes in the SAM output.
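To show what reverse complementing with IUPAC ambiguity codes involves, here is a minimal Python sketch. It is not STAR's implementation; the table follows the standard IUPAC nucleotide code definitions, and the function name is hypothetical.

```python
# Complement of each IUPAC nucleotide code (the set of bases it stands for is complemented).
IUPAC_COMPLEMENT = {
    "A": "T", "C": "G", "G": "C", "T": "A",
    "R": "Y", "Y": "R",  # A/G <-> C/T
    "S": "S", "W": "W",  # C/G and A/T are their own complements
    "K": "M", "M": "K",  # G/T <-> A/C
    "B": "V", "V": "B",  # C/G/T <-> A/C/G
    "D": "H", "H": "D",  # A/G/T <-> A/C/T
    "N": "N",
}


def reverse_complement(seq):
    """Return the reverse complement of an upper- or lower-case IUPAC sequence."""
    return "".join(IUPAC_COMPLEMENT[base] for base in reversed(seq.upper()))


if __name__ == "__main__":
    print(reverse_complement("AARN"))
```

A reverse-strand alignment is reported in SAM with the read sequence reverse complemented, so an ambiguity code such as R has to become Y rather than being passed through unchanged.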

2.3.1x 01/08/2014

  • Fixed a bug with --alignEndsType EndToEnd.

2.3.1v 12/21/2013

  • Added --alignEndsType EndToEnd option to align reads end-to-end, i.e. prohibit soft-clipping.
  • --outSAMattributes now allows specifying the SAM attributes in the desired combination and order.
  • Implemented standard (samtools-like) NM and MD tags.
  • Added --outSAMmapqUnique parameter (=255 by default), MAPQ value for unique mappers.

2.3.1u 11/23/2013

  • Added --outSAMreadID={Standard,Number} parameter to output numbered read IDs.
  • Added --outSAMmode NoQS option to suppress output of quality scores.

2.3.1t 11/20/2013

  • Fixed a bug that prevented alignment to the very beginning of the first reference.

2.3.1s 11/06/2013

  • Fixed a bug that produced incorrect placement of short deletions.

2.3.1r 10/01/2013

  • Compilation option to output "local alignment chains".
  • Compilation option to output suffix array as a text file.

2.3.1q 08/15/2013

  • Fixed a problem with junction overhang in SJ.out.tab file for overlapping mates.

2.3.1p 04/13/2013

  • Fixed GCC 4.7 compatibility problems.
  • Changed min memory requirement for genome generation.

2.3.1o 04/13/2013

  • Fixed a bug with comma separated lists of input files.

2.3.1n 04/30/2013

  • Replaced incorrectly released 2.3.1m.

2.3.1m 04/24/2013

  • Fixed a bug which in some cases caused problems with long reads.

2.3.1l 04/15/2013

  • Fixed a problem with --readFilesCommand.

2.3.1k 04/15/2013

  • Fixed chimeric output for single-end reads.
  • Fixed a problem with --outFilterScoreMinOverLread 0 --outFilterMatchNminOverLread 0 settings.

2.3.1j 04/11/2013

  • Allowed spaces in paths: paths that contain spaces should be quoted with " ". Thanks to Tyler Hyndman for suggesting this patch.

2.3.1i 04/10/2013

  • Fixed problems with overflowing SJ buffer, new input parameters: --limitOutSJcollapsed, --limitOutSJoneRead.

2.3.1h 04/02/2013

  • Prevent output of strangely overlapping mates as chimeras.
  • Report an error if --sjdbOverhang=0 while --sjdbFileChrStartEnd or --sjdbGTFfile is set.

2.3.1g 03/29/2013

  • Implemented detection of proximal same-strand chimeras. Now it is possible to detect circular RNA (considered "chimeric").
  • More accurate treatment of overlapping mates.

2.3.1f 03/21/2013

  • New option --outSAMorder PairedKeepInputOrder to output alignments in the same order as they appear in the input FASTQ/A files.

2.3.1e 03/18/2013

  • Fixed possible problems with multi-threaded runs for small files which could have caused empty Chimeric.* and Unmapped.* output on some systems.

2.3.1d 03/17/2013

  • New option --outSAMprimaryFlag AllBestScore for marking all alignments with the best score as primary.
  • New parameter --limitOutSAMoneReadBytes 100000, limits the size of one SAM entry - important when a large number of multimappers is recorded.
  • Fixed a possible problem with Unmapped.* and Chimeric.* output which could generate empty or truncated output on some systems.
  • Coded a safer removal of the temporary directory _tmp which could have failed on some systems.
  • Fixed a bug which resulted in unexpected behavior for --alignIntronMax < 7.

2.3.1c 03/01/2013

  • Fixed a bug which duplicated output in Chimeric.* and Unmapped.* when --outFilterType BySJout option is used.

2.3.1b 02/28/2013

  • Fixed possible issue which in some cases could have resulted in empty Chimeric.out.*

2.3.1a 02/25/2013

  • Fixed incorrect processing of --sjdbGTFchrPrefix.

2.3.0e