Releases: HumanCellAtlas/sctools

Metric computation speed-up

30 Aug 19:42
047be89
Pre-release

Metric computation is now performed directly on the BAM files.

Updates the fastqpreprocess with uint64

03 Nov 17:26
65ac1a0

We disabled the BGZF file EOF check to accommodate files that do not conform to this requirement.
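For context, the BGZF format used by BAM files defines a fixed 28-byte empty block as its end-of-file marker, and readers often check for it to detect truncated files. The sketch below only illustrates what such a check looks at. It is not taken from the sctools code, and the function name `has_bgzf_eof` is hypothetical.

```python
import os

# The 28-byte empty BGZF block that the SAM/BAM specification defines as the EOF marker.
BGZF_EOF = bytes.fromhex(
    "1f8b0804" "00000000" "00ff0600" "42430200"
    "1b000300" "00000000" "00000000"
)


def has_bgzf_eof(path: str) -> bool:
    """Return True if the file ends with the BGZF EOF marker block."""
    if os.path.getsize(path) < len(BGZF_EOF):
        return False
    with open(path, "rb") as handle:
        # Read only the trailing bytes of the file and compare them to the marker.
        handle.seek(-len(BGZF_EOF), os.SEEK_END)
        return handle.read() == BGZF_EOF
```

A file that fails this test may still contain usable records, which is the situation this release accommodates by skipping the check.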

FastqProcessing

28 Sep 20:39
fd352ae

This release adds the FastqProcessing step, written in C++.

FastqProcessing code provides more information on output

23 Oct 14:58

Input options are now checked for valid values, and input files are checked for existence.

SplitBamByCellBarcode writes the intermediate file in SAM instead of BAM

28 Jul 18:03
f82f789

SplitBamByCellBarcode now writes its intermediate file in SAM instead of BAM. This speeds up the step but requires machines with more disk space.

Added cell metrics for mitochondrial gene

21 May 14:52
0a8c4df

Added cell metrics related to mitochondrial genes. For each cell, the following metrics are added:

  • n_mitochondrial_genes: the number of mitochondrial genes
  • n_mitochondrial_molecules: the number of molecules from mitochondrial genes, i.e., sum of the counts from mitochondrial genes
  • pct_mitochondrial_molecules: percentage of n_mitochondrial_molecules in terms of the total number of molecules for the cell across all genes
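The three metrics above can be sketched from a single cell's per-gene molecule counts. This is a minimal illustration, not the sctools implementation. The function name `mitochondrial_metrics` and the caller-supplied set of mitochondrial gene names are assumptions, as is the choice to count only genes with at least one molecule toward n_mitochondrial_genes.

```python
from typing import Dict, Set


def mitochondrial_metrics(gene_counts: Dict[str, int], mito_genes: Set[str]) -> Dict[str, float]:
    """Compute per-cell mitochondrial metrics.

    gene_counts maps gene name -> molecule count for one cell.
    mito_genes is an assumed, caller-supplied set of mitochondrial gene names.
    """
    # Keep only mitochondrial genes that were actually observed in this cell.
    mito_counts = {g: c for g, c in gene_counts.items() if g in mito_genes and c > 0}
    n_molecules = sum(mito_counts.values())
    total_molecules = sum(gene_counts.values())
    # Guard against empty cells to avoid division by zero.
    pct = 100.0 * n_molecules / total_molecules if total_molecules else 0.0
    return {
        "n_mitochondrial_genes": len(mito_counts),
        "n_mitochondrial_molecules": n_molecules,
        "pct_mitochondrial_molecules": pct,
    }
```

For example, a cell with 4 mitochondrial molecules out of 10 total would have a pct_mitochondrial_molecules of 40.0.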

v0.3.7

24 Mar 19:02
dc07854

Added functionality to ignore multi-gene annotations, introduced by Drop-seq tools 2.3.0, required for snRNA Seq

To annotate intronic alignments, Optimus uses a newer version of Drop-seq tools, 2.3.0. In this version, the gn tag (GE in earlier versions) can hold multiple gene names as a single comma-separated string. The CreateCountMatrix command in sctools must ignore such alignments when building the count matrix, and this release changes the code to do so.

This release also corrects the Dockerfile associated with v0.3.6.

v0.3.6

19 Mar 15:03
aaed0b9

Added functionality to ignore multi-gene annotations, introduced by Drop-seq tools 2.3.0, required for snRNA Seq

To annotate intronic alignments, Optimus uses a newer version of Drop-seq tools, 2.3.0. In this version, the gn tag (GE in earlier versions) can hold multiple gene names as a single comma-separated string. The CreateCountMatrix command in sctools must ignore such alignments when building the count matrix, and this release changes the code to do so.
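The filtering rule described here can be sketched as follows. This is an illustration under simplifying assumptions, not the actual CreateCountMatrix code: gene tag values are passed as plain strings (or None when the tag is absent), the helper names are hypothetical, and the tally ignores the cell barcode and molecule handling that a real count matrix needs.

```python
from collections import Counter
from typing import Iterable, Optional


def is_single_gene(tag_value: Optional[str]) -> bool:
    """Return True only for a gene tag value that names exactly one gene."""
    if not tag_value:
        return False  # missing or empty tag: nothing to count
    return "," not in tag_value  # a comma means a multi-gene annotation


def count_single_gene_alignments(tag_values: Iterable[Optional[str]]) -> Counter:
    """Tally alignments per gene, skipping multi-gene and untagged alignments."""
    counts: Counter = Counter()
    for value in tag_values:
        if is_single_gene(value):
            counts[value] += 1
    return counts
```

With the values `["ACTB", "ACTB,GAPDH", None, "GAPDH", "ACTB"]`, only the three single-gene alignments contribute to the tally.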

v0.3.5

29 May 14:50
d4cbc45
  • Major SplitBam speed improvements with parallelization

  • The --num-processes command-line argument to SplitBam sets how many processes run in parallel

v0.3.4

15 May 17:23
5e04815
  • Fix handling of picard metrics from unpaired reads
  • Fix plumbing issues in the generic AttachBarcodes entry point
  • Improve linting