Crash on generating descriptive stats #105

mbassalbioinformatics · 2019-03-06T05:05:05Z

Hi

So when running the pipeline, i seem to be getting a crash when attempting to generate the describe stats as follows...

Descriptive statistics...
[1] "I am loading useful packages for plotting..."
[1] "2019-03-05 23:05:59 EST"
Error in if (any(i < 0L)) { : missing value where TRUE/FALSE needed
Calls: <Anonymous> ... as -> .class1 -> .TM.repl.i.mat -> [<- -> [<- -> int2i
In addition: Warning message:
In int2i(as.integer(i), n) : NAs introduced by coercion to integer range
Execution halted

thoughts/suggestions??

The text was updated successfully, but these errors were encountered:

cziegenhain · 2019-03-06T06:40:15Z

Hi,

We need a bit more information to troubleshoot this.
For instance: what kind of data are you processing, the YAML file, full verbose of the run

mbassalbioinformatics · 2019-03-06T08:01:35Z

So this was a ddseq3 run. Ive attached the terminal output and the yaml files (inc postmap). I can share the rds with you in confidence to help sort this out. Just need an email address to send the dl link.

N706-PBMC-CD34CD45-1-5-chip2.postmap.yaml.txt
N706-PBMC-CD34CD45-1-5-chip2.yaml.txt
analysis_dump.txt

Thanks!

gokceneraslan · 2019-03-06T13:52:11Z

I have the same issue. I tracked it down to countGenes function and it only happens with the inex matrix:

That's because the inex matrix is too large, so indexing leads to integer overflow. You can reproduce it with

This is also the same error as in https://bitbucket.org/hrue/r-inla/issues/1/logical-indexing-for-large-matrices-fails, but no idea how to solve this without breaking up the matrix into smaller pieces and counting genes separately.

cziegenhain · 2019-03-06T15:28:32Z

Hi @gokceneraslan - thanks for tracking the error down so quickly and the fix.

@mbassalbioinformatics : in addition to updating zUMIs with this fix, you should double check your ddseq settings. I dont think its reasonable to expect that many cell barcodes? If I remember correctly, ddseq should be run with the frameshift-correction in the read1 settings
correct_frameshift: TAGCCATCGCATTGC

Feel free to reopen the issue if further things arise!

gokceneraslan · 2019-03-06T16:01:01Z

@cziegenhain thanks for merging the PR. Could you please check if everything still works with the ExampleData dataset? I think fix is correct, but it's better to be on the safe side.

Actually, the proper way would be to add unit tests using the https://github.com/r-lib/testthat package, otherwise whole codebase becomes so fragile...

cziegenhain · 2019-03-06T20:07:51Z

Thanks again for the PR, I double-checked and example data runs as expected.
Appreciate your input!

Fixes #105

gokceneraslan mentioned this issue Mar 6, 2019

Fix integer overflow issues in countGenes for large inex matrix #106

Merged

cziegenhain closed this as completed in #106 Mar 6, 2019

smanne07 mentioned this issue Apr 17, 2019

Error in preprocessing cole-trapnell-lab/cicero-release#25

Closed

cziegenhain pushed a commit that referenced this issue Feb 18, 2020

Fix integer overflow issues in countGenes for large inex matrix

764d3b2

Fixes #105

smgogarten mentioned this issue May 25, 2021

Error in if (any(i < 0L)) { : missing value where TRUE/FALSE needed UW-GAC/GENESIS#66

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Crash on generating descriptive stats #105

Crash on generating descriptive stats #105

mbassalbioinformatics commented Mar 6, 2019

cziegenhain commented Mar 6, 2019

mbassalbioinformatics commented Mar 6, 2019

gokceneraslan commented Mar 6, 2019

cziegenhain commented Mar 6, 2019 •

edited

Loading

gokceneraslan commented Mar 6, 2019

cziegenhain commented Mar 6, 2019

Crash on generating descriptive stats #105

Crash on generating descriptive stats #105

Comments

mbassalbioinformatics commented Mar 6, 2019

cziegenhain commented Mar 6, 2019

mbassalbioinformatics commented Mar 6, 2019

gokceneraslan commented Mar 6, 2019

cziegenhain commented Mar 6, 2019 • edited Loading

gokceneraslan commented Mar 6, 2019

cziegenhain commented Mar 6, 2019

cziegenhain commented Mar 6, 2019 •

edited

Loading