Odd sized kinship file #55

pjotrp · 2017-07-09T12:09:40Z

When running the new test testCenteredRelatednessMatrixK I get different resulting files on two build setups. One is the correct size, but the second has 1940 rows instead of 1410. The extra rows are the same as the tail of the previous - so, somehow, the output buffer gets overwritten. I'll scrutinize this more closely.

pjotrp · 2017-07-09T12:10:42Z

Oh yes, the other tools using K don't seem to mind. They don't read the full file. So, this may have been happening before, but it did not matter for the result.

pcarbo · 2017-07-10T12:07:35Z

@pjotrp Should I close this? It seems that it is not a bug after all?

pjotrp · 2017-07-10T12:59:57Z

No, I need to figure out what happened first.

pjotrp · 2017-07-21T08:35:08Z

Getting the same bug on my laptop. Test on master branch fails with

testCenteredRelatednessMatrixK
Reading Files ... 
## number of total individuals = 1940
## number of analyzed individuals = 1410
## number of covariates = 1
## number of phenotypes = 1
## number of total SNPs = 12226
## number of analyzed SNPs = 10768
Calculating Relatedness Matrix ... 
Reading SNPs  ==================================================100.00%
## total computation time = 0.35778 min 
ASSERT:expected:<24.9799> but was:<29.691>

and

wc -l test/output/mouse_hs1940.cXX.txt 
1940 test/output/mouse_hs1940.cXX.txt

it appears it is outputting the number of individuals total, rather than the number used. @pcarbo what is the size of your output file?

pcarbo · 2017-07-21T22:41:41Z

@pjotrp I get an output file mouse_hs1940.cXX.txt with 1940 lines. I think the "number of analyzed individuals" is misleading (and confusing!) because it is about the phenotype data, which is irrelevant for computing the relatedness matrix. If you look at the phenotype file you will see that there are indeed 530 missing phenotype values:

$ cut -f 1 mouse_hs1940.pheno.txt | grep NA | wc -l
530

In any case, this doesn't explain the error you are getting.

pjotrp · 2017-07-22T07:02:43Z

Cool, I swear I have seen a smaller K file. Anyway, we are getting different answers on different systems, so I am looking into that.

pjotrp · 2017-07-26T07:59:03Z

Turns out that the output files are identical, but that awk gives a different result on different machines! Not a gemma problem - I'll fix the test.

…t to shunit2 in repo fixes genetics-statistics#55

pcarbo · 2017-07-26T11:46:23Z

@pjotrp Interesting, thanks!

pjotrp mentioned this issue Jul 18, 2017

Added test suite, make check and openblas support #54

Merged

pjotrp self-assigned this Jul 21, 2017

pjotrp added the bug label Jul 21, 2017

pjotrp closed this as completed Jul 26, 2017

pjotrp added a commit to genenetwork/GEMMA that referenced this issue Jul 26, 2017

test_suite: replace awk with perl to compute file contents and defaul…

b31ec4f

…t to shunit2 in repo fixes genetics-statistics#55

pjotrp mentioned this issue Jul 26, 2017

Tests #60

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Odd sized kinship file #55

Odd sized kinship file #55

pjotrp commented Jul 9, 2017

pjotrp commented Jul 9, 2017 •

edited

Loading

pcarbo commented Jul 10, 2017

pjotrp commented Jul 10, 2017

pjotrp commented Jul 21, 2017

pcarbo commented Jul 21, 2017 •

edited

Loading

pjotrp commented Jul 22, 2017

pjotrp commented Jul 26, 2017

pcarbo commented Jul 26, 2017

Odd sized kinship file #55

Odd sized kinship file #55

Comments

pjotrp commented Jul 9, 2017

pjotrp commented Jul 9, 2017 • edited Loading

pcarbo commented Jul 10, 2017

pjotrp commented Jul 10, 2017

pjotrp commented Jul 21, 2017

pcarbo commented Jul 21, 2017 • edited Loading

pjotrp commented Jul 22, 2017

pjotrp commented Jul 26, 2017

pcarbo commented Jul 26, 2017

pjotrp commented Jul 9, 2017 •

edited

Loading

pcarbo commented Jul 21, 2017 •

edited

Loading