[query] In Query-on-Batch, the calculation for IBD is incorrect #14052

jigold · 2023-11-29T21:15:11Z

What happened?

We're dividing by X in two places when it should be Y for computing E10.

https://hail.zulipchat.com/#narrow/stream/123010-Hail-Query-0.2E2-support/topic/P.28I.3D1.7CZ.3D0.29.20computation.20in.20IBD/near/403886796

This is a trivial fix, but I want to make sure the value for E11 is also correct. We're using T/2, whereas in the paper it's 1.

Regardless, we need better tests for IBD that would have caught this error.

Version

0.2.126

Relevant log output

No response

jigold · 2023-12-04T17:48:17Z

We weren't actually running the tests in QoB previously. I enabled the tests in #14062, but they all still passed even with the error.

PASSED test/hail/methods/relatedness/test_identity_by_descent.py::test_ibd_default_arguments
PASSED test/hail/methods/relatedness/test_identity_by_descent.py::test_ibd_does_not_error_with_dummy_maf_float64
PASSED test/hail/methods/relatedness/test_identity_by_descent.py::test_ibd_0_and_1
PASSED test/hail/methods/relatedness/test_identity_by_descent.py::test_ibd_does_not_error_with_dummy_maf_float32

My guess is our test suite isn't robust enough. I don't think we test with any family relationships; the samples are all unrelated.
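To illustrate what a test with a family relationship could check, here is a minimal pure-Python sketch. It is not part of Hail's test suite and does not call Hail; `simulate_trio` and `ibs0_count` are hypothetical helpers, and the allele frequency of 0.3 is an arbitrary assumption. It relies on one fact that holds regardless of the estimator: barring genotyping error or mutation, a parent and child share at least one allele at every site, so they are never opposite homozygotes (IBS0 is zero), while two unrelated samples usually are at some sites.

```python
import random


def simulate_trio(n_variants, seed, allele_freq=0.3):
    """Simulate biallelic genotypes (0, 1, or 2 alt alleles) for a mother,
    father, and child. Hypothetical helper, for illustration only."""
    rng = random.Random(seed)

    def haplotype():
        # One haplotype: an independent alt-allele draw at every site.
        return [1 if rng.random() < allele_freq else 0 for _ in range(n_variants)]

    def to_genotype(haplotypes):
        # Genotype = number of alt alleles across the two haplotypes.
        return [a + b for a, b in zip(*haplotypes)]

    mother = (haplotype(), haplotype())
    father = (haplotype(), haplotype())
    # The child inherits one allele per site from each parent, chosen at random.
    child = (
        [rng.choice((mother[0][i], mother[1][i])) for i in range(n_variants)],
        [rng.choice((father[0][i], father[1][i])) for i in range(n_variants)],
    )
    return to_genotype(mother), to_genotype(father), to_genotype(child)


def ibs0_count(g1, g2):
    """Count sites where two samples are opposite homozygotes (share no alleles)."""
    return sum(1 for a, b in zip(g1, g2) if abs(a - b) == 2)


mother, father, child = simulate_trio(n_variants=2000, seed=0)
print("parent-child IBS0:", ibs0_count(mother, child), ibs0_count(father, child))
print("unrelated pair IBS0:", ibs0_count(mother, father))
```

A dataset built this way gives a test something to distinguish: the parent-child pairs and the unrelated pair should produce clearly different IBD estimates, which a dataset of only unrelated samples cannot exercise.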

CHANGELOG: Fixed bugs in the identity by descent implementation for Query on Batch.

This PR fixes #14052. There were two bugs in how we compute IBD. In addition, the tests weren't running in QoB, and the test dataset we were using doesn't have enough variability to catch errors. I used Balding-Nichols generated data instead. Do we need to set the seed in the tests here?
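On the seed question, the following pure-Python sketch shows why a fixed seed matters for generated test data. It loosely follows the standard Balding-Nichols model but is not Hail's implementation: the function name, the Uniform(0.1, 0.9) ancestral frequency range, and the uniform population assignment are all assumptions made for illustration.

```python
import random


def balding_nichols(n_populations, n_samples, n_variants, fst, seed):
    """Generate a variant-major genotype matrix (values 0, 1, 2).
    Hypothetical stand-in for illustration; not Hail's implementation."""
    rng = random.Random(seed)
    # Assign each sample to a population uniformly at random (assumption).
    populations = [rng.randrange(n_populations) for _ in range(n_samples)]
    genotypes = []
    for _ in range(n_variants):
        # Ancestral allele frequency; the range is an assumed default.
        ancestral = rng.uniform(0.1, 0.9)
        # Per-population frequencies drift from the ancestral one via a Beta
        # distribution whose spread is controlled by Fst.
        alpha = ancestral * (1 - fst) / fst
        beta = (1 - ancestral) * (1 - fst) / fst
        pop_freqs = [rng.betavariate(alpha, beta) for _ in range(n_populations)]
        # Each genotype is two independent allele draws at the sample's
        # population frequency, i.e. Binomial(2, p).
        row = [sum(rng.random() < pop_freqs[k] for _ in range(2)) for k in populations]
        genotypes.append(row)
    return genotypes


first = balding_nichols(n_populations=3, n_samples=10, n_variants=20, fst=0.1, seed=42)
second = balding_nichols(n_populations=3, n_samples=10, n_variants=20, fst=0.1, seed=42)
print("same seed reproduces the data:", first == second)
```

Without a fixed seed, each run sees a different dataset, so any assertion against specific expected values could pass or fail from one run to the next.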

jigold added the needs-triage label Nov 29, 2023

danking changed the title ~~Bad IBD implementation in QoB~~ [query] In Query-on-Batch, the calculation for IBD is incorrect Nov 30, 2023

danking added the bug and query labels and removed the needs-triage label Nov 30, 2023

jigold self-assigned this Dec 4, 2023

jigold mentioned this issue Jan 3, 2024

[qob] Fix IBD and enable tests #14062

Merged

danking closed this as completed in #14062 Jan 9, 2024
