-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
191 markers from Du et al 2023 #137
Conversation
@@ -1077,6 +1086,7 @@ Marker mh12WL-003 as defined in Yu2022G2 was defined previously and is redundant | |||
Marker mh12WL-003 as defined in Yu2022G3 was defined previously and is redundant | |||
Marker mh12WL-003 as defined in Yu2022G4 was defined previously and is redundant | |||
Marker mh12WL-004 as defined in Yu2022G2 was defined previously and is redundant | |||
Marker mh12WL-048 as defined in Du2023 was defined previously and is redundant |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I've confirmed the authors' claim that only a single MH published in this study is an exact duplicate of a previously published MH.
[Du2023] mh12SHY-001 --> mh12SHY-001.v4 | ||
[Du2023] mh02WL-002 --> mh02WL-002.v4 | ||
[Du2023] mh04WL-023 --> mh04WL-023.v5 | ||
[Du2023] mh11WL-005 --> mh11WL-005.v3 | ||
[Du2023] mh12WL-048 --> mh12WL-048.v1 | ||
[Du2023] mh18SCUZJ-0052717 --> mh18SCUZJ-0052717.v2 | ||
[Du2023] mh07ZHA-004 --> mh07ZHA-004.v2 | ||
[Du2023] mh01WL-077 --> mh01WL-077.v2 | ||
[Du2023] mh04WL-074 --> mh04WL-074.v2 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Less than 10 had any overlap whatsoever with previously published markers.
panel = microhapdb.markers[ | ||
microhapdb.markers.Source.str.contains("Pakstis2021") | ||
& microhapdb.markers.Name.str.startswith(loci) | ||
] | ||
return sorted(panel.Name) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Bugfix. Rather than explicitly specifying the precise marker identifier, updated this function to use the locus identifier and cross-reference with the Pakstis 2021 paper to grab the appropriate marker identifier.
This PR adds 191 markers from Du et al 2023. The markers range from 50-200 bp in length, and unfortunately seem to be relatively unimpressive in terms of allelic variation. Only two markers have a global average$A_e \gt 5.0$ , and both of these are alternative definitions for microhaps previously published by others: mh02LS-2qE spans the same 201 bp as mh02WL-002, but its 4 SNPs have an $A_e = 9.258$ versus the 5 SNPs from mh02WL-002.v2 (Yu 2022 Group 3) with an $A_e = 10.233$ ; likewise, mh18LS-18pB includes 3 SNPs spanning 110 bp with an $A_e = 5.35$ , versus the overlapping mh18SCUZJ-0052717.v1 (Zhu2023) which includes 6 SNPs spanning 347 bp with an $A_e = 11.444$ .
Closes #122.