Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

191 markers from Du et al 2023 #137

Merged
merged 5 commits into from
Aug 22, 2023
Merged

191 markers from Du et al 2023 #137

merged 5 commits into from
Aug 22, 2023

Conversation

standage
Copy link
Member

@standage standage commented Aug 22, 2023

This PR adds 191 markers from Du et al 2023. The markers range from 50-200 bp in length, and unfortunately seem to be relatively unimpressive in terms of allelic variation. Only two markers have a global average $A_e \gt 5.0$, and both of these are alternative definitions for microhaps previously published by others: mh02LS-2qE spans the same 201 bp as mh02WL-002, but its 4 SNPs have an $A_e = 9.258$ versus the 5 SNPs from mh02WL-002.v2 (Yu 2022 Group 3) with an $A_e = 10.233$; likewise, mh18LS-18pB includes 3 SNPs spanning 110 bp with an $A_e = 5.35$, versus the overlapping mh18SCUZJ-0052717.v1 (Zhu2023) which includes 6 SNPs spanning 347 bp with an $A_e = 11.444$.

ae_vs_length_du2023

Closes #122.

@@ -1077,6 +1086,7 @@ Marker mh12WL-003 as defined in Yu2022G2 was defined previously and is redundant
Marker mh12WL-003 as defined in Yu2022G3 was defined previously and is redundant
Marker mh12WL-003 as defined in Yu2022G4 was defined previously and is redundant
Marker mh12WL-004 as defined in Yu2022G2 was defined previously and is redundant
Marker mh12WL-048 as defined in Du2023 was defined previously and is redundant
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I've confirmed the authors' claim that only a single MH published in this study is an exact duplicate of a previously published MH.

Comment on lines +3068 to +3076
[Du2023] mh12SHY-001 --> mh12SHY-001.v4
[Du2023] mh02WL-002 --> mh02WL-002.v4
[Du2023] mh04WL-023 --> mh04WL-023.v5
[Du2023] mh11WL-005 --> mh11WL-005.v3
[Du2023] mh12WL-048 --> mh12WL-048.v1
[Du2023] mh18SCUZJ-0052717 --> mh18SCUZJ-0052717.v2
[Du2023] mh07ZHA-004 --> mh07ZHA-004.v2
[Du2023] mh01WL-077 --> mh01WL-077.v2
[Du2023] mh04WL-074 --> mh04WL-074.v2
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Less than 10 had any overlap whatsoever with previously published markers.

Comment on lines +206 to +210
panel = microhapdb.markers[
microhapdb.markers.Source.str.contains("Pakstis2021")
& microhapdb.markers.Name.str.startswith(loci)
]
return sorted(panel.Name)
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Bugfix. Rather than explicitly specifying the precise marker identifier, updated this function to use the locus identifier and cross-reference with the Pakstis 2021 paper to grab the appropriate marker identifier.

@standage standage merged commit 8e659a8 into master Aug 22, 2023
3 checks passed
@standage standage deleted the du2023 branch August 22, 2023 14:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Du et al. 188 MHs
1 participant