Update Varcode to work with new multi-species PyEnsembl #118
Conversation
|
Warning to @timodonnell, since his fear in #106 is coming true in this PR:
I do think it makes sense to get rid of it because |
|
Reviewed 10 of 11 files at r1. test/test_mouse.py, line 28 [r1] (raw file): varcode/maf.py, line 120 [r1] (raw file): varcode/reference_name.py, line 20 [r1] (raw file): varcode/reference_name.py, line 33 [r1] (raw file): varcode/reference_name.py, line 34 [r1] (raw file): varcode/reference_name.py, line 36 [r1] (raw file): Comments from the review on Reviewable.io |
|
Reviewed 1 of 11 files at r1. Comments from the review on Reviewable.io |
|
Use Review status: all files reviewed at latest revision, 6 unresolved discussions, all commit checks successful. test/test_mouse.py, line 28 [r1] (raw file): varcode/maf.py, line 120 [r1] (raw file): varcode/reference_name.py, line 20 [r1] (raw file): varcode/reference_name.py, line 33 [r1] (raw file): varcode/reference_name.py, line 34 [r1] (raw file): varcode/reference_name.py, line 36 [r1] (raw file): Comments from the review on Reviewable.io |
|
Review status: all files reviewed at latest revision, 1 unresolved discussion, all commit checks successful. varcode/reference_name.py, line 36 [r1] (raw file): Comments from the review on Reviewable.io |
…stop_codon code to work on Py3
Update Varcode to work with new multi-species PyEnsembl
The main change here is that we can no longer assume that a PyEnsembl release version refers uniquely to a genome, now we also need to figure out the species. Anywhere where the release version was used had to be updated to construct a species-specific EnsemblRelease object.
Another change is that we can now rely on PyEnsembl to tell us which genome is the most recent for a particular reference name (instead of hard-coding release versions in ladder of if-statements).
TODO in another PR: add unit tests with mm10 aligned VCFs