-
Notifications
You must be signed in to change notification settings - Fork 111
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
prepare - work around anndata bug #1260
Conversation
Codecov Report
@@ Coverage Diff @@
## master #1260 +/- ##
==========================================
+ Coverage 61.81% 62.40% +0.59%
==========================================
Files 66 66
Lines 4897 4918 +21
Branches 374 374
==========================================
+ Hits 3027 3069 +42
+ Misses 1778 1757 -21
Partials 92 92
Continue to review full report at Codecov.
|
@bkmartinjr posted a solution here. While I do think it is an improvement to the current implementation, perhaps we should consider using different I don't know if gene names follow a specific format, but perhaps it's possible to chose a character not in that format. Perhaps something like This might be a @sidneymbell question. |
How would this resolve the underlying algorithmic problem? I elected to use the Scanpy/Anndata standard rather than inventing another. What I think we need to do is change the algorithm so it is actually guaranteeing uniqueness. |
It wouldn't solve the algorithmic problem. Actually, I'd still push for my PR to go through in the anndata repo. It would simply make the names in the indices we produce more easily interpretable.
Technically, if we can convince ourselves that a char like |
There is no guarantee that the names will conform to this standard. And we really can't assume that either. They can be anything. |
Temporarily copy code from scverse/anndata#345 until the issue is resolved and released.
a5f7465
to
060de64
Compare
Work around for the issue scverse/anndata#344
Changes to launch are just lint (black autoformat). See change in prepare.py.