Include SortMeRNA index #24
Comments
👍 I was just going to ask for that.
These indexed files are big, even when zipped.
Do we need to include all of them? Is the size justified given that SortMeRNA is not used by default? @wasade @gregcaporaso, is SortMeRNA about to become much more important in QIIME?
What about placing them on ftp.microbio.me?
On Thu, Dec 3, 2015 at 1:12 PM, Colin Brislawn notifications@github.com
Sure, that would work. But would that defeat the purpose of
These files are large enough that we would have to store them using Git LFS. Do we want to introduce that?
Note that PyPI limits the size of packages, and we are nearing that limit with qiime-default-reference. I can't find the limit published anywhere, but release uploads will fail if the package is too large.
Doesn't defeat the purpose, as setup.py could just source the files.
On Thu, Dec 3, 2015 at 2:01 PM, Jai Ram Rideout notifications@github.com
True. I thought having the defaults in one repo was preferable, but idk about the original goals. It is functionally the same.
Another idea: what if we index these files the first time they are used, and save the index alongside greengenes? This takes about 20-40 minutes, but it would only have to be done once, and it avoids adding 700 MB to everyone's base QIIME distribution. We already do this every time
I'm interested in taking this, either for 1.9.2 or for 2. If this is going to get used as a default, either for OTU picking or tax assignment, I'm very interested.
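The "index on first use" idea above could look roughly like the sketch below. It is only an illustration, not code from QIIME or qiime-default-reference: the function names `ensure_index` and `build_with_indexdb_rna` and the `.idx` prefix are made up here, and it assumes SortMeRNA 2.x's `indexdb_rna` command is on the `PATH` and writes its index files as `<prefix>.<something>`.

```python
import glob
import subprocess


def build_with_indexdb_rna(ref_fasta, index_prefix):
    """Build a SortMeRNA index by shelling out to indexdb_rna.

    Assumption: SortMeRNA 2.x is installed; its indexdb_rna command takes
    the reference and the output prefix as one comma-separated pair.
    """
    subprocess.run(
        ["indexdb_rna", "--ref", f"{ref_fasta},{index_prefix}"],
        check=True,
    )


def ensure_index(ref_fasta, index_prefix=None, build=build_with_indexdb_rna):
    """Return the index prefix, building the index only if it is missing.

    Hypothetical helper: the index is stored next to the reference FASTA
    (e.g. alongside the greengenes files) so later calls can reuse it.
    """
    ref_fasta = str(ref_fasta)
    if index_prefix is None:
        # Hypothetical naming convention: <reference>.idx
        index_prefix = ref_fasta + ".idx"
    index_prefix = str(index_prefix)

    # Assumption: any file named <prefix>.* means the index already exists.
    existing = glob.glob(glob.escape(index_prefix) + ".*")
    if not existing:
        # Slow path, reported above as roughly 20-40 minutes; runs once.
        build(ref_fasta, index_prefix)
    return index_prefix
```

A caller would then do something like `prefix = ensure_index("97_otus.fasta")` before invoking SortMeRNA, paying the indexing cost only on the first run. A real version would probably also want to handle a read-only install directory and a partially written index.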
Computing the index on the 97% representative sequences is expensive, and easy to forget to do. It would be nice if a precomputed index were available in this repo.