-
Notifications
You must be signed in to change notification settings - Fork 102
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
proof of concept genomepy integration #323
Conversation
@simonvh yes, this is a good idea! I think this proof of concept is pretty complete; I think all it would need is a test with and without genomepy, and maybe including genomepy in https://github.com/daler/pybedtools/blob/master/optional-requirements.txt. Thanks for the contribution! |
OK, sounds good, thanks! We'll update and add a few tests. I'll ping you when I think it's ready for review. |
@daler Tests are implemented, and I added a short text to the docs in what I think is the relevant place. Let me know if you need additional changes. |
…edtools into simonvh-genome_from_genomepy
@simonvh sorry for the lag time. Went down a couple different rabbit holes trying to fix the failing test. Some notes for my future reference:
|
Thanks for the contribution! |
Hey @daler, this is more of a question at the moment than a full-fledged PR. Would you be interested in including genomepy support in pybedtools? I have a proof-of-concept implementation in this PR. Genomepy is a Python module to manage and use genomes. It supports download of genomes from UCSC, Ensembl or NCBI and streamlines a lot of things. One file it creates by default is a file with chromosome sizes. This can be accessed using genomepy as follows:
The idea here would be that if genomepy is installed, the name of the genome could be used as the
genome
argument to all of the pybedtools functions that require agenome
. There is, I think, not a lot of overhead if you don't have genomepy installed, and it wouldn't touch any of the existing functionality. However, if you do have genomepy, there is no need to provide a full path. It would be really useful to us, but then again, as the developers of genomepy we use it a lot :).Let me know what you think!