-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
example data #6
Comments
Did you want to have hard coded BAM and vcfTabix examples in the repository? |
Yep. This is what I do with pybedtools, and it works out pretty well. It's really useful for testing and replicating the exact results on other machines. They need to be small though so the repo and distribution don't get to big (<<1MB). Do you have any good examples on hand? Or I can just make some random ones similar to the example bigWigs. |
I have some whole genome example files in https://github.com/ENCODE-DCC/trackhub_example, however this are quite big. You could choose data from 1 chromosome to make sure the distribution is <<1MB. |
That looks good -- I'll grab small parts of those files to use. |
Sounds good. I am looking into making a vcfTabix file, but haven't found a could method, have you? |
I don't use VCF in my work, so it's the least familiar format to me. I don't know of any method for generating random VCF. BEDTools/pybedtools handles the format, so it should be able to randomize an existing VCF easily enough. |
That might be the best way to go. I will see if I can find a VCF file. |
This is now implemented with @evajason's work with all the example hubs, using data from UCSC. |
currently have bigBed and bigWig example data, but need to make some BAM and vcfTabix example data as well
The text was updated successfully, but these errors were encountered: