Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adversarial testing datasets: PhiX #11

Closed
TransGirlCodes opened this issue Jul 3, 2019 · 1 comment
Closed

Adversarial testing datasets: PhiX #11

TransGirlCodes opened this issue Jul 3, 2019 · 1 comment

Comments

@TransGirlCodes
Copy link
Member

We should add some adversarial testing datasets either to BioSequenceGraphs.

They may take up some space and so we'll have to think about how to do this with bigger datasets.

But for now, I propose adding a PhiX dataset: We can use the PhiX reference genome sequence. Use Pseudoseq.jl to generate paired-end reads. We will need to decide on a read length and average insert size. Once we have the read files we can include the reference and the reads, and use that data to test how our graph functions are working.

@ardakdemir
Copy link
Member

ardakdemir commented Jul 5, 2019

I have used ecoli-ref.fasta available under the example folder of Pseudoseq.jl to generate very short reads (read_length = 20). Then successfully constructed the dbg for k= 10. The resulting graph contains 742343. I will be using this graph for testing the implemented functionalities. It is available in my local yet I can not push it as the file has size 575MB.

@TransGirlCodes TransGirlCodes moved this from To Do to In progress in Genome Assembly with Julia Aug 2, 2019
Genome Assembly with Julia automation moved this from In progress to Done Jan 17, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Development

No branches or pull requests

2 participants