multi fasta input #39

Open
alstonius opened this issue May 18, 2020 · 2 comments

Comments

@alstonius

Hi,
I'm trying out DeepSimulator on a bacterial genome where the input FASTA has 3 sequences (chromosome, plasmid_1, plasmid_2). What parameters do I need to set to get, say, 250 simulated reads in total from this multi-FASTA input?

If I simulate 250 reads (-n 250) I obtain 750 reads (i.e. 250 reads per input 'chunk'). I tried setting '-D 0' to concatenate the 3 input chunks but still see 750 reads. The chromosome is the biggest chunk of the genome, so I would expect most of the simulated reads to come from the chromosome rather than being split evenly across the 3 sequences. I must be missing something...

thanks!
Mark

@Markusjsommer

Same issue here. I need to filter out reads from plasmids etc., otherwise the read set is very biased.

@tim488

tim488 commented Jan 28, 2021

Same here. @liyu95 do you have a recommendation? It's kinda tedious...
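
Until there is an answer from the maintainers, one possible workaround (an assumption, not a documented DeepSimulator feature) is to split the multi-FASTA into one file per sequence and run the simulator separately on each, with a `-n` value proportional to that sequence's length. The sketch below only does the bookkeeping: it reads sequence lengths from a FASTA handle and divides a total read count across the records by length. The function names `fasta_lengths` and `allocate_reads` are hypothetical helpers written for this example, and the sequence lengths in the demo are made up.

```python
from io import StringIO


def fasta_lengths(handle):
    """Return {record_name: sequence_length} for an open FASTA handle."""
    lengths = {}
    name = None
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the name.
            name = line[1:].split()[0]
            lengths[name] = 0
        elif name is not None:
            lengths[name] += len(line)
    return lengths


def allocate_reads(lengths, total_reads):
    """Split total_reads across records in proportion to sequence length.

    Uses the largest-remainder method so the counts always sum to total_reads.
    """
    genome_size = sum(lengths.values())
    if genome_size == 0:
        raise ValueError("no sequence data found")
    counts = {}
    remainders = {}
    for name, length in lengths.items():
        # Integer arithmetic avoids floating-point rounding surprises.
        counts[name], remainders[name] = divmod(total_reads * length, genome_size)
    leftover = total_reads - sum(counts.values())
    # Hand the leftover reads to the records with the largest remainders.
    by_remainder = sorted(remainders, key=remainders.get, reverse=True)
    for name in by_remainder[:leftover]:
        counts[name] += 1
    return counts


if __name__ == "__main__":
    # Made-up lengths, for illustration only.
    demo = StringIO(
        ">chromosome\n" + "A" * 9000 + "\n"
        ">plasmid_1\n" + "C" * 800 + "\n"
        ">plasmid_2\n" + "G" * 200 + "\n"
    )
    print(allocate_reads(fasta_lengths(demo), 250))
    # {'chromosome': 225, 'plasmid_1': 20, 'plasmid_2': 5}
```

Each per-sequence count could then be passed as `-n` when simulating that sequence on its own, and the resulting reads pooled afterwards. Whether per-sequence runs behave identically to a single run on the full genome is not something this thread confirms.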
