-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Read Simulation: reference length limitation? #30
Comments
NanoSim is able to deal with multiple entries in a fasta file and it should work fine for small reference genomes, just take longer to generate a proper read length. Could you send me your reference plasmid fasta file so I can test on it? Thanks! |
I send the file to your bcgsc.ca mail address. I did some more testing and it seems that every reference over ~40kb works but with ~20kb or less it always crashes for me. However, if I use --perfect, it works just fine. The problem probably lies within mutate_read. |
I found a solution: specifying the "max_len" as the length of the reference contigs. |
Oh, I see. NanoSim will try to mimic the length distribution as the training profile. So if your reference genome is smaller than the empirical length distribution, it may not able to find the right length for each read. Specifying the |
Hi,
I was hoping to use this tool to simulate some reads for a set of amplicons. However, it gives me an error when I try it:
the reference file contains 10 000 entries with a size of 1,3 kb. So I thought maybe it can not deal with so many entries. That is why I tried the simulation with a plasmid of 7.4 kb, but that outputs the same error.
When I try it with a 49kb lambda genome it works, that is why I assume there is a limitation in the reference size?
Is it possible to adjust NanoSim for shorter references or am I missing something?
The text was updated successfully, but these errors were encountered: