-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about eventalign parallelization at file level #770
Comments
Yes, that is the recommended way to speed it up.
Jared
… On Apr 25, 2020, at 4:51 AM, Milad Miladi ***@***.***> wrote:
Hi,
Is it possible to speedup eventalign computations by splitting the files and/or region windowing?
For example to speedup nanopolish eventalign --reads all.fastq --bam all.bam --genome genome.fa > all.tsv, split the fastq file and the run:
nanopolish eventalign --reads half1.fastq --bam all.bam --genome genome.fa > half1.tsv
nanopolish eventalign --reads half2.fastq --bam all.bam --genome genome.fa > half2.tsv
cat half1.tsv half2.tsv > all.tsv
Best,
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
Great, Thanks. |
Sorry, I misread your issue initially (I shouldn't try to answer emails first thing in the morning...). Splitting the fastq would work, but isn't the recommended way since it will still iterate over every read in the bam, but ignore them because it won't find the signal data. You should provide a coordinate range as the last argument (without
|
Thanks a lot for your prompt supports. The coordinate option hint would be very life (time) saving :-) |
Hi @jts , I have got stumbled on the expected input of the eventalign range option. There are cases where the output tsv is empty with no errors:
Here, I have spliced reads with 5'end at the upstream of position 21000, but all the reads fully cover the range 21000-22000. It seems, though not so sure, I only get the aligned events if I use a start range that covers the 5'end of the read. Is it the expected behavior? |
Hi,
Is it possible to speedup eventalign computations by splitting the files and/or region windowing?
For example to speedup
nanopolish eventalign --reads all.fastq --bam all.bam --genome genome.fa > all.tsv
, split the fastq file and then run:Best,
The text was updated successfully, but these errors were encountered: