-
Notifications
You must be signed in to change notification settings - Fork 159
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No methylation calls #826
Comments
Hi @JesseBNL, I need a bit more information - what did Jared |
Hi, it did only print the first line, no hits for reads. The indexing worked fine and I have an average coverage of approximately 150X. The region is large (200+ kb) and is covered by multiple overlapping reads. However, not every part of the target region is covered by the reads, so there will be target regions without coverage. Jesse |
If you only get a header line, and no line that says |
Yes, it's only the header line. The linux system shows the run ended, and I am sure the computer is powerful enough to run this.. Is there something else that might disturb the methylation calling? |
Can you paste the exact command you ran, and all the messages printed to the terminal? |
Yes, its:
And then it stops after the last line, which ends within seconds. |
If it terminates right after that line its probably running out of memory. Try to monitor the memory usage in another terminal window using |
Ok, CPU % jumps to 99%, so it might be the issue. I do actually have a powerful computer here, so how can I solve this? |
CPU usage of 99% is normal, I don't think that is the issue |
Any other idea what might stop the methylation calling? |
Without some diagnostic information its pretty hard to debug. Can you try running this command, and paste all messages printed to the terminal result?
|
I just noticed that the region ( |
Hi, the mapping quality is >20, but yes, the region we are studying is highly repetitive. Might this be a problem? I will try the other command and post the terminal result! |
This is the terminal output:
|
Do you have any other regions to try? I believe the most likely explanation is that nanopolish couldn't use any of your data |
Hmm, no this is all we captured. And the alignment looks fine, mapping the reference genome where it should. It is also a CpG rich region, so we do expect a lot of methylation calls.. |
could you provide me with the BAM file for this region?
…On Thu, Aug 27, 2020 at 9:18 AM JesseBNL ***@***.***> wrote:
Hmm, no this is all we captured. And the alignment looks fine, mapping the
reference genome where it should.
It is also a CpG rich region, so we do expect a lot of methylation calls..
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#826 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAC7DH75EF7WFSPIZMB2HYDSCZMK5ANCNFSM4QKU3D7A>
.
|
Yes, how can I send this file to you? |
If you can temporarily host on dropbox or google drive it would be best
…On Thu, Aug 27, 2020 at 9:46 AM JesseBNL ***@***.***> wrote:
Yes, how can I send this file to you?
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
<#826 (comment)>, or
unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAC7DH34FZPYOA66TH3FU7LSCZPS7ANCNFSM4QKU3D7A>
.
|
Hi, you can download it here: |
I tried to look at the region of interest with
Can you let me know which region I should look at? |
Hi, I used this reference, its HG38, but the target region extracted. |
By extracting the region you've changed the coordinate system so the region specified in the nanopolish command ( |
I changed the coordinates to the extracted reference. But I will give it another try with the full reference and update you. Thanks. |
Sorry for my late reply. I have managed to get the reads mapped in the region in which I expect them to match. But I get back that all mapped reads in the target region have bad fast5 files: Which leaves the methylation_calls.tsv empty. What might this be? The indexing seems to be successful and the fastq files were only modified by merging with cat command. |
Hi, I have added a small program called
This will hopefully help us understand what the problem is. Jared |
Hi Jared,
Thanks for adding this tool. How to install the branch? I couldnt find it.
Thanks,
Jesse
|
Hi, I've just merged it into the master branch, so if you do Jared |
Thanks, that worked. The output is:
The readdb file contains 271 fast5 files
[fast5] OK: opened fast5_pass//FAO37809_pass_1d6d03df_0.fast5
[read] OK: found 6634 raw samples for 001cb348-3b42-4d56-ba2f-5d9a10613558
[read] OK: found 6513 raw samples for 00232cc4-78dd-4e68-aefc-878e5f21cb51
[read] OK: found 10278 raw samples for 002d3ff6-14b4-41d4-b877-1763a45098c3
[read] OK: found 101808 raw samples for 0032e176-144b-4ee1-8c75-eaac0677fa50
[read] OK: found 12390 raw samples for 0033508e-b2a3-41a7-9c63-6b3619faa3d4
[read] OK: found 16150 raw samples for 007d79f8-9ec8-4c6c-9770-230140e622e1
[read] OK: found 34351 raw samples for 008c13ad-a6b7-4133-a8a3-4ef8eea6d13c
[read] OK: found 8158 raw samples for 00946f69-5822-4b2e-94f7-33b42d47311e
[read] OK: found 18332 raw samples for 00a11355-95c7-4908-b220-9b2914885beb
[read] OK: found 28754 raw samples for 00c17827-162e-4a21-860d-331f8e40e4eb
[read] OK: found 21841 raw samples for 00ca68ad-0782-427b-a04e-4cac7553e79d
[read] OK: found 13863 raw samples for 00deb6b2-cda7-4d41-ac77-358cf18e333a
[read] OK: found 3529 raw samples for 00ebb8e3-8a38-40a7-b60a-ea99326821af
[read] OK: found 19348 raw samples for 00ed1d6d-41f3-4ecd-97d7-611b8d4454de
[read] OK: found 28173 raw samples for 00ee3e18-4076-4a64-80e3-d5e761dfd2d6
And this continues...
|
Let it go for awhile and see if any "ERROR" messages come up. Jared |
It finished without errors |
Ok, I will think of other possible causes and update the program
… On Oct 6, 2020, at 1:36 AM, JesseBNL ***@***.***> wrote:
It finished without errors
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or unsubscribe.
|
Can you try the following, from your
Then run the |
Ok, this got something out. I used the command:
And returned:
And so on.. So it seems it cannot define the path? How might this be fixed? |
Hi, It looks like you have joined the read_id ( Jared |
Hi Jared, Indeed this was a problem. But now I have the same problem as at the start.. The "iterating over region" ends suddenly and the tsv file ends up empty. I have too a little sample with only a few mapping reads, to speed up the testing. Might I send those files to you so you can give it a try? |
Hi, Commands: Terminal output: |
Hi,
I have been fixing this problem for several days trying debugging by indexing different paths or downloading reference genomes but they didn't help. Do you know how to fix the high no alignment problem? Thank you very much. |
Hi,
Can you describe the data? Is it R9.4?
… On Feb 19, 2022, at 9:16 AM, Yingzi Zhang 张樱子 ***@***.***> wrote:
Hi,
I cannot call the methylation as well. The no alignment of mine is very high. Reports here:
[post-run summary] total reads: 229, unparseable: 0, qc fail: 0, could not calibrate: 0, no alignment: 229, bad fast5: 0
[post-run summary] total reads: 353, unparseable: 0, qc fail: 0, could not calibrate: 0, no alignment: 353, bad fast5: 3
I have been fixing this problem for several days trying debugging by indexing different paths or downloading reference genomes but they didn't help. Do you know how to fix this problem? Thank you very much.
—
Reply to this email directly, view it on GitHub, or unsubscribe.
Triage notifications on the go with GitHub Mobile for iOS or Android.
You are receiving this because you commented.
|
Hi,
The report and the results were as shown in my last message.
Many thanks. |
Hi there, my problem still exists. Now I wonder if it is because my sequencing data were generated by R10.3. Could this be the case? Would appreciate it very much if you can reply to my problem. Thank you! Best, |
Hi, Sorry for the slow reply. Yes it is because it is R10.3. There is experimental support for R10 data in the Jared |
Hi,
We have captured a a genomic region, and want to define the methylation of those reads. However, whatever I try, there are no calls made on the data. What could go wrong?
If I use the tutorial data set, it does work, so the pipeline is running..
Thanks,
Jesse
The text was updated successfully, but these errors were encountered: