help for detecting heteroplasmy #90

Mirror1211 · 2019-06-08T07:51:02Z

Hi everybody,
I'm Eda. Recently, I am going to detect the heteroplasmy within chloroplast genome to investigate wether there is biparental heteroplasmy within my samples. I am really glad to find the NOVOPlasty since it can detect heteroplasmy for chloroplast genome.
However, there is no detailed introductions to this analysis and I always obtained blank result files for heteroplasmy analyses. Could anybody help me for the settings of the configure file???
Thanks a lot!

Mirror1211 · 2019-06-08T07:57:50Z

I have already obtained the complete chloroplast genomes assembled by my colleague, and took them as their own reference genome and seed genome, while the results files also were blank.

ndierckx · 2019-06-08T15:47:25Z

Hi, could you send me the log, so I can check if all the parameters are correct. And do you have enough coverage for heteroplasmy detection? I haven't test it much on chloroplast genome but it is more complicated because of the many duplicated regions in the mitochondrial genome

Mirror1211 · 2019-06-09T02:47:19Z

The mean coverage of cp genomes ranged from 200 to 7900 X. So, I think the coverage of cp genomes is enough to detece heteroplasmy. I have deleted the log file. I will conduct this analysis this afternoon and upload the log files here.
Thanks a lot

Mirror1211 · 2019-06-09T03:19:08Z

Reading Input......OK

Scan reference sequence......OK

Building Hash Table......OK

Subsampled fraction: 100.00 %

Retrieve Seed...

However, there was no futher infomation for the next step. I guess there may be some errors in my configure files.

Mirror1211 · 2019-06-09T03:21:40Z

heter.txt
log_test_chloro.txt
Here are my configure and log files. Thanks

ndierckx · 2019-06-09T04:54:57Z

Your MAF is way too low, you are looking for heteroplasmy of 0,01%, that is impossible. If you want 1%, put 0,01. I will put an automatic allert in the next version. Try what it gives with 0,01. But i see you have reads of 90bp, that means they are old i guess, so less accurate. So definitely don't go below 0.01

Mirror1211 · 2019-06-09T05:06:22Z

Thanks. Actually, I used 0.01 for MAF first. The result files were still blank.

Mirror1211 · 2019-06-09T05:08:12Z

Could I know when the next version will be uploaded in gitub????

Mirror1211 · 2019-06-09T05:10:52Z

Is there any errors in my configure files except for MAF????

ndierckx · 2019-06-09T06:28:10Z

Send me the log of the 0,01 then

Mirror1211 · 2019-06-10T02:53:33Z

OK. I will upload the log of the 0.01 this afternoon.

ndierckx · 2019-06-10T06:08:49Z

ok, and when you did it before, it also got stuck at Seed retrieval?
With a MAF lower than 0.01 that can happen

Mirror361025 · 2019-06-11T02:32:33Z

yes, it also got stuck at Seed retrieval.

Mirror361025 · 2019-06-11T02:33:54Z

Sorry, our serve is maintaining, so I can not download the logfile.

Mirror361025 · 2019-06-11T02:34:59Z

Exactly, I have arond 50 accessions that need to conduct the heteroplasmy detection.

ndierckx · 2019-06-11T02:41:37Z

Hi,

Could you send me the seed file?

Mirror361025 · 2019-06-11T02:49:06Z

ok. The serve maintaince will be finished tommorrow. After that, I will upload my seed file.

Mirror361025 · 2019-06-12T01:38:07Z

results.zip
hi, I have analyzed another sample with two different MAF. I obtained the results of heteroplasmy. Here are the all results.

Mirror361025 · 2019-06-12T01:39:05Z

I don't know whether the analysis is correct. In addition, the results confused me.

ndierckx · 2019-06-12T01:44:51Z

Did you use the chloroplast assembly as a reference and seed or some online reference?

ndierckx · 2019-06-12T02:28:56Z

Because you have a lot of homoplasmies, so your reference is probably not from that dataset?
if you run again, please use the extended log (set to 1), then I can see when something went wrong.
And you should try a high coverage dataset first

ndierckx · 2019-06-12T02:32:34Z

Ah sorry coverage seems enough.., but results are bit weird indeed, if you send me one dataset, I can also try myself if you want

ndierckx · 2019-06-12T02:35:36Z

But since all heteroplasmy is found in one short region, I would guess there is no heteroplasmy, something surely went wrong in that area

Mirror361025 · 2019-06-12T03:19:22Z

yes, I used the assembled chloroplast genome as reference genome. I am sure the chloroplast genome is consistent with the reads I used. I wonder whether I should use the raw reads from the whole-genome resequencing or use the filtered data that only contained chloroplast reads??

Mirror361025 · 2019-06-12T03:19:57Z

I wonder whether I can have your email for the upload of raw data???

ndierckx · 2019-06-12T03:26:51Z

nicolasdierckxsens at hotmail dot com

Mirror361025 · 2019-06-12T03:28:08Z

Thanks

Mirror361025 · 2019-06-12T03:47:14Z

hello, the reads should be the raw whole-genome sequences or the filter data that only contained chloroplast reads??

ndierckx · 2019-06-12T07:43:53Z

How did you filter them? And are the raw files very large?

Mirror1211 · 2019-06-12T23:06:55Z

We used the complete cp genomes of related species as reference genomes to screen out the cp genomes reads. The raw reads are ～40 G

Mirror1211 · 2019-06-13T02:29:26Z

hi, I have deleted some files and compressed them while there is also 8 G data. I will send these files in three emails to you. Thanks.

ndierckx · 2019-07-09T18:54:20Z

Hi, Sorry was busy and was on holiday so didn't had the time to try it any further
Are you still interested in heteroplasmy detection?

Mirror1211 · 2019-07-14T23:45:45Z

yes. Would you have any suggestions for my analyses??? 在2019-07-10 02:54:21，Nicolas Dierckxsensnotifications@github.com写道： Hi, Sorry was busy and was on holiday so didn't had the time to try it any further Are you still interested in heteroplasmy detection? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

ndierckx · 2019-07-15T04:28:11Z

Did you already have a successful chloroplast assembly?

Mirror1211 · 2019-07-15T08:59:26Z

yes. In fact, the cp genome was assembled by my colleague using other software, and I conducted the following phylogenetic analyses. I think that it is also important for us to test the bioparental interitance by detecting the heteroplasmic sites between different accessions. Fortunatelly, I found your software. The coverage of cp genomes is up to 9065 X and raw reads is around 40G. 在2019-07-15 12:28:12，Nicolas Dierckxsensnotifications@github.com写道： Did you already have a successful chloroplast assembly? — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

ndierckx · 2019-07-15T09:10:18Z

Hi,
Ok as the heteroplasmy function is quite new and I am fixing some problems at the moment, it is better to wait until tomorrow or the day after to run the latest version

I haven't tried many chloroplast sequences, so when you do a run you can send me the results.

At the moment, you can already filter your reads to speed up the heteroplasmy detection:
https://github.com/ndierckx/NOVOPlasty/wiki/Heteroplasmy-detection

Use the filter_read.pl script on your original dataset

Mirror1211 · 2019-07-17T06:48:48Z

OK. Thanks. 在2019-07-15 17:10:18，Nicolas Dierckxsensnotifications@github.com写道： Hi, Ok as the heteroplasmy function is quite new and I am fixing some problems at the moment, it is better to wait until tomorrow or the day after to run the latest version I haven't tried many chloroplast sequences, so when you do a run you can send me the results. At the moment, you can already filter your reads to speed up the heteroplasmy detection: https://github.com/ndierckx/NOVOPlasty/wiki/Heteroplasmy-detection Use the filter_read.pl script on your original dataset — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

Mirror1211 · 2019-07-17T06:52:52Z

Hi I also wanna know whether the genome in the first step for the heteroplasmy detection must be assembled by NOVOPlasty??? could I use a complete genome assembled by other programs ?? Because most of the studied cp genomes have already assembled by my colleague last year. 在2019-07-15 17:10:18，Nicolas Dierckxsensnotifications@github.com写道： Hi, Ok as the heteroplasmy function is quite new and I am fixing some problems at the moment, it is better to wait until tomorrow or the day after to run the latest version I haven't tried many chloroplast sequences, so when you do a run you can send me the results. At the moment, you can already filter your reads to speed up the heteroplasmy detection: https://github.com/ndierckx/NOVOPlasty/wiki/Heteroplasmy-detection Use the filter_read.pl script on your original dataset — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

Mirror1211 · 2019-07-17T07:04:10Z

oh, sorry Also, the introduction of perl scripts said that no repeatitive sequences should be inclueded in this analysis. Cp genomes have two large repeative copies (also called IRa and IRb) that were reverse and complementary with each other. So should I remove these two sequences and concatenate the remaing sequences as a new whole sequence to conduct this analysis, or should I keep one of the two IR regions? 在2019-07-15 17:10:18，Nicolas Dierckxsensnotifications@github.com写道： Hi, Ok as the heteroplasmy function is quite new and I am fixing some problems at the moment, it is better to wait until tomorrow or the day after to run the latest version I haven't tried many chloroplast sequences, so when you do a run you can send me the results. At the moment, you can already filter your reads to speed up the heteroplasmy detection: https://github.com/ndierckx/NOVOPlasty/wiki/Heteroplasmy-detection Use the filter_read.pl script on your original dataset — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

ndierckx · 2019-07-23T12:33:08Z

Always better to use a NOVOPlasty assembly, but you can also do it with assembled genome, as long as it is from the same dataset. You can keep the IR, but if they are not identical, there will be false positives in those regions. I am still working on the heteroplasmy model, still needed some debugging, so I will upload a new version tomorrow (best to wait for that one)

Mirror1211 · 2019-07-24T00:40:32Z

Thanks a lot! 在2019-07-23 20:33:09，Nicolas Dierckxsensnotifications@github.com写道： Always better to use a NOVOPlasty assembly, but you can also do it with assembled genome, as long as it is from the same dataset. You can keep the IR, but if they are not identical, there will be false positives in those regions. I am still working on the heteroplasmy model, still needed some debugging, so I will upload a new version tomorrow (best to wait for that one) — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

ndierckx · 2019-07-31T02:25:58Z

Hi,

Sorry took a bit longer, but the new version is uploaded, should perform better

Mirror1211 · 2019-08-02T05:39:41Z

Thanks 在2019-07-31 10:25:58，Nicolas Dierckxsensnotifications@github.com写道： Hi, Sorry took a bit longer, but the new version is uploaded, should perform better — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or mute the thread.

ndierckx closed this as completed Jun 22, 2023

help for detecting heteroplasmy #90

help for detecting heteroplasmy #90

Comments

Mirror1211 commented Jun 8, 2019

Mirror1211 commented Jun 8, 2019

ndierckx commented Jun 8, 2019

Mirror1211 commented Jun 9, 2019

Mirror1211 commented Jun 9, 2019

Mirror1211 commented Jun 9, 2019

ndierckx commented Jun 9, 2019

Mirror1211 commented Jun 9, 2019

Mirror1211 commented Jun 9, 2019

Mirror1211 commented Jun 9, 2019

ndierckx commented Jun 9, 2019

Mirror1211 commented Jun 10, 2019

ndierckx commented Jun 10, 2019

Mirror361025 commented Jun 11, 2019

Mirror361025 commented Jun 11, 2019

Mirror361025 commented Jun 11, 2019

ndierckx commented Jun 11, 2019

Mirror361025 commented Jun 11, 2019

Mirror361025 commented Jun 12, 2019

Mirror361025 commented Jun 12, 2019

ndierckx commented Jun 12, 2019

ndierckx commented Jun 12, 2019

ndierckx commented Jun 12, 2019

ndierckx commented Jun 12, 2019

Mirror361025 commented Jun 12, 2019

Mirror361025 commented Jun 12, 2019

ndierckx commented Jun 12, 2019

Mirror361025 commented Jun 12, 2019

Mirror361025 commented Jun 12, 2019

ndierckx commented Jun 12, 2019

Mirror1211 commented Jun 12, 2019 • edited Loading

Mirror1211 commented Jun 13, 2019

ndierckx commented Jul 9, 2019

Mirror1211 commented Jul 14, 2019 via email

ndierckx commented Jul 15, 2019

Mirror1211 commented Jul 15, 2019 via email

ndierckx commented Jul 15, 2019

Mirror1211 commented Jul 17, 2019 via email

Mirror1211 commented Jul 17, 2019 via email

Mirror1211 commented Jul 17, 2019 via email

ndierckx commented Jul 23, 2019

Mirror1211 commented Jul 24, 2019 via email

ndierckx commented Jul 31, 2019

Mirror1211 commented Aug 2, 2019 via email

Mirror1211 commented Jun 12, 2019 •

edited

Loading