methyl_summary generation problem #65

PraddyumnaR · 2024-07-09T07:38:28Z

Methylframe <- generate_methylframe(

methyl_bed_list = methyl_bed,
Sample_count = 0,
Methyl_call_type = "Dorado",
filter_NAs = 1,
max_read_depth = 100,
gene_info = TRUE,
gene_coordinate_file = Geneco,
Gene_column = "gene.ID", # Pass the column name as a string
target_info = FALSE,
File_prefix = "Sample"
)
Creating the Megaframe
|--------------------------------------------------|
|==================================================|
Error in [.data.frame(Methyl_bed, , c(1, 2, 3, 4, 5, 6)) :
undefined columns selected

This is the error I get after submitting the DeepSignal3 output bed file as input. What should be done to resolve it?


decibel-tom · 2024-07-09T16:09:12Z

Can you submit your bed file so I can try to reproduce the error?
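In the meantime, "undefined columns selected" is what R reports when a data frame is indexed by column numbers it does not have, so a quick thing to check is whether the bed file really has six columns once it is read in. Here is a minimal sketch of that check. The helper name `check_bed_columns` and the toy data frames are hypothetical and not part of sounDMR.

```r
# Hypothetical helper (not part of sounDMR): verify that a bed-like data
# frame has at least the six columns that the failing call indexes.
check_bed_columns <- function(bed, required = 6) {
  if (ncol(bed) < required) {
    stop(sprintf("bed file has %d columns, expected at least %d",
                 ncol(bed), required))
  }
  invisible(TRUE)
}

# Toy data standing in for real bed files.
too_narrow  <- data.frame(chrom = "Chr2", start = 15, end = 16)
wide_enough <- data.frame(matrix(0, nrow = 2, ncol = 6))

# Passes silently for a six-column frame.
ok <- check_bed_columns(wide_enough)

# Fails with a readable message for a three-column frame.
msg <- tryCatch(check_bed_columns(too_narrow),
                error = function(e) conditionMessage(e))

# The same failure mode as in the traceback above, reproduced in base R.
base_msg <- tryCatch(too_narrow[, c(1, 2, 3, 4, 5, 6)],
                     error = function(e) conditionMessage(e))
```

If the check fails on your file, the column layout probably differs from what the chosen Methyl_call_type expects.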

PraddyumnaR · 2024-07-11T09:11:17Z

I have solved the methyl_frame generation problem, but now I am facing another problem:

methyl_summary <- create_methyl_summary(dmr_obj,
control = 'C',
treated = 'T',
additional_summary_cols = list(
c('sd', 'Group')
))
[1] "Output is in a different order. Try running again."
Error in [<-.data.frame(*tmp*, , ncol(Output_Frame) + 1, value = c(-50, :
replacement has 2887333 rows, data has 3076303
In addition: Warning messages:
1: In GenePercentPlant$Zeroth_pos == dmr_obj$ZoomFrame_filtered$Zeroth_pos :
longer object length is not a multiple of shorter object length
2: In GenePercentPlant$Gene == dmr_obj$ZoomFrame_filtered$Gene :
longer object length is not a multiple of shorter object length
head(dmr_obj$ZoomFrame_filtered$Zeroth_pos)
[1] 15 16 22 23 36 42
head(dmr_obj$ZoomFrame_filtered$Gene)
[1] "Chr2" "Chr2" "Chr2" "Chr2" "Chr2" "Chr2"
head(dmr_obj$LongPercent)
A tibble: 6 × 9
Groups: Gene, Zeroth_pos, Plant, Position, CX, Strand, Group [6]
Gene Zeroth_pos Plant Position CX Strand Group Chromosome Percent

1 Chr2 15 S1 15 CHH + C Chr2 50
2 Chr2 15 S2 15 CHH + T Chr2 0
3 Chr2 16 S1 16 CHH + C Chr2 17
4 Chr2 16 S2 16 CHH + T Chr2 0
5 Chr2 22 S1 22 CHH + C Chr2 33
6 Chr2 22 S2 22 CHH + T Chr2 0

length(dmr_obj$ZoomFrame_filtered$Zeroth_pos)
[1] 3076303
length(dmr_obj$ZoomFrame_filtered$Gene)
[1] 3076303
length(dmr_obj$LongPercent$Zeroth_pos)
[1] 5915114
length(dmr_obj$LongPercent$Gene)
[1] 5915114

Can you tell me how to resolve this and what the issue might be? I have added most of the information needed to resolve it and hope you can reply soon. Also, I used Dorado with the 5mC model.

PraddyumnaR · 2024-07-15T03:51:28Z

Can anyone help me solve this issue?

decibel-tom · 2024-07-15T16:16:14Z

Can you share either your R environment or attach your dmr_object here?

decibel-tom · 2024-07-18T20:50:09Z

I think it's going to be difficult for me to import this representation of your dmr_obj into my code to try to reproduce the error. Can you save your r environment and attach it here as a .Rdata file?

PraddyumnaR · 2024-07-20T03:54:31Z

Sorry, that is confidential. Can you suggest a solution based on the data above? If you need any other data to get to a solution, I can provide it. What I can tell you is that I followed the demo run with the example data and got results without any problems. I edited my input file to match the format of the example files, but I am now facing this problem.

For experimental data

sapply(dmr_obj, length)
ZoomFrame_filtered LongPercent LonUnMeth
13 9 8
LongMeth experimental_design_df Inputpers
9 6 3

For example data in package

sapply(dmr_obj, length)
ZoomFrame_filtered LongPercent LonUnMeth
22 9 8
LongMeth experimental_design_df Inputpers
9 6 6

Could this be causing the problem?

decibel-tom · 2024-07-22T14:49:46Z

I appreciate the additional information! So I'm curious what your fix was to the very first issue you raised? Our pipeline was set up for an older version of deepsignal and I'm curious if the output file is drastically different from what the code expects. Our latest improvements have been geared toward the use of Dorado since we have found that works better for us. It does appear like in your input data you have multiple rows for each position? That is the most likely culprit that I can think of with what's been provided since the older version of deepsignal / dorado output produces one row per position in the genome

PraddyumnaR · 2024-07-23T11:36:12Z

I am attaching the R environment below as a workspace. I have removed multiple rows for each position from the input bed file but I am getting the same error. Please look through it and suggest how to solve the problem. new_workspace.zip <https://drive.google.com/file/d/1ZcB5i16qlHO9xCCQTYnh1Vz3TJdHsvXN/view?usp=drive_web>


PraddyumnaR · 2024-07-24T03:47:48Z

As for the earlier issue I raised about methylframe generation, it occurred because the deepsignal3 output was in a different format. I later adjusted it to match the example files you provided, and after that the issue was resolved.

jcolicchio-soundag · 2024-07-26T00:40:25Z

The issue is indeed because there are multiple rows for the same position. This leads to "LongMeth" and "LongPercent" not being the appropriate lengths since they get created by using a grouping function. If you remove duplicate positions from your methylframe it should work.
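To illustrate why duplicates throw the lengths off, here is a small base R sketch with made-up data. `aggregate` stands in for the grouping step and is an assumption, not the package's actual internals.

```r
# Toy methylframe with one duplicated (Chromosome, Position) row.
zoom <- data.frame(
  Chromosome = c("Chr2", "Chr2", "Chr2", "Chr2"),
  Position   = c(15, 16, 16, 22),
  Percent    = c(50, 17, 20, 33)
)

# Grouping collapses the duplicated position into a single row...
grouped <- aggregate(Percent ~ Chromosome + Position, data = zoom, FUN = mean)

# ...so the grouped result is shorter than the ungrouped frame.
n_raw     <- nrow(zoom)
n_grouped <- nrow(grouped)

# Adding the grouped values as a new column then fails with the same
# kind of "replacement has ... rows, data has ..." error.
assign_msg <- tryCatch({
  zoom[, ncol(zoom) + 1] <- grouped$Percent
  "no error"
}, error = function(e) conditionMessage(e))
```

The difference between the two row counts is the number of extra rows that share a position with another row.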

Best,
Jack

PraddyumnaR · 2024-07-26T06:49:53Z

I have removed multiple rows for the same position, but it is still not working. I am doing this trial run on two samples to see whether it works before running the rest of the files, so I made a subset of the file for chromosomes 2 and 3, and I get the error when I run it. I am attaching two bed files, which contain the entries from one chromosome of my trial file, for you to run. Please try to find the error and help me sort it out. Thank you. Bed_files.zip <https://drive.google.com/file/d/1wV8WAnQntVWKV_kfV6Ltpgl7SyzkGyla/view?usp=drive_web>


jcolicchio-soundag · 2024-07-26T14:23:39Z

Just confirmed, there are still rows with the same chromosome and positions:

nrow(ZoomFrame_filtered)
[1] 3076303
distinct(ZoomFrame_filtered, Chromosome, Position, .keep_all = FALSE)->Zoom_RemoveDuplicates
nrow(Zoom_RemoveDuplicates)
[1] 2887333

If you run this distinct command first to remove duplicates (on the megaframe) your commands should work.
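Note that with `.keep_all = FALSE` the `distinct` call above returns only the Chromosome and Position columns, which is fine for counting but drops everything else. To deduplicate the megaframe itself you need to keep the other columns. Here is a minimal base R sketch on toy data. Keeping the first row of each duplicated pair is an assumption, so check that it is the row you want.

```r
# Toy stand-in for the megaframe; column names follow the thread.
megaframe <- data.frame(
  Chromosome = c("Chr2", "Chr2", "Chr2", "Chr3"),
  Position   = c(15, 16, 16, 16),
  Meth       = c(5, 1, 2, 7)
)

# Flag rows whose (Chromosome, Position) pair has already been seen.
is_dup <- duplicated(megaframe[, c("Chromosome", "Position")])

# Keep the first row for each pair, with all other columns intact.
megaframe_unique <- megaframe[!is_dup, ]
n_removed <- sum(is_dup)
```

With dplyr, `distinct(megaframe, Chromosome, Position, .keep_all = TRUE)` does the same thing.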

PraddyumnaR · 2024-07-27T04:47:36Z

Can you post the whole script you used to do this? I am still getting errors. If you were able to do the complete analysis, please send the complete script to my email ID praddyumnapdm@bicpu.edu.in

PraddyumnaR · 2024-08-01T07:05:14Z

Hey Jack, can you share the R commands you used to run the bed files I gave you, and how you removed the duplicates from them? You can write them here or mail them to the mail ID given above. Thank you.

PraddyumnaR · 2024-08-02T10:25:39Z

After removing the multiple entries for chromosome and position, this is what I am getting. Can you help me solve this?

methyl_summary <- create_methyl_summary(dmr_obj,
control = 'C',
treated = 'T',
additional_summary_cols = list(
c('sd', 'Group')
))
[1] "Number of columns in methyl_summary is correct"
methyl_summary <- find_DMR(methyl_summary, dmr_obj, fixed = c('Group'),
random = c('Individual'), reads_threshold = 5,
control = 'C', model = 'beta-binomial',
analysis_type = 'individual')
cbind(Meth, UnMeth) ~ (1 | Individual) + Group
<environment: 0x55d7b284a510>
|======================================================================| 100%
There were 50 or more warnings (use warnings() to see the first 50)
warnings()
Warning messages:
1: In finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
Model convergence problem; . See vignette('troubleshooting'), help('diagnose')
2: In finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
Model convergence problem; . See vignette('troubleshooting'), help('diagnose')
3: In finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
Model convergence problem; . See vignette('troubleshooting'), help('diagnose')
4: In finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
Model convergence problem; . See vignette('troubleshooting'), help('diagnose')
5: In finalizeTMB(TMBStruc, obj, fit, h, data.tmb.old) :
Model convergence problem; . See vignette('troubleshooting'), help('diagnose')

changepoint_cols = find_changepoint_col_options(methyl_summary)
character(0)
methyl_summary <- find_DMR(methyl_summary, dmr_obj, fixed = c('Group'),
random = c('Individual'), reads_threshold = 3,
control = 'C', model = 'binomial',
analysis_type = 'group')
cbind(Meth, UnMeth) ~ (1 | Individual) + Group
<environment: 0x55d745b5b8d8>
boundary (singular) fit: see help('isSingular')
[1] "37 No bobyqa Converge, trying Nelder"
[1] "37 No Converge"
[1] "78 No bobyqa Converge, trying Nelder"
[1] "78 No Converge"
[1] "80 No bobyqa Converge, trying Nelder"
[1] "80 No Converge"
[1] "95 No bobyqa Converge, trying Nelder"
[1] "95 No Converge"
[1] "172 No bobyqa Converge, trying Nelder"
[1] "172 No Converge"
[1] "174 No bobyqa Converge, trying Nelder"
[1] "174 No Converge"

jcolicchio-soundag · 2024-08-02T15:02:51Z

Nice work! That means it is working, but the model cannot converge, which is expected if there is only one individual per treatment group. You made it as far as is possible with just a single sample in each group. If you have two or more individuals per group this next step would work. Jack
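A quick way to check replication before running find_DMR is to count the individuals in each group. This is only a sketch. The `design` table and its column names are assumptions about how your experimental design is laid out.

```r
# Hypothetical experimental design table; column names are assumptions.
design <- data.frame(
  Individual = c("S1", "S2"),
  Group      = c("C", "T")
)

# Number of distinct individuals in each treatment group.
per_group <- tapply(design$Individual, design$Group,
                    function(x) length(unique(x)))

# The (1 | Individual) term needs more than one individual per group.
enough_replicates <- all(per_group >= 2)
```

With one sample in C and one in T, as here, `enough_replicates` is FALSE.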


PraddyumnaR · 2024-08-09T07:20:22Z

Thank you for the insight, Jack. I am in the process of generating the rest of my files. I hope it works without any problems, since methyl summary generation was successful.

PraddyumnaR · 2024-08-16T09:54:17Z

Now that I have 3 samples in each of the control and treated groups, I am still getting this error. What could be the reason? Methyl summary was generated successfully, but now this problem appears.

methyl_summary <- create_methyl_summary(dmr_obj,
                                        control = 'C',
                                        treated = 'T',
                                        additional_summary_cols = list(
                                          c('sd', 'Group')
                                        ))
[1] "Number of columns in methyl_summary is correct"
methyl_summary <- find_DMR(methyl_summary, dmr_obj, fixed = c('Group'),
                           random = c('Individual'), reads_threshold = 3,
                           control = 'C', model = 'binomial',
                           analysis_type = 'group')

cbind(Meth, UnMeth) ~ (1 | Individual) + Group
<environment: 0x560204c00df8>
boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')

PraddyumnaR · 2024-08-17T04:17:59Z

I am facing problems with both group and individual DMR analysis, so I am attaching links to my bed files below. Please check them and help me resolve the issue.
https://drive.google.com/file/d/1G6iGbVv-GngvTxoMOFEHi43mWOL2Hqqw/view?usp=drive_link
https://drive.google.com/file/d/19uAc-zTp2oBot7c0Zibc8uaXY5wsFGg1/view?usp=drive_link
https://drive.google.com/file/d/1LNwmVZwFnsa6aSHWs0c4Ii-mRBBILZhh/view?usp=drive_link
https://drive.google.com/file/d/1RM2xPELraePGzvhSLTAhs7cgokxoh961/view?usp=drive_link


decibel-tom · 2024-08-20T21:02:15Z

What is the error you're facing here? The output looks correct to me

PraddyumnaR · 2024-08-21T05:06:47Z

boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')
boundary (singular) fit: see help('isSingular')

Is this supposed to happen while generating the output? It is taking too long to generate the output, even though I have given a bed file with only 2 chromosomes.

PraddyumnaR · 2024-08-22T03:52:56Z

It has been running for more than a day now. Is there any way to speed up this process? I have provided the bed files, so it would be great if you could try running it on your end and figure something out.

PraddyumnaR · 2024-08-22T12:34:45Z

I have been running two analyses since yesterday: one on the bed file with entries for chromosomes 1 and 2, and another on chromosome 1 only. The megaframes generated for these two sets have 3649274 lines (Chr 1 & 2) and 1936024 lines (Chr1). How much time will it take? I have to do the analysis for 12 chromosomes, so at this speed it will take 24 days to complete. Is it possible to speed up the group DMR analysis? Hope you reply soon.

decibel-tom · 2024-08-23T16:16:15Z

For larger analysis we definitely recommend splitting the data into smaller chunks and running on that. For example with whole genome analysis we recommend splitting the data by chromosome or more across multiple machines to run in parallel. Unfortunately we don't have immediate plans to speed up the code on the back end as that would require a major overhaul of the code, but it is something we hope to get to in the next year
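As a rough sketch of the splitting step, assuming the bed file has been read into a data frame with a Chromosome column (the toy data, column names and output paths here are illustrative only):

```r
# Toy stand-in for a bed-like table; real data would come from read.table().
bed <- data.frame(
  Chromosome = c("Chr1", "Chr1", "Chr2", "Chr3"),
  Position   = c(10, 20, 15, 30),
  Percent    = c(50, 0, 17, 33)
)

# One data frame per chromosome.
by_chrom <- split(bed, bed$Chromosome)

# Write each chunk to its own file so it can be processed separately,
# for example on a different machine.
out_dir <- tempdir()
out_files <- vapply(names(by_chrom), function(chrom) {
  path <- file.path(out_dir, paste0(chrom, ".bed"))
  write.table(by_chrom[[chrom]], path, sep = "\t",
              quote = FALSE, row.names = FALSE, col.names = FALSE)
  path
}, character(1))
```

Each per-chromosome file can then go through the usual pipeline on its own, and the results can be combined afterwards.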

PraddyumnaR · 2024-08-24T04:21:26Z

Okay, if that's the case then I think I will have to wait for it to be done chromosome by chromosome. Also, I completed the analysis on one chromosome file, but while doing that I faced yet another issue with the following command:

changepoint_cols = find_changepoint_col_options(methyl_summary)
[1] "Z_GroupT_small"
target_genes <- unique(dmr_obj$ZoomFrame_filtered$Gene)
methyl_summary <- changepoint_analysis(methyl_summary,
CG_penalty = 5,
CHG_penalty = 7,
CHH_penalty = 6,
target_genes = c(),
save_plots = TRUE,
z_col = "Z_GroupT_small",
whole_genome = TRUE)
Running whole genome changepoint analysis on chromosomes: Chr1
|======================================================================| 100%
Warning message:
The x argument of as_tibble.matrix() must have unique column names if .name_repair is omitted as of tibble 2.0.0.
ℹ Using compatibility .name_repair.
ℹ The deprecated feature was likely used in the sounDMR package.
Please report the issue to the authors.
This warning is displayed once every 8 hours.
Call lifecycle::last_lifecycle_warnings() to see where this warning was generated.

So the changepoint analysis ran successfully, but the volcano and line plots it was supposed to save were not saved. I also tried replacing TRUE with T; the volcano plots were generated on the R Graphics device but were not saved to the directory. When I ran the analysis on the demo data a month ago, it worked perfectly fine. What could be the issue here?

decibel-tom · 2024-08-28T19:49:04Z

Do you have any target genes that you're working with? The issue with the plots is that we found the line plots took too much RAM to produce for the entire genome, crashing a lot of our machines. So any data with > 100,000 rows the plots will not be displayed. Are you running this on the data split by chromosome? If that's the case I would recommend setting whole_genome = FALSE (even if your analysis is for whole genome) as the data will then be split gene by gene with a new plot being created and saved for each gene. If you do want to save the volcano plot for the whole genome I would recommend saving the plots on the R Graphics device. This is something that can be changed in a future version of sounDMR, but I don't have a good timeline on when that will be.
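For saving a plot by hand, the general base R pattern is to open a file device, draw, and close the device. This is a sketch with toy data. It assumes the plot is drawn with base graphics, which may not match how sounDMR draws its plots, and the file name is only an example.

```r
# Toy data standing in for per-position methylation shifts and Z scores.
set.seed(1)
toy <- data.frame(shift = rnorm(200), z = rnorm(200))

# Open a PDF device, draw, then close the device to write the file to disk.
plot_path <- file.path(tempdir(), "volcano_sketch.pdf")
pdf(plot_path, width = 8, height = 6)
plot(toy$shift, abs(toy$z),
     xlab = "Methylation shift", ylab = "|Z|", main = "Volcano sketch")
invisible(dev.off())
```

The file is only complete after dev.off() has been called.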

PraddyumnaR · 2024-08-29T04:13:57Z

I was not using the whole genome file; I ran the analysis on just the Chr1 data. I also tried it with whole_genome = FALSE, but it still didn't save the plots. The line plots were not even visible on the R Graphics device; only the volcano plot was generated. Apart from this, I had a query regarding the output of the analysis, where two files are generated: methyl_summary with headers:
"Chromosome" "Gene" "Position" "Strand" "CX" "Zeroth_pos" "S4_Z" "S4_MethChange" "S4_RD" "S5_Z" "S5_MethChange" "S5_RD" "S6_Z" "S6_MethChange" "S6_RD" "S1" "S2" "S3" "S4" "S5" "S6" "Treat_V_Control" "Control" "Treated" "C_sd" "T_sd" "Z_GroupT_small" "Estimate_GroupT_small" "StdErr_GroupT_small" "MethRegion_Z_GroupT_small" "MethRegionLength_Z_GroupT_small" "MethCytosines_Z_GroupT_small" "MethGroup_Z_GroupT_small"
Here S4, S5 and S6 were my treatment group.

and DMR_region_summary with headers:
"cp_group","Start","Stop","Gene","CX","Count","MethRegion_Z_GroupT_small","MethRegionLength_Z_GroupT_small","Treat_V_Control","Control","Estimate_GroupT_small","dmr_score","dmr_score2","dmr_score_Percentile","dmr_score2_Percentile"

Can you explain how to interpret these headers? It would be very helpful to know what these column entries signify.
I would also like to thank you for the support throughout these issues.

PraddyumnaR · 2024-08-29T12:42:47Z

Also, can you tell me whether we should consider the sample-wise methylation change (in our case "S4_MethChange", "S5_MethChange" and "S6_MethChange") or Treat_V_Control, given that we are performing a group-wise analysis? Which one is significant?

jcolicchio-soundag · 2024-08-30T17:12:46Z

Sample wise shifts in methylation are the "MethChange" columns, group wise shifts in methylation are Treat_V_Control.

As far as the other columns, they refer to dmr regions, or groups of regions with similar shifts in methylation. That file refers to these different regions and includes summary stats for each one. The "dmr_score" columns refer to a scoring metric described in our publication, while the other columns reflect the control methylation, shifts in methylation, T statistic, size of the region, and its start and stop. I will look to add better documentation for this as soon as I can.

PraddyumnaR · 2024-09-03T08:54:05Z

Thank you for the reply. I hope you publish a detailed description of the output in the future, which would help users with interpretation. It would also be helpful if you could update the code so that the experimental design starter is generated as intended, taking input from the user within the R environment itself.

PraddyumnaR changed the title ~~methyl_frame generation problem~~ methyl_summary generation problem Jul 11, 2024

PraddyumnaR closed this as completed Aug 9, 2024

PraddyumnaR reopened this Aug 16, 2024

PraddyumnaR closed this as completed Sep 4, 2024
