-
Notifications
You must be signed in to change notification settings - Fork 268
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about calling differential binding events #156
Comments
The command I used for calling differential binding events: macs2 callpeak -B -t D567_RNApolII_Control.mapped.rgid.sorted.filtered.dups_remove.bam -c D567_Input_Control.mapped.rgid.sorted.filtered.dups_remove.bam -n RNApolII_CT --nomodel --gsize 2.7e9 --tsize 67 --extsize 182 --keep-dup all --qvalue 0.05 macs2 callpeak -B -t D567_RNApolII_RT.mapped.rgid.sorted.filtered.dups_remove.bam -c D567_Input_RT.mapped.rgid.sorted.filtered.dups_remove.bam -n RNApolII_RT --nomodel --gsize 2.7e9 --tsize 67 --extsize 182 --keep-dup all --qvalue 0.05 macs2 bdgdiff --t1 RNApolII_CT_treat_pileup.bdg --c1 RNApolII_CT_control_lambda.bdg --t2 RNApolII_RT_treat_pileup.bdg --c2 RNApolII_RT_control_lambda.bdg --d1 15849561 --d2 16226600 -g 60 -l 182 --o-prefix diff_c1_vs_c2 |
Hi @toxin08x, the values reported in bdgdiff are log10 likelihood ratios indicating that a condition has higher signal than the other. Higher values in cond1.bed indicate higher signals in cond1 compared with cond2; and higher values in cond2.bed indicate higher signals in cond2 compared with cond1. They are all in log10 form. In your case, values in cond1 and cond2 should be larger than 3 (or likelihood ratio 1000), and values in common file should be smaller than 3. But log10 likelihood ratios we reported in common.bed are always non-negative -- they are the maximum of log10LR(cond1 vs cond2) and log10LR(cond2 vs cond1). |
I have a similar kind of question, the values in the last columns of all the three bed files are negative , something like this: is it usual to have such a high negative value? what does this states? |
@Kminie This must be an overflow issue... Could you confirm that MACS2 was compiled as a 64bit software in your computer (also, python and Numpy)? |
yes it was compiled dat way |
Please,
Best, Mozart |
Dear TaoLiu, I'm experiencing the same issue in one of my samples. All my ChIPs peak calling works well identifying differential peaks between the conditions. But in one of them I'm getting negative values:
I'm using macs2 --version macs2 2.1.1.20160309 Any help will be really appreciated, Igor |
As taoliu said above, it is indeed an overflow issue most probably by the incorrect-compiled-dependenies. I used macs2.1.0 and it works perfectly one year before. But recently I met the same issue. I was troubleshooting several days until I noticed that I have unmatched numpy and python from miniconda env installed several months before (macs2 was installed before by root without miniconda env). Now I fixed this issue by creating a new conda env and installing macs2 there. |
Hi there,
Macs2 is a great software and I used macs2 bdgdiff to call differential binding events of RNA polymerase ii following this link: https://github.com/taoliu/MACS/wiki/Call-differential-binding-events
I have two questions.I got three output files including diff_c1_vs_c2_c3.0_common.bed, diff_c1_vs_c2_c3.0_cond1.bed and diff_c1_vs_c2_c3.0_cond2.bed. But all the values in the last column of diff_c1_vs_c2_c3.0_common.bed are more than 0. As that link mentioned, the values are in log10 likelihood ratios, so some values should be negative value unless all the binding regions in this file enriched in condition 1 comparing to condition 2. In my observation, most binding regions are enriched in condition2 and most values are less than 1. In this case, I guess these values might be not in log10 likelihood ratios. Are these values really in log10?
After I get these values, I would like to do pathway analysis. I will annotate these regions by HOMER to obtain gene names. Then if these values are not in log10, is it okay to use log10(1/cond1_value), log10(1/common_value) and log10(cond2_value) with their gene names to do the pathway analysis?
Thanks,
Yunpeng
The text was updated successfully, but these errors were encountered: