Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Out of 70041 loci 70041 were discarded with biases not in range [0.5 2] #56

Open
nisar-BIGPM opened this issue Jul 15, 2022 · 3 comments

Comments

@nisar-BIGPM
Copy link

Hi,
Thank you for such a nice tool for Hi-C data processing!
I am processing DNase-HiC data. I want to apply Fithic2 to my data; I have prepared the input files from HiC-Pro.

For example:
-i
Screen Shot 2022-07-15 at 4 25 27 PM
-f
Screen Shot 2022-07-15 at 4 27 43 PM
-t
Screen Shot 2022-07-15 at 4 28 24 PM

I proceed to fithic with these input files; however, I encounter error #42. After, reading all comments about #42 and #39, I have removed the biases using: awk '{if ($3 > "0.5" && $3 < "2"){ print}}' and from fragments file I've removed all the validpairs with 0 using: awk '{if ($4 > "0" && $5 > "0"){print }}'.

After, that now I have
-t
Screen Shot 2022-07-15 at 4 38 14 PM
-f
Screen Shot 2022-07-15 at 4 37 20 PM

After this trimming these files, I used this command to run Fithic:

  • fithic -i fithic.interactionCounts.gz -f fragmentMappability.gz -t biases.gz -r 10000 -o ./fithic-out3/

After Fithic2 ran successfully on my files, when I checked the log file, I noticed that " Out of 70041 loci 70041 were discarded with biases not in range [0.5 2]".

Here, is the log file:
Screen Shot 2022-07-15 at 4 54 35 PM

And my significance file has all the values are equal to zero and this is the significance file:
Screen Shot 2022-07-15 at 4 56 39 PM

I am wondering how to resolve this issue. I don't know where I am making mistakes. I really appreciate it if you have a look into this and help me to fix this problem.

Thank you!
Best,
Nisar

@ay-lab
Copy link
Owner

ay-lab commented Jul 15, 2022 via email

@nisar-BIGPM
Copy link
Author

Hi,
Thank you for your reply.
I ran Fithic2 without biases file, and it gives me the results:
Screen Shot 2022-07-16 at 2 01 21 PM

However, I want to run it with a biases file. Keeping your comment in mind, I checked the biases file and also created new biases file using HiCKRy.py with HiC-Pro interactions and fragments.

Screen Shot 2022-07-16 at 2 31 50 PM

I also filtered the biases with awk '{if ($3 > 0){ print}}'| awk '{if ($3 < 2){ print}}' and fragments with awk '{if ($5 > 0){print }}'

Screen Shot 2022-07-16 at 2 54 15 PM

I used these files to run fithic2. However, it gives me the same results and all the loci were again discarded.

Could you please tell me how to fix the biases file? I appreciate any suggestions regarding this issue.

I am looking forward!

Many thanks for considering my request.

Best,
Nisar

@ay-lab
Copy link
Owner

ay-lab commented Aug 1, 2022

Seems like there is an issue with your normalization. Maybe about its convergence or maybe about how your reads are distributed. Could you plot coverage tracks per each bin across the genome and/or look at a histogram of coverage (reads per bin)? Bias vector should be normally distributed around 1 (with a mean 1) for FitHiC to work reasonably.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants