-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
help with an erro please #8
Comments
Hi, thanks for point out this error. It seems like you are running into a problem that happens with the automated binding sites width estimation. This is done through the Without having seen some of your data as an example I can't follow it up in more detail. For the moment I suggest, that you try to manually estimate the correct binding site size. Please have a look at the chapters 5.2 and 5.3 in the vignette on how to do this. Hope this helps! |
Hi, it seems like I also encountered the error right here. As the default vignette showed subset.chromosome to be limited, I have created a list to incorporate about ten more chromosomes, but still encountered the error. I tried to manually estimate the bsSize following the manual, but it cannot detect binding site signals: Current definition does not result in binding sites on all chromosomes where signal was present. Any suggestion? Thank you. |
Hi, |
I made a new issue for this as it sounds like a different problem for me. |
So you used your own data set and you find crosslinks on all of these chromosomes right? Yes, I have completed in RBFOX2 eCLIP-seq PureCLIP crosslinking site definitions globally. **Did you get pureclip peaks on all of these chromosomes? Yes, when I manually checked the crosslinking sites outputted by PureCLIP peaks, all peaks are present in the chromosomes. I am thinking on possibly the bigwig file that is having the problem, if the signal is not well captured. I will get back on the IGV screenshots in chromosome 8 which shows high enrichment score according to PureCLIP. Any recommendations on making a bigwig file? I was using deeptools bamcoverage function. |
I think one problem are indeed the bigwig files. BindingSiteFinder is build to work with single nucleotide crosslinks in bigwig format. For this you cannot turn the complete bam file into a bigwig but you need to select the nucleotide position 1nt upstream of the 5'end of the read in the bam file. To do that I usually turn the bam first into bed, select the upstream nucleotide and then turn it to bigwig after. (This approach comes from the iCLIP processing workflow, published by Busch et al 2020: https://www.sciencedirect.com/science/article/pii/S1046202318304948) Here is a possible bash code to do this:
It uses bedtools and bedGraphToBigWig from the kentUtils suite. Both can be installed with conda. |
The code above comes from our clip processing tool racoon_clip, which retrieves the single nucleotide crosslinks from the raw fastq files. There racoon_clip also does the alignment to obtain the bam files a bit differently from encode, because it does not allow soft-clipping at the 5' end of the read and by default does not allow multimappers. Both make the final crosslink positions a bit cleaner. So maybe you could consider running racoon_clip on the encode fastq files instead (https://racoon-clip.readthedocs.io/en/latest/). But it should also work with using the code above. |
If you rather want to do the shifting in R instead of bash you could also:
|
Awesome! Thanks for the suggestion. I will try the above options and get back to you as soon as possible. By the way, thank you for developing an awesome software! I actually had a very hard time to filter the robust crosslinking peaks in PureCLIP. |
Dear Melina, I have converted the bigwig file as what you described, and it works without any errors. Apparently it was caused by my lack of knowledge on CLIP-seq bigwig file. Hope this forum can help on the others who are having the same troubles. Thank you. |
Hello, I was trying to use your BindingSiteFinder and got an error:
Error in if (all(!df.global$increaseOverMin)) { :
missing value where TRUE/FALSE needed
I was using BSFind(object = KObds, anno.genes = gns, anno.transcriptRegionList = regions,
and it also gave following message:
2.16% (17/788) peaks overlap with multiple anno.genes in the given gene annotation.
A single instance of each peak is kept. This is recommended.
Could you please tell me waht could be the problem? thanks
The text was updated successfully, but these errors were encountered: