-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
input only 2~3 isomiRs / missing mismatch for 3'trim variation #17
Comments
I have noticed something similar to the first issue described by @kicheolkim. I am using isoMirs version 1.16.2 and output from miraligner version 3.5, and observe that the counts I get from an IsomirDataSeq object created using IsomirDataSeqFromFiles differs from the counts I calculate using base R code. I also observe that the results differ using IsomiRs functions when I load different files at a time. I have attached some test files and some test code. Is there perhaps some filtering happening, or lines dropped during merging? Thanks for making this package! Happy to provide more information if it would help.
|
After looking at the code I bit, I see that there is some internal filtering happening (e.g. clean_noise and remove_gt_n_changes applied to raw data), which explains the discrepancy. Would be helpful to have these filters noted in the documentation, and perhaps have an option to output the raw data without any filtering. But it is also useful to have the filters! |
Hi, I totally missed this, it was when somehow I got turned off notification, so I apologize. It was not intentional. I need to look a the code, it is true is due to some filtering: see this for more explanation #19 (comment), the parameters that affect the filtering are those. The mirtop issue is more concerning. Did you use mirtop to convert from other tool? if you could share the command you use, I can do more, or the inputs. Sorry I missed this, happy to help if still needed. cheers |
Hi, I'm glad you come back. I'm using sRNAtoolbox. So, I converted mirtop from sRNAtoolbox output. Here are the command lines that I used.
|
Thank you. I am aware that data coming from that tool can generate some issues. Can you check those sequences in the output of sRNAbench and see how they are defined? Normally Mir top will take the definition from the tool, so I am wondering how are defined from the very beginning. Let me know, I am very intrigued. You can share the output file as well from sRNAbench if you would like. |
Sure, here is the information of 3 isomiRs. You can also find sRNAbench (sRNAtoolbox) output from here. Hopefully, this would be helpful to solve the problem. sequence matureName hairpinName isoLabel sequenceVariant readCount RPMlib RPMtotal Thank you!! |
Hi, thank you for this. Actually, I can see how these three sequences are annotated only as 3p variants with -4 as the nucleotide variation ( Let me know your thoughts. Thanks! |
Hi,
I found 2 possible error/bug while using the package. I think first one is isomir package issue, but the second one may be related to miRTop.
input only 2~3 isomiRs
Data I tested for input is here, and I used this for input dataset:
I saw many warnings during input, is it possible that is related to this warning?
missing mismatch for 3'trim variation
Here is an example... Those sequences have different mismatches and 3' trimming together, but mism is 0 in the table.
I used the development version for both miRTop and isomiR package which I installed last week.
Thanks!!
Kicheol
The text was updated successfully, but these errors were encountered: