-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Understanding the concept behind the metrics used for FRASER calculations #9
Comments
Hi @jjyotikataria, did you already have a look at the explanations of these metrics in our vignette and in our preprint on bioRxiv? |
Yes, @c-mertes @ischeller . I went through the vignette and the paper. But I am not so much confident about what the splicing metrics sigh5, sigh3, theta3, theta5 are basically denoting. Also, can you give me a basic explanation regarding the p values and z scores. |
@c-mertes @ischeller I went through almost all the papers and references. But I am not able to be understanding it completely. Can you please tell me what the p value, z score basically represents in context of the genes of the samples we are working upon. Also, what does psi of 0 to range 1 represents practically? |
I still not really sure what part you did not understand regarding the p values and z scores, so I will give a brief basic explanation and hope that it helps: Basically, psi5 quantifies acceptor site usage for a given donor site, and psi3 quantifies donor site usage for a given acceptor site. The interpretation of them is therefore on the intron-level, and not at the gene-level. In FRASER, we model the expected value of psi3/psi5/theta for every intron, and then compute p values based on the beta-binomial distribution by comparing the observed read count of the intron with this expected psi3/psi5/theta value to assess the statistical significance of the observed count. We also provide delta psi3/psi5/theta values (i.e. the difference between observed and expected values). We don't recommend to use the z scores to find aberrant events, we mainly include them for comparison with other methods. Rather use the delta psi3/psi5/theta value (which are easy to interpret) together with the p value to find aberrant splicing events that are of interest to you. I hope this helps to answer your questions. |
@ischeller Very elaborately explained. Thank you. I clearly understood now what psi5 and psi3 ratios indicate. I have another question dear. Like you said, "Therefore, an aberrant psi5 value in one sample indicates aberrant acceptor site usage (and analogously aberrant donor site usage for aberrant psi3 values." How do one determine and tag that particular psi value be it psi5 or psi3 as aberrant? Also, observed or the experimental psi values are the values which we get from the real time data by giving the required bam files. What are exactly the expected psi values and what is the criteria for this expectation It will make clear what delta psi is i.e. difference between actual observed and the expected values. |
@jjyotikataria sorry for lack of response. I was wondering if these questions still remained unanswered or if you still require help here? Thanks! |
@c-mertes @ischeller It will be great if you guys can explain more about the metrics sigh3 and sigh5 and theta 5 and theta3 used for the calculations of FRASER. It will be really helpful.
The text was updated successfully, but these errors were encountered: