Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

scoreType argument #87

Closed
pascaltimshel opened this issue Feb 19, 2021 · 2 comments
Closed

scoreType argument #87

pascaltimshel opened this issue Feb 19, 2021 · 2 comments

Comments

@pascaltimshel
Copy link

hi fgsea dev team!

The functions fgseaSimple() and fgseaMultilevel() have a scoreType argument documented as:
scoreType: This parameter defines the GSEA score type. Possible options are ("std", "pos", "neg")

This docstring isn't very helpful because I can't find any documentation for "GSEA score type" for the original GSEA software. I've also looked through the fgsea source code, but I still don't understand what the argument does.

Q1: can you explain what the GSEA score type parameter does? (Can you refer to a description of "GSEA score type" in the original GSEA software?)
Q2: when is it appropriate to use e.g. "pos" instead of "std"?
Q3: can you provide an example showing the differences in the results/interpretation when running with "std", "pos" and "neg"?

Thanks for creating this amazing and blazing fast piece of software!

@assaron
Copy link
Member

assaron commented Mar 1, 2021

It's not a parameter from the original GSEA paper. It was developed by request for one-tailed tests, when you are interested in either only positive enrichment ("pos") or negateive ("neg"). You can check out discussion here: #27

There are two particular use cases:

  1. When your stats vector is positive (not signed, as usual), for example, the absolute value of logFC, and you're interested in deregulated pathways that has more positive logFC compared to a random pathway
  2. When your stats vector is not balanced, for example, when you do differeitntial expression in scRNA-seq between a single cluster and all other clusters. Usually, there are some specific genes (with high positive metric), but no negative specific genes (so there are no highly negative values, only somewhat negative). Similarly, you're looking for a pathways "positively" specific to the cluster, and there are no "negatively" specific pathways.

Some people would fine it useful, but we didn't do any analysis to show what are properties of this non-standard score types.

@assaron assaron closed this as completed Mar 16, 2021
@pascaltimshel
Copy link
Author

thank you @assaron !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants