Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is it possible to dig into relative contributions of various genes to the enrichit score #14

Closed
tomthomas3000 opened this issue Feb 3, 2021 · 9 comments

Comments

@tomthomas3000
Copy link

This is a great package - thank you for making it!

I was wondering whether it would be possible to look at the relative contribution of individual genes to the enrichIt score by any chance? Thank you.

@ncborcherding
Copy link
Owner

Hey Tom,

Interesting idea - do you have an example of what you have in mind?

Thanks,
Nick

@tomthomas3000
Copy link
Author

For ex: out of a set of 20 genes supplied to the enrichit function, perhaps only 4-5 genes play an overwhelmingly important role in the enrichit score output. If this was the case, would it be possible identify the relative contribution of the individual genes to the score?

@daccachejoe
Copy link

hi - I am also wondering this. Any updates?

@ncborcherding
Copy link
Owner

ncborcherding commented Aug 3, 2021

Hey thanks for the question - working on a major overhaul of the package that will have the enrichment plot function. I do not have a timeline yet though.

@ncborcherding
Copy link
Owner

This took quite awhile (actually a lot harder than I thought), but the newest dev version of escape has enrichmentPlot() a function to examine the distribution of ranked gene order across single-cell groups.

https://ncborcherding.github.io/vignettes/escape_vignette.html#55_6_Enrichment_Plots

As of right now it just shows the mean rank across the group - I will work on adding the enrichment score and p-value calculation directly to the plot.

@tomthomas3000
Copy link
Author

tomthomas3000 commented Oct 1, 2021 via email

@ncborcherding
Copy link
Owner

Hey Tom,

Absolutely - I think there are two approaches right now in the field for gene set for single-cell data, 1) pseudobulk and 2) true single-cell enrichment. There are some pluses/minuses to both approaches - I think pseudobulk produces more robust enrichment results for example. The calculation of enrichment at a single-cell level is probably better for the analysis of heterogeneity. You can use escape for either approach.

Let me know if you have any other questions.
Nick

@tomthomas3000
Copy link
Author

tomthomas3000 commented Oct 4, 2021 via email

@ncborcherding
Copy link
Owner

Hey Tom,

Both the "ssGSEA" and the "UCell" method use raw count data. No need for normalization before running.

Nick

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants