Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

can base_level be multiple if has many groups? #66

Closed
whiteorchid opened this issue Jan 1, 2023 · 2 comments
Closed

can base_level be multiple if has many groups? #66

whiteorchid opened this issue Jan 1, 2023 · 2 comments

Comments

@whiteorchid
Copy link

whiteorchid commented Jan 1, 2023

Dear. author,

Thanks a lot for the great tools!

May I know if it's possible to designate the base_level to multiple values or only one? If have multiple groups/comparisons, need to run separately (separate the units file and the sample file)?

Thank you very much!

diffexp:
  # samples to exclude (e.g. outliers due to technical problems)
  exclude:
  # model for sleuth differential expression analysis
  models:
    model_X:
      full: ~condition + batch
      reduced: ~batch
      # Binary valued covariate that shall be used for fold change/effect size
      # based downstream analyses.
      primary_variable: condition
      # base level of the primary variable (will be considered as denominator
      # in the fold change/effect size estimation).
      **base_level**: untreated
  # significance level to use for volcano, ma- and qq-plots
@dlaehnemann
Copy link
Collaborator

The base_level: is only meant for calculating fold changes in expression. And this usually only makes sense against some control condition, like a set of untreated samples in the example. Thus, you can only have one base_level per model that you compare against. You could hack around this, by specifying the same model twice, only differing in the base_level that you specify. But you then have to double-check that your comparisons both make sense.

I hope that helps?

That said, maybe you want to do something else, e.g. do multiple group comparisons. For that, you can have any number of levels in the primary_variable: column (condition in the example). So you could for example have values genotype_X, genotype_Y, and genotype_Z in the respective column of your samples.tsv.

And if you want to understand more details about the differential expression testing, you could check out the sleuth documentation:
https://pachterlab.github.io/sleuth/walkthroughs

@whiteorchid
Copy link
Author

Thanks a lot! Much appreciate! Best!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants