Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generating pseudobulk #5

Closed
inofechm opened this issue Nov 2, 2021 · 2 comments
Closed

Generating pseudobulk #5

inofechm opened this issue Nov 2, 2021 · 2 comments

Comments

@inofechm
Copy link

inofechm commented Nov 2, 2021

Can you please direct me to the code used to generate pseudobulk rna-seq from the paper and how to run the pam50 subtyping on the pseudobulk?
I see the bulk-rna seq pam50 code but want to apply the pseudobulk method for my own breast samples so that would be appreciated.
Thank you

@dlroden
Copy link
Collaborator

dlroden commented Nov 10, 2021

Hi, thanks for your query.
This code isn't in the repo. For the Pseudobulk, we just summed up all the reads for each gene across all cells. So, it's the basic rowSums() function in R that was applied to the count matrix of each individual tumor.

We have also found that using the raw R2 fastq files as input to a bulk RNAseq pipeline will give comparable results to the count summation method.

Hope this helps

@yewero
Copy link

yewero commented Feb 14, 2022

@inofechm I find the related codes are in the ecotypes/generate_pseudobulk_mixture_file.snakemake.R file. There are two methods mentioned: sum and average. The first one could be what you need.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants