Generating pseudobulk #5

inofechm · 2021-11-02T19:52:49Z

Can you please direct me to the code used to generate pseudobulk rna-seq from the paper and how to run the pam50 subtyping on the pseudobulk?
I see the bulk-rna seq pam50 code but want to apply the pseudobulk method for my own breast samples so that would be appreciated.
Thank you

dlroden · 2021-11-10T11:16:02Z

Hi, thanks for your query.
This code isn't in the repo. For the Pseudobulk, we just summed up all the reads for each gene across all cells. So, it's the basic rowSums() function in R that was applied to the count matrix of each individual tumor.

We have also found that using the raw R2 fastq files as input to a bulk RNAseq pipeline will give comparable results to the count summation method.

Hope this helps

yewero · 2022-02-14T13:22:38Z

@inofechm I find the related codes are in the ecotypes/generate_pseudobulk_mixture_file.snakemake.R file. There are two methods mentioned: sum and average. The first one could be what you need.

dlroden closed this as completed Nov 10, 2021

ZeroLi-Bio mentioned this issue Jan 17, 2022

Question about Pseudobulk #7

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generating pseudobulk #5

Generating pseudobulk #5

inofechm commented Nov 2, 2021

dlroden commented Nov 10, 2021

yewero commented Feb 14, 2022 •

edited

Loading

Generating pseudobulk #5

Generating pseudobulk #5

Comments

inofechm commented Nov 2, 2021

dlroden commented Nov 10, 2021

yewero commented Feb 14, 2022 • edited Loading

yewero commented Feb 14, 2022 •

edited

Loading