What is the pseudo normal used to call mutations in cell lines? #24

Closed
ytakemon opened this issue Apr 22, 2021 · 5 comments

Comments

@ytakemon

Hello,

I'm trying to process WGS data from a cell line in a way similar to your pipeline at DepMap. I noticed that your workflow doc has a note under the mutations slide saying "this pipeline requires a matched normal, so we use a pseudo normal for all cell lines samples". Could you explain what this pseudo normal is, and would it be possible for you to share this data with me?

Thank you

ytakemon changed the title from "What is pseudo normal used to call mutations in cell lines?" to "What is the pseudo normal used to call mutations in cell lines?" on Apr 23, 2021
@javadnoorb
Contributor

javadnoorb commented Apr 23, 2021

Hi,

For our Agilent WES samples we use:
gs://firecloud-tcga-open-access/tutorial/bams/C835.HCC1143_BL.4.bam
I think this is publicly available.

For our ICE WES samples we use a germline blood sample from the CCLF project:
gs://fc-38a1a377-72c6-4e90-917f-e4bb709b8f2c/CCLF_RCRF1009-Normal-SM-F3R8L/seq_data_v2/CCLF_RCRF1009GL.bam
I'm not sure if this is publicly available, but give it a shot and let me know.

For WGS we use this GTEx sample:
GTEX-111FC-0001-SM-6WBTJ
I think you'd need dbGaP access for this one and would have to get it through the GTEx project.

@jkobject feel free to chime in if I'm missing some information
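
For anyone scripting around this, the three pseudo normals listed above can be kept in a small lookup table. This is only a minimal sketch: the assay keys (`agilent_wes`, `ice_wes`, `wgs`) and the helper names are made up for illustration and are not taken from the DepMap pipeline. The paths and the sample ID are copied verbatim from the comment above.

```python
# Pseudo normals per assay, copied from the comment above.
# The dictionary keys and function names are hypothetical, chosen for this sketch.
PSEUDO_NORMALS = {
    "agilent_wes": "gs://firecloud-tcga-open-access/tutorial/bams/C835.HCC1143_BL.4.bam",
    "ice_wes": (
        "gs://fc-38a1a377-72c6-4e90-917f-e4bb709b8f2c/"
        "CCLF_RCRF1009-Normal-SM-F3R8L/seq_data_v2/CCLF_RCRF1009GL.bam"
    ),
    # For WGS only a GTEx sample ID was given, not a bucket path.
    "wgs": "GTEX-111FC-0001-SM-6WBTJ",
}


def pseudo_normal_for(assay: str) -> str:
    """Return the pseudo normal reference listed for an assay type."""
    key = assay.strip().lower()
    if key not in PSEUDO_NORMALS:
        raise ValueError(f"no pseudo normal listed for assay {assay!r}")
    return PSEUDO_NORMALS[key]


def is_gcs_path(reference: str) -> bool:
    """True if the reference is a Google Cloud Storage path rather than a sample ID."""
    return reference.startswith("gs://")


if __name__ == "__main__":
    for assay in PSEUDO_NORMALS:
        ref = pseudo_normal_for(assay)
        kind = "GCS path" if is_gcs_path(ref) else "sample ID (request access separately)"
        print(f"{assay}: {ref} [{kind}]")
```

Whether a given `gs://` path is readable from your account can then be checked with something like `gsutil ls <path>`. The WGS entry is a sample ID and would have to be requested through GTEx/dbGaP as noted above.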

@ytakemon
Author

ytakemon commented Apr 23, 2021

Thanks for the quick response!

It looks like I am not able to access the CCLF data, but that's okay since I'm only interested in the WGS data at the moment.
While trying to figure it out on my own I actually duplicated my question on the DepMap community forum. Sorry about this.

Having now re-read the Ghandi et al. (2019) paper, I see that a pseudo normal (i.e., a panel of normals, PoN) was created from 8,000 TCGA normal samples. Is that the same as the GTEx sample GTEX-111FC-0001-SM-6WBTJ you are referring to?

@javadnoorb
Contributor

javadnoorb commented Apr 23, 2021 via email

@ytakemon
Author

I see. Thank you for clarifying what the pseudo normal is and where the PoN originated. For the PoN, are you able to share what the sample IDs were and what criteria were used to select them (besides being normal tissue)? I will see if I can request them from the GTEx biobank.

@javadnoorb
Contributor

Closing this, as it can be followed here.
