Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cBioDataPack() fails despite valid studyId #69

Closed
ghost opened this issue Jul 1, 2023 · 2 comments
Closed

cBioDataPack() fails despite valid studyId #69

ghost opened this issue Jul 1, 2023 · 2 comments
Assignees
Labels

Comments

@ghost
Copy link

ghost commented Jul 1, 2023

hello, followed the docs, and while, as presented there, mae<-cBioDataPack("acc") downloads a multiassayexperiment, the same command,

mae<-cBioDataPack("brain_cptac_2020")

fails with

Error in .check_study_id_building(cancer_study_id, "pack_build", ask = ask) : 
  'studyId', brain_cptac_2020, not found. See 'getStudies()'.

however,

getStudies(cbio)[30,]$studyId

returns "brain_cptac_2020" - precisely what is input. What is going on here?

@LiNk-NY
Copy link
Contributor

LiNk-NY commented Jul 3, 2023

Hi @vlaufer
Thanks for reporting. It is resolved here a42b2a0
There is still an issue with mapping SAMPLE_ID to PATIENT_ID in the assays.
Currently when there is no match with the PATIENT_ID, the assays are put in the metadata(mae).
I will work on a fix.
Best,
Marcel

@LiNk-NY LiNk-NY closed this as completed Jul 3, 2023
@LiNk-NY
Copy link
Contributor

LiNk-NY commented Jul 3, 2023

I've incorporated information from SAMPLE_ID from the datasets to map and build SummarizedExperiment objects.
Now, you should get an object that looks like the following:

> (mae <- cBioDataPack("brain_cptac_2020"))
A MultiAssayExperiment object of 7 listed
 experiments with user-defined names and respective classes.
 Containing an ExperimentList class object of length 7:
 [1] cna: SummarizedExperiment with 19380 rows and 190 columns
 [2] linear_cna: SummarizedExperiment with 19380 rows and 190 columns
 [3] mrna_seq_v2_rsem_zscores_ref_all_samples: SummarizedExperiment with 18209 rows and 188 columns
 [4] mrna_seq_v2_rsem: SummarizedExperiment with 18209 rows and 188 columns
 [5] mutations: RaggedExperiment with 9951 rows and 200 columns
 [6] protein_quantification_zscores: SummarizedExperiment with 6429 rows and 218 columns
 [7] protein_quantification: SummarizedExperiment with 6429 rows and 218 columns
Functionality:
 experiments() - obtain the ExperimentList instance
 colData() - the primary/phenotype DataFrame
 sampleMap() - the sample coordination DataFrame
 `$`, `[`, `[[` - extract colData columns, subset, or experiment
 *Format() - convert into a long or wide DataFrame
 assays() - convert ExperimentList to a SimpleList of matrices
 exportClass() - save data to flat files

These changes are in the latest version of cBioPortalData in Bioc-devel (package version 2.13.4).

@LiNk-NY LiNk-NY self-assigned this Jul 5, 2023
@LiNk-NY LiNk-NY added the bug label Jul 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant