Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

standR Workflow for Cancer Transcriptome Atlas (CTA) #27

Open
ErickMUO opened this issue Jan 12, 2024 · 2 comments
Open

standR Workflow for Cancer Transcriptome Atlas (CTA) #27

ErickMUO opened this issue Jan 12, 2024 · 2 comments
Labels
enhancement New feature or request

Comments

@ErickMUO
Copy link

Hey standR team!

I am reaching out with a question regarding the application of the standR workflow to the Cancer Transcriptome Atlas (CTA). I am encountering challenges while attempting to utilize the readGeoMx import function, and I believe it may be related to the specific characteristics of the CTA dataset.

The issue revolves around the presence of several probes for the same gene within the CTA, leading to errors in the import process. I am seeking guidance on whether the standR workflow is compatible with the CTA dataset, especially considering the multiple probes for individual genes.

Is there any way I can circumvent this issue?

Thank you in advance for your time and assistance.

spe <- readGeoMx(countFile, sampleAnnoFile, featureAnnoFile,NegProbeName = "Negative Probe")

Error in .rowNamesDF<-(x, value = value) :
duplicate 'row.names' are not allowed
In addition: Warning messages:
1: Setting row names on a tibble is deprecated.
2: non-unique values when setting 'row.names': ‘A2M’, ‘ABCB1’, ‘ABCF1’, ‘ABL1’, ‘ACOT12’, ‘ACSF3’, ‘ACTA2’, ‘ACTB’, ‘ACTR3B’, ‘ACVR1B’, ‘ACVR1C’, ‘ACVR2A’, ‘ACY1’, ‘ADA’, ‘ADAM12’, ‘ADGRE1’, ‘ADGRE5’, ‘ADH1A/B/C’, ‘ADH4’, ‘ADH6’, ‘ADM’, ‘AFDN’, ‘AICDA’, ‘AIRE’, ‘AKAP1’, ‘AKR1C4’, ‘AKT1’, ‘AKT2’, ‘AKT3’, ‘ALCAM’, ‘ALDOA’, ‘ALDOC’, ‘ALK’, ‘ALKBH2’, ‘ALKBH3’, ‘AMBP’, ‘AMER1’, ‘AMH’, ‘ANGPT1’, ‘ANGPT2’, ‘ANGPTL4’, ‘ANKRD28’, ‘ANLN’, ‘ANP32B’, ‘ANXA1’, ‘APC’, ‘APH1B’, ‘API5’, ‘APLNR’, ‘APOA1’, ‘APOA2’, ‘APOA4’, ‘APOB’, ‘APOC2’, ‘APOC3’, ‘APOE’, ‘APOL6’, ‘APOM’, ‘APP’, ‘AQP9’, ‘AR’, ‘AREG’, ‘ARG1’, ‘ARG2’, ‘ARID1A’, ‘ARID1B’, ‘ARID2’, ‘ARMH3’, ‘ARNT’, ‘ARNT2’, ‘ASCL1’, ‘ASL’, ‘ASNS’, ‘ASPA’, ‘ASPG’, [... truncated]

sessionInfo()
R version 4.3.0 (2023-04-21)
Platform: aarch64-apple-darwin20 (64-bit)
Running under: macOS 14.2.1

Matrix products: default
BLAS: /System/Library/Frameworks/Accelerate.framework/Versions/A/Frameworks/vecLib.framework/Versions/A/libBLAS.dylib
LAPACK: /Library/Frameworks/R.framework/Versions/4.3-arm64/Resources/lib/libRlapack.dylib; LAPACK version 3.11.0

locale:
[1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8

time zone: Europe/Stockholm
tzcode source: internal

attached base packages:
[1] stats4 stats graphics grDevices utils datasets methods base

other attached packages:
[1] readxl_1.4.3 lubridate_1.9.3 forcats_1.0.0
[4] stringr_1.5.1 purrr_1.0.2 readr_2.1.4
[7] tidyr_1.3.0 tidyverse_2.0.0 standR_1.5.4
[10] tibble_3.2.1 ggforce_0.4.1 dplyr_1.1.3
[13] GeoMxWorkflows_1.8.0 GeomxTools_3.5.0 NanoStringNCTools_1.10.0
[16] ggplot2_3.4.4 S4Vectors_0.40.1 Biobase_2.62.0
[19] BiocGenerics_0.48.1

loaded via a namespace (and not attached):
[1] IRanges_2.36.0 vroom_1.6.4 progress_1.2.2
[4] pacman_0.5.1 vsn_3.70.0 goftest_1.2-3
[7] Biostrings_2.70.1 vctrs_0.6.4 spatstat.random_3.2-1
[10] digest_0.6.33 png_0.1-8 ggrepel_0.9.4
[13] deldir_1.0-9 parallelly_1.36.0 MASS_7.3-60
[16] reshape_0.8.9 reshape2_1.4.4 httpuv_1.6.12
[19] foreach_1.5.2 qvalue_2.34.0 withr_2.5.2
[22] xfun_0.41 ggfun_0.1.3 ellipsis_0.3.2
[25] survival_3.5-7 memoise_2.0.1 ggbeeswarm_0.7.2
[28] clusterProfiler_4.10.0 gson_0.1.0 systemfonts_1.0.5
[31] tidytree_0.4.5 zoo_1.8-12 pbapply_1.7-2
[34] GGally_2.1.2 prettyunits_1.2.0 KEGGREST_1.42.0
[37] promises_1.2.1 httr_1.4.7 restfulr_0.0.15
[40] globals_0.16.2 fitdistrplus_1.1-11 rstudioapi_0.15.0
[43] miniUI_0.1.1.1 generics_0.1.3 DOSE_3.28.0
[46] ggalluvial_0.12.5 reactome.db_1.86.0 curl_5.1.0
[49] zlibbioc_1.48.0 ggraph_2.1.0 polyclip_1.10-6
[52] GenomeInfoDbData_1.2.11 ExperimentHub_2.10.0 SparseArray_1.2.2
[55] interactiveDisplayBase_1.40.0 xtable_1.8-4 evaluate_0.23
[58] S4Arrays_1.2.0 BiocFileCache_2.10.1 preprocessCore_1.64.0
[61] hms_1.1.3 GenomicRanges_1.54.1 irlba_2.3.5.1
[64] colorspace_2.1-0 filelock_1.0.2 ROCR_1.0-11
[67] reticulate_1.34.0 spatstat.data_3.0-3 magrittr_2.0.3
[70] lmtest_0.9-40 later_1.3.1 viridis_0.6.4
[73] ggtree_3.10.0 lattice_0.22-5 spatstat.geom_3.2-7
[76] future.apply_1.11.0 scattermore_1.2 XML_3.99-0.15
[79] shadowtext_0.1.2 cowplot_1.1.1 matrixStats_1.1.0
[82] RcppAnnoy_0.0.21 pillar_1.9.0 nlme_3.1-163
[85] iterators_1.0.14 compiler_4.3.0 RSpectra_0.16-1
[88] stringi_1.8.1 minqa_1.2.6 tensor_1.5
[91] SummarizedExperiment_1.32.0 GenomicAlignments_1.38.0 MPO.db_0.99.7
[94] plyr_1.8.9 crayon_1.5.2 abind_1.4-5
[97] BiocIO_1.12.0 gridGraphics_0.5-1 locfit_1.5-9.8
[100] sp_2.1-1 graphlayouts_1.0.2 bit_4.0.5
[103] fastmatch_1.1-4 codetools_0.2-19 openssl_2.1.1
[106] plotly_4.10.3 mime_0.12 ff_4.0.9
[109] splines_4.3.0 Rcpp_1.0.11 fastDummies_1.7.3
[112] dbplyr_2.4.0 sparseMatrixStats_1.14.0 HDO.db_0.99.1
[115] cellranger_1.1.0 knitr_1.45 blob_1.2.4
[118] utf8_1.2.4 BiocVersion_3.18.0 lme4_1.1-35.1
[121] fs_1.6.3 listenv_0.9.0 oligo_1.66.0
[124] DelayedMatrixStats_1.24.0 ggplotify_0.1.2 Matrix_1.6-3
[127] statmod_1.5.0 tzdb_0.4.0 tweenr_2.0.2
[130] pkgconfig_2.0.3 pheatmap_1.0.12 tools_4.3.0
[133] cachem_1.0.8 RSQLite_2.3.3 numDeriv_2016.8-1.1
[136] viridisLite_0.4.2 DBI_1.1.3 celldex_1.12.0
[139] graphite_1.48.0 rmarkdown_2.25 fastmap_1.1.1
[142] scales_1.2.1 grid_4.3.0 outliers_0.15
[145] ica_1.0-3 Seurat_5.0.0 Rsamtools_2.18.0
[148] AnnotationHub_3.10.0 patchwork_1.1.3 BiocManager_1.30.22
[151] dotCall64_1.1-0 graph_1.80.0 RANN_2.6.1
[154] farver_2.1.1 tidygraph_1.2.3 scatterpie_0.2.1
[157] yaml_2.3.7 MatrixGenerics_1.14.0 ggthemes_4.2.4
[160] rtracklayer_1.62.0 cli_3.6.1 leiden_0.4.3
[163] lifecycle_1.0.4 askpass_1.2.0 uwot_0.1.16
[166] BiocParallel_1.36.0 MeSHDbi_1.38.0 timechange_0.2.0
[169] gtable_0.3.4 rjson_0.2.21 umap_0.2.10.0
[172] ggridges_0.5.4 progressr_0.14.0 parallel_4.3.0
[175] ape_5.7-1 limma_3.58.1 jsonlite_1.8.7
[178] RcppHNSW_0.5.0 affxparser_1.74.0 bitops_1.0-7
[181] progeny_1.24.0 HPO.db_0.99.2 bit64_4.0.5
[184] depmap_1.16.0 Rtsne_0.16 yulab.utils_0.1.0
[187] ReactomePA_1.46.0 spatstat.utils_3.0-4 SeuratObject_5.0.0
[190] GOSemSim_2.28.0 lazyeval_0.2.2 shiny_1.7.5.1
[193] htmltools_0.5.7 affy_1.80.0 enrichplot_1.22.0
[196] GO.db_3.18.0 sctransform_0.4.1 rappdirs_0.3.3
[199] glue_1.6.2 spam_2.10-0 XVector_0.42.0
[202] RCurl_1.98-1.13 treeio_1.26.0 gridExtra_2.3
[205] EnvStats_2.8.1 boot_1.3-28.1 igraph_1.5.1
[208] R6_2.5.1 SingleCellExperiment_1.24.0 DESeq2_1.42.0
[211] ggiraph_0.8.7 cluster_2.1.4 aplot_0.2.2
[214] GenomeInfoDb_1.38.1 nloptr_2.0.3 DelayedArray_0.28.0
[217] tidyselect_1.2.0 vipor_0.4.5 xml2_1.3.5
[220] oligoClasses_1.64.0 AnnotationDbi_1.64.1 future_1.33.0
[223] munsell_0.5.0 KernSmooth_2.23-22 BiocStyle_2.30.0
[226] affyio_1.72.0 data.table_1.14.8 htmlwidgets_1.6.2
[229] fgsea_1.28.0 RColorBrewer_1.1-3 biomaRt_2.58.0
[232] rlang_1.1.2 spatstat.sparse_3.0-3 spatstat.explore_3.2-5
[235] lmerTest_3.1-3 uuid_1.1-1 fansi_1.0.5
[238] beeswarm_0.4.0

@ningbioinfo
Copy link
Member

Hi @ErickMUO , as long as you're using the "probeQC count" as the count input, then the negative probes can be removed from the data in most cases since they were used to generate the "probeQC count", and this is essentially what the function is doing. In the current version unfortunately, the best way to do it is change the name of the negative probes beforehand and pass the names of the negative probes to the data. I'll mark this as an improvement so that the function can recognise this types of error and address for the users automatically (with on and off switch of course). This will probably happen in the next couple weeks.

@ningbioinfo ningbioinfo added the enhancement New feature or request label Jan 31, 2024
@ErickMUO
Copy link
Author

ErickMUO commented Feb 1, 2024

That sounds fantastic! I'll give the recommendations a try and also explore the updated version. Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants