Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE] Subsetting SRA projects by samples prior to making the RangedSummarizedExperiment object #11

Open
schelhorn opened this issue Nov 29, 2021 · 0 comments
Labels
enhancement New feature or request
Milestone

Comments

@schelhorn
Copy link

schelhorn commented Nov 29, 2021

Hi Leonardo,

thanks for recount3; I was wondering whether there is a hack to produce the RangedSummarizedExperiment objects using recount3::create_rse() for only a subset of samples in a particular SRA project. The main reason for this is that making the SE object seems to take a lot of memory (and time) for the larger, 3000+ sample projects in SRA just get to the subset of samples one is interested in.

Possibly this is already implemented somewhere in the package (since the read counts function seems to have a sample selector, see read_counts <- function(counts_file, samples = NULL)); if so, I'd be happy about a pointer, thanks.

If it is not implemented yet, perhaps a simple feature implementation would be allowing the user to look into the SRA project metadata by recount3::available_samples(my_sra_project), selecting samples by their external_id, and then providing these to recount3::create_rse() using a new parameter external_sample_ids=NULL. This could then be directly used by recount3::create_rse_manual() in its call to recount3::read_counts() (perhaps after some sanity checking for existing and/or duplicated external sample IDs).

@schelhorn schelhorn changed the title [FEATURE] Your bug or feature request [FEATURE] Subsetting SRA projects by samples prior to making the RangedSummarizedExperiment object Nov 29, 2021
@lcolladotor lcolladotor added the enhancement New feature or request label Nov 30, 2023
@lcolladotor lcolladotor added this to the bioc v3.20 milestone Nov 30, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Status: Todo
Development

No branches or pull requests

2 participants