
P-value aggregate #148

Merged: 16 commits merged into devel on Mar 12, 2018

Conversation

pimentel (Collaborator)

No description provided.

@lynnyi (Contributor) left a comment

TODOs

R/model.R Outdated
@@ -308,7 +308,8 @@ tests.sleuth <- function(obj, lrt = TRUE, wt = TRUE) {
 #' results_table <- sleuth_results(sleuth_obj, 'conditionIP')
 #' @export
 sleuth_results <- function(obj, test, test_type = 'wt',
-  which_model = 'full', rename_cols = TRUE, show_all = TRUE) {
+  which_model = 'full', rename_cols = TRUE, show_all = TRUE,
+  aggregate_pval = obj$gene_aggregate) {
@lynnyi (Contributor), Nov 14, 2017

Change obj$gene_aggregate to obj$aggregate_pval. It will be @pimentel's job to make sure the change to obj$gene_aggregate is applied globally.

Collaborator

On the question: yes.
And probably don't expose aggregate_pval as an external argument; just do an internal check (obj$pval_aggregate and obj$gene_mode should not both be TRUE, but it is fine if both are FALSE). That should be set once with sleuth_prep, and then the user doesn't have to think about it again. The internal check will make sure the user didn't inappropriately change those booleans between sleuth_prep and sleuth_results.
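For illustration, a minimal sketch of the internal consistency check described above; the helper name and message wording are assumptions, not code from this PR:

    # Hypothetical helper; sleuth's actual check may differ in name and wording.
    check_aggregation_mode <- function(obj) {
      if (isTRUE(obj$pval_aggregate) && isTRUE(obj$gene_mode)) {
        stop("'pval_aggregate' and 'gene_mode' cannot both be TRUE; ",
             "set one of them to FALSE in sleuth_prep().")
      }
      invisible(obj)
    }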

R/model.R Outdated
if(any(res$pval < 10^-323, na.rm=TRUE)) {
warning('Extreme p-values around and below 10^-320 will generate 0 pvalues in aggregation')
}
res <- res %>%
  group_by(ens_gene) %>%
  summarise(ext_gene = unique(ext_gene),
            num_aggregated_transcripts = length(!is.na(pval)),
            sum_mean_obs_counts = sum(mean_obs, na.rm=TRUE),
            pval = lancaster(pval, mean_obs))
Contributor

Change to data.table.
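For readers unfamiliar with the lancaster() call in the snippet above (it comes from the aggregation package), here is a rough sketch of Lancaster's weighted p-value combination under the usual convention; this is an assumption about the method, not the package's implementation:

    # Sketch of Lancaster's method: Fisher's method with per-test degrees of
    # freedom equal to the weights. NOT the aggregation package's code.
    lancaster_sketch <- function(pval, weights) {
      ok <- !is.na(pval) & !is.na(weights) & weights > 0
      p <- pval[ok]
      w <- weights[ok]
      # map each p-value to an upper-tail chi-square quantile with df = weight
      t_stat <- sum(qchisq(p, df = w, lower.tail = FALSE))
      # combined p-value: upper tail of chi-square with df = sum of weights
      pchisq(t_stat, df = sum(w), lower.tail = FALSE)
    }

    # e.g. lancaster_sketch(c(0.01, 0.2, 0.5), weights = c(50, 10, 5))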

@@ -238,7 +238,7 @@ sleuth_prep <- function(
   obs_raw <- dplyr::bind_rows(lapply(kal_list, function(k) k$abundance))

   counts_test <- data.table::as.data.table(obs_raw)
-  counts_test <- counts_test[, total = .(total = sum(est_counts)), by = "sample"]
+  counts_test <- counts_test[, .(total = sum(est_counts)), by = "sample"]
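As context for the fix above: in data.table, grouped summaries are written as a list via .() in the j slot, so the extra "total =" prefix in the removed line is not valid syntax. A self-contained toy example (the data here are made up, not from sleuth):

    library(data.table)

    # Toy table shaped roughly like obs_raw (sample / target_id / est_counts)
    obs_raw <- data.frame(
      sample     = c("A", "A", "B", "B"),
      target_id  = c("tx1", "tx2", "tx1", "tx2"),
      est_counts = c(10, 20, 5, 15)
    )

    counts_test <- data.table::as.data.table(obs_raw)
    # .(total = sum(est_counts)) returns one row per group with a 'total' column
    counts_test <- counts_test[, .(total = sum(est_counts)), by = "sample"]
    counts_test
    #    sample total
    # 1:      A    30
    # 2:      B    20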
Collaborator

Missed this, thanks!

R/model.R Outdated
if(any(res$pval < 10^-323, na.rm=TRUE)) {
warning('Extreme p-values around and below 10^-320 will generate 0 pvalues in aggregation')
}
res <- data.table::as.data.table(res)[, .(num_aggregated_transcripts = length(!is.na(pval)), sum_mean_obs_counts = sum(mean_obs, na.rm=TRUE), pval = as.numeric(lancaster(pval, exp(mean_obs)))), by=eval(obj$gene_column)]
@lynnyi (Contributor), Nov 14, 2017

Changed to the data.table implementation. sleuth_results(aggregate_pval = TRUE) now takes ~1.5 seconds instead of 6 seconds with data.frame. Please check that I've used data.table optimally, @warrenmcg @pimentel. Thanks!
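A side note on the 10^-323 warning in the snippet above: that threshold sits near the smallest positive subnormal double, so any smaller p-value is stored as exactly 0 and would propagate zeros through the aggregation. This rationale is an inference, not something stated in the PR; a quick check in R:

    .Machine$double.xmin  # ~2.225074e-308, smallest normalized double
    1e-320 > 0            # TRUE: still representable as a subnormal double
    1e-324 == 0           # TRUE: underflows to exactly zero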

Contributor

Also, I'm going to ask this question here: I noticed that with a newly generated sleuth object I get far fewer significant genes than with my older sleuth object (built with sleuth about a year ago) on the same data set. I just want to make sure this is correct, i.e. that there were changes in sleuth that would explain this.

Collaborator

Looks good to me! I would only suggest a formatting change to keep lines under 100 characters (R style guides usually recommend 80, but I tend to go with 100 because of some nesting that happens).

line 401 would read like this:

res <- data.table::as.data.table(res)[,
         .(num_aggregated_transcripts = length(!is.na(pval)),
           sum_mean_obs_counts = sum(mean_obs, na.rm=TRUE),
           pval = as.numeric(lancaster(pval, exp(mean_obs)))),
         by = eval(obj$gene_column)]

The number of spaces may differ between this comment and your editor, so just align the period with the first 't' in data.table, the other columns with num_aggregated_transcripts, and the by argument with the period.

@pimentel (Collaborator, Author), Nov 15, 2017

The "number of genes called" issue is resolved: false alarm.

@pimentel merged commit f457a10 into devel on Mar 12, 2018.