GitHub - UBC-MDS/ptoolkit: A toolkit for adjusting and visualizing p values

A toolkit for adjusting and visualizing p values.

Created by

Amy Goldlist · Esteban Angel · Veronique Mulholland

Key Features • Install • How To Use • Credits • Related • License

Key Features

p_toolkit is a package designed to help adjust and visualize p-values when using multiple comparisons. As computing power has become powerful enough to run hundreds or even thousands of statistical tests, it is important to look at small p-values and try to understand whether the result is small simply by chance, or whether it truly is significant. There are many tools to help decide when to reject a Null hypothesis, which can control either:

The chance of committing a type 1 error (rejecting a null hypothesis given that it is true) on a single test
The chance of committing at least one type 1 error in m tests
The chance of a null hypothesis being true given that we have rejected it, the False Discovery rate (FDR)

We can use the p-values alone, or an adjustment method such as the Bonferroni or the Benjamini-Hochberg (BH) methods. We can also use visualization methods such as QQ-plots or a scatter plot of the p-values, to try and detect patterns.

This package aims to combine these methods in a simple-to-use format, which works by outputting dataframes, which contain results from several adjustment methods.

Package Functions

Example

###set up a dataframe
df <- data.frame(test= c("test 1", "test 2", "test 3", "test 4"),
                 p = c(.05,.5,.0001, .0001))

##p_adjust with the BH correction
ptoolkit::p_adjust(data = df, pv_index = "p", method = "bh")

###p_adjust with the Bonferroni correction
ptoolkit::p_adjust(data = df, pv_index = "p", method = "bonf")

##p_methods
ptoolkit::p_methods(data = df, pv_index = "p", alpha = 0.05)

###p_qq
ptoolkit::p_qq(data = df, pv_index = "p")

##p_plot
ptoolkit::p_plot(data = df, pv_index = "p")

Install

From the R console, enter these two lines:

devtools::install_github("UBC-MDS/ptoolkit")
library(ptoolkit)

Navigate to the help panel or follow the How to Use section for step-by-step instructions.

How To Use

All the commands in this section should be typed in the IDE console. data refers to either a dataframe or other array/vector format and pv_index is the column number if using a dataframe.

Generating Sample P-values

Here are some lines for simulating a toy set of p-values adapted from Research Utopia:

# Choose how many simulations to perform
nSims <- 100000
p <-numeric(nSims) # initialize empty container for simulated p-values

# Simulate an experiment
for(i in 1:nSims){
  #produce 100 simulated participants
  x<-rnorm(n = 100, mean = 110, sd = 15)
  y<-rnorm(n = 100, mean = 100, sd = 15)

 z<-t.test(x,y) #perform the t-test
 p[i]<-z$p.value #get the p-value and store it
}

Generating a Summary

Now let's take a look at the summary table with both Bonferroni and BH correction methods applied:

p_methods(data, pv_index=0, alpha = 0.05)

Bonferroni `Bonf` Correction

For those only interested in getting adjusted p-values rather than seeing the whole summary, type in either 'bonf' or 'bonferroni':

p_adjust(data, pv_index=0, method='bonf', alpha=0.05)

Benjamini-Hochberg `BH` Correction

And now for the BH correction, type in either 'bh' or 'fdr':

p_adjust(data, pv_index=0, method='bh', alpha=0.05)

Plot the results

A plot displaying the p-values and both Bonferroni and Benjamini-Hochberg method significance level lines:

p_plot(data, pv_index)

A simple QQ-plot of the p-values:

p_qq(data,pv_index)

Credits

README formatting inspiration from Markdownify
Badges by Shields IO
Logo by Devendra Karkar

The p.adjust function comes in the base stats library in R. It's a function designed for adjusting an array of p-values using six methods, some for controlling the family-wise error ("holm", "Hochberg", "Hommel", "Bonferroni") and the others for controlling the false discovery rate ("BH", "BY","fdr"). The advantage of this function is its simplicity and that it comes in the stats library, which is built in in the default environments in R, so the user doesn't need to install external packages. It doesn't let the user analyze deeper what is going on with the tests; this is a key element of p_toolkit.

fdrtool

fdrtool is a package designed for analyzing the False Discovery Rate in statistical tests and not limited exclusively to p-value adjustment. Has some functions related to p_toolkit like fdrtool, which calculates and plots the false discovery rate and pval.estimate.eta0, which outputs the proportion of null p-values in a list.

Python:

statsmodels.sandbox.stats.multicomp.multipletests

This function is part of the statsmodels library, a complete set of functions for implementing statistical methods in Python. It works similar to R's p.adjust, receiving an array of p-values as inputs and returning two arrays: one with the corrected p-values and another one with boolean values corresponding to the new logical values after correction. It has no diagnostics and analysis of the results.

License

MIT License

Interested in contributing? See our Contributing Guidelines and Code of Conduct.

Name		Name	Last commit message	Last commit date
Latest commit History 174 Commits
R		R
doc		doc
man		man
tests		tests
vignettes		vignettes
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
Conduct.md		Conduct.md
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md

License

UBC-MDS/ptoolkit

Folders and files

Latest commit

History

Repository files navigation

A toolkit for adjusting and visualizing p values.

Created by

Amy Goldlist · Esteban Angel · Veronique Mulholland

Key Features • Install • How To Use • Credits • Related • License

Key Features

Package Functions

Example

Install

How To Use

Generating Sample P-values

Generating a Summary

Bonferroni Bonf Correction

Benjamini-Hochberg BH Correction

Plot the results

Credits

Related

Package Dependencies

Similar Packages and Functions

License

Created by Amy Goldlist · Esteban Angel · Veronique Mulholland

About

Resources

License

Stars

Watchers

Forks

Languages

Bonferroni `Bonf` Correction

Benjamini-Hochberg `BH` Correction

Created by
Amy Goldlist · Esteban Angel · Veronique Mulholland