Allow ANOVA-style replotting of data in multi-group plots #43

Closed
josesho opened this issue May 28, 2019 · 1 comment
josesho commented May 28, 2019

import seaborn as sns
import dabest

# Requires internet access.
iris = sns.load_dataset("iris")

iris_db = dabest.load(iris, x="species", y="sepal_length",
                      idx=(("setosa", "versicolor"),
                           ("setosa", "virginica"),
                           ("versicolor", "virginica")))

iris_db.mean_diff
DABEST v0.2.4
=============
             
Good evening!
The current time is Tue May 28 18:30:11 2019.

The unpaired mean difference between setosa and versicolor is 0.93 [95%CI 0.76, 1.1].
The two-sided p-value of the Mann-Whitney test is 8.35e-14.

The unpaired mean difference between setosa and virginica is 1.58 [95%CI 1.38, 1.78].
The two-sided p-value of the Mann-Whitney test is 6.4e-17.

The unpaired mean difference between versicolor and virginica is 0.652 [95%CI 0.428, 0.878].
The two-sided p-value of the Mann-Whitney test is 5.87e-07.

5000 bootstrap samples were taken; the confidence interval is bias-corrected and accelerated.
The p-value(s) reported are the likelihood(s) of observing the effect size(s),
if the null hypothesis of zero difference is true.

To get the results of all valid statistical tests, use `.mean_diff.statistical_tests`
iris_db.mean_diff.plot();

The plot does not match the textual output above.

@josesho josesho added this to the v0.2.5 milestone May 28, 2019
@josesho josesho self-assigned this May 28, 2019

josesho commented Jun 19, 2019

After much deliberation, I have concluded that duplicating groups within the same plot is misleading at best and deceptive at worst. Supporting it would also require a substantial overhaul of the plotting code.

Duplicated groups, whether within idx or across the tuples/lists inside idx, will now be detected and will raise an error.
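
For illustration only, a check of this kind could look roughly like the sketch below. This is not dabest's actual implementation: the helper names `find_duplicate_groups` and `check_idx`, the choice of `ValueError`, and the error message are all assumptions made for this example.

```python
from collections import Counter


def find_duplicate_groups(idx):
    """Return a sorted list of group names that occur more than once in idx.

    Hypothetical helper, not part of dabest.
    """
    # A flat idx such as ("a", "b") is treated as a single comparison tuple.
    if all(isinstance(item, str) for item in idx):
        nested = [idx]
    else:
        nested = idx
    # Count every group name across all tuples/lists inside idx.
    counts = Counter(name for group in nested for name in group)
    return sorted(name for name, n in counts.items() if n > 1)


def check_idx(idx):
    """Raise ValueError if any group is repeated in idx (assumed behaviour)."""
    duplicates = find_duplicate_groups(idx)
    if duplicates:
        raise ValueError(f"Groups are repeated in idx: {duplicates}")


# The idx from the report above repeats every species once.
iris_idx = (("setosa", "versicolor"),
            ("setosa", "virginica"),
            ("versicolor", "virginica"))
print(find_duplicate_groups(iris_idx))
# → ['setosa', 'versicolor', 'virginica']
```

With a check like this, the `idx` from the original report would be rejected before any plotting is attempted, whereas an `idx` in which each group appears only once would pass.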
