Add configurable average function #23

multimeric · 2021-02-18T13:47:03Z

Closes #22, see discussion there.

Uncertainties:

* Have I covered all public APIs, ensuring they can all be configured?
* The test statistic ends up being negative, and therefore with a p-value of 1 when used to compare a standard normal and t distribution in the `test_different_distributions`. Does this make sense, or is it revealing a flaw in the code somewhere?

codecov · 2021-02-18T14:01:22Z

Codecov Report

Merging #23 (b10e89b) into develop (e735155) will increase coverage by 0.07%.
The diff coverage is 100.00%.

@@             Coverage Diff             @@
##           develop      #23      +/-   ##
===========================================
+ Coverage    95.95%   96.03%   +0.07%     
===========================================
  Files           18       18              
  Lines         1137     1159      +22     
===========================================
+ Hits          1091     1113      +22     
  Misses          46       46

| Impacted Files | Coverage Δ | |
|---|---|---|
| dcor/homogeneity.py | `100.00% <ø> (ø)` | |
| dcor/tests/test_independence.py | `100.00% <ø> (ø)` | |
| dcor/_energy.py | `91.66% <100.00%> (+0.75%)` | ⬆️ |
| dcor/tests/test_homogeneity.py | `100.00% <100.00%> (ø)` | |

Δ = absolute <relative> (impact), ø = not affected, ? = missing data

vnmabus · 2021-02-18T19:29:17Z

> Have I covered all public APIs, ensuring they can all be configured?

I think so, although the documentation for the public function is still missing. (In a future PR I should also merge the public and `_imp` versions of the functions; the split only existed to get keyword-only parameters in Python 2, which is no longer supported.)

> The test statistic ends up being negative, and therefore with a p-value of 1 when used to compare a standard normal and t distribution in the `test_different_distributions`. Does this make sense, or is it revealing a flaw in the code somewhere?

You mean the statistic with the mean, median or both?

multimeric · 2021-02-19T01:49:32Z

> You mean the statistic with the mean, median or both?

I mean with the median. Using the mean is one of your tests, which passes.

vnmabus · 2021-02-19T11:40:41Z

> > You mean the statistic with the mean, median or both?
>
> I mean with the median. Using the mean is one of your tests, which passes.

OK, I have checked the implementation and it looks fine. The only explanation I can see is that the Gaussian and Student's t distributions differ mainly in their tails, and the median does not take that information into account. It might detect the difference with a larger number of samples, but that would be very costly. So I would add a test between two sufficiently different distributions with the same mean and call it a day, unless you have a better explanation.
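For illustration only, here is a minimal pure-Python sketch of how a median-based statistic can drop below zero while the mean-based one stays positive. It is not dcor's implementation: `energy_distance` is a hypothetical helper, it handles only one-dimensional samples, and it omits the sample-size scaling a test statistic would normally apply. The only idea taken from this PR is the pluggable `average` argument.

```python
import statistics
from itertools import product


def energy_distance(x, y, average=statistics.fmean):
    """Unscaled 1-D energy distance with a pluggable average (hypothetical helper)."""

    def avg_dist(a, b):
        # Average of |a_i - b_j| over every pair, including i == j.
        return average([abs(p - q) for p, q in product(a, b)])

    # Twice the between-sample average minus the two within-sample averages.
    return 2 * avg_dist(x, y) - avg_dist(x, x) - avg_dist(y, y)


x = [0, 10, 20]
y = [5, 10, 15]

mean_stat = energy_distance(x, y)                                # 20/9, positive
median_stat = energy_distance(x, y, average=statistics.median)   # -5, negative
```

With the mean, this estimate is the energy distance between the two empirical distributions, which cannot be negative. Replacing the mean with the median loses that guarantee, so a negative value on its own need not indicate a bug. The sketch does not, however, explain the p-value observed in `test_different_distributions`.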

multimeric added 3 commits February 19, 2021 00:42

Add configerable average function

8885c54

Rename test

099a5d5

Fix tests

a4ca671

vnmabus reviewed Feb 18, 2021

dcor/_energy.py (resolved)

Add public documentation of average param

cdfa5e6

vnmabus reviewed Feb 19, 2021

dcor/tests/test_homogeneity.py (outdated, resolved)

vnmabus reviewed Feb 19, 2021

dcor/homogeneity.py (resolved)

multimeric added 4 commits February 20, 2021 00:30

Fix line length

7f90741

Add average to first line of docstring

0762717

Add distribution test for medians

9e7e5f7

Ensure both distributions have same mean

c3999d3

vnmabus reviewed Feb 19, 2021

dcor/_energy.py (resolved)

multimeric added 3 commits February 20, 2021 13:15

Add average to several docstrings

03ac46f

Fix test docstring

1f72baf

standard_normal → normal

5618e70

vnmabus requested changes Feb 20, 2021

Use median in median tests; fix line lengths

5173315

vnmabus reviewed Feb 20, 2021

dcor/homogeneity.py (outdated, resolved)

Single line function prototype docstring

b10e89b

vnmabus approved these changes Feb 21, 2021

vnmabus merged commit 71c36e0 into vnmabus:develop Feb 21, 2021
