
Vectorize cdf, ppf #201

Merged: 7 commits merged into master on Jan 22, 2021

Conversation

@k15z (Member) commented Dec 22, 2020

Resolves #200. I'm on vacation from FB (happy holidays @csala!), so I thought I'd take a look at this. I've been wanting to make this faster forever; this PR makes the Gaussian KDE ~8x faster.

Here's some code for benchmarking it:

import numpy as np
from time import time
from copulas.univariate.gaussian_kde import GaussianKDE

X_train = np.random.uniform(size=1000)
X_test = np.random.uniform(size=1000)

model = GaussianKDE()
model.fit(X_train)

# Current (loop-based) ppf.
start = time()
y_slow = model.percent_point(X_test)
print(time() - start)

# Vectorized ppf added in this PR.
start = time()
y_fast = model.percent_point_fast(X_test)
print(time() - start)

# The two implementations should agree up to numerical precision.
print(np.abs(y_fast - y_slow).max())

The vectorized version of the ppf takes 0.332 seconds, while the current version takes 2.898 seconds.

@k15z requested review from @fealho and @csala on December 22, 2020 at 19:01
@csala (Contributor) left a comment

Hey! Thanks for the proposal @k15z !

The idea looks good, and I agree that vectorizing seems to be the way to go here, but we will need to work a bit more on it to make it robust, since the current implementation seems to fail in some edge cases.

Here's an example of such a failure (when passing an array with a single element):

In [1]: import numpy as np
   ...: from copulas.univariate import GaussianKDE
   ...: from copulas.datasets import sample_univariate_bimodal
   ...: 
   ...: data = sample_univariate_bimodal()
   ...: model = GaussianKDE()
   ...: model.fit(data)
   ...: model.percent_point_fast(np.array([0.5]))
   ...: 
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
<ipython-input-1-2ed68b5e8edf> in <module>
      6 model = GaussianKDE()
      7 model.fit(data)
----> 8 model.percent_point_fast(np.array([0.5]))

~/Projects/MIT/Copulas/copulas/univariate/gaussian_kde.py in percent_point_fast(self, U)
    265         X[is_one] = float("inf")
    266         X[is_zero] = float("-inf")
--> 267         X[is_valid] = newton(_f, np.zeros(U[is_valid].shape) + (lower+upper)/2.0)
    268 
    269         return X

~/.virtualenvs/Copulas/lib/python3.8/site-packages/scipy/optimize/zeros.py in newton(func, x0, fprime, args, tol, maxiter, fprime2, x1, rtol, full_output, disp)
    338                             " Failed to converge after %d iterations, value is %s."
    339                             % (itr + 1, p1))
--> 340                         raise RuntimeError(msg)
    341                     warnings.warn(msg, RuntimeWarning)
    342                 p = (p1 + p0) / 2.0

RuntimeError: Tolerance of [-80.25251] reached. Failed to converge after 3 iterations, value is [34.01875655].

I also added some more comments inline.

(Inline review comments on copulas/univariate/gaussian_kde.py and tests/large_scale_evaluation.py, since resolved.)
@codecov-io commented Dec 24, 2020

Codecov Report

Merging #201 (93ea85d) into master (47bf787) will increase coverage by 0.19%.
The diff coverage is 94.93%.


@@            Coverage Diff             @@
##           master     #201      +/-   ##
==========================================
+ Coverage   89.77%   89.97%   +0.19%     
==========================================
  Files          26       27       +1     
  Lines        1585     1646      +61     
==========================================
+ Hits         1423     1481      +58     
- Misses        162      165       +3     
Impacted Files                        Coverage Δ
copulas/univariate/gaussian_kde.py    98.57% <93.75%> (-0.04%) ⬇️
copulas/optimize/__init__.py          95.23% <95.23%> (ø)

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@k15z (Member, Author) commented Dec 24, 2020

Good point re: the stability issues. Updated to use bisect and chandrupatla, both of which are equal to or better than brentq in terms of robustness (both are more stable than newton). Here are the running times (in seconds):

Original: 30.22
Bisect: 5.47
Chandrupatla: 4.72

Up to you whether you prefer the simple bisect method or Chandrupatla's algorithm... the latter is slightly faster but significantly more complex (and may be less robust, although I'm not aware of any issues).
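For reference, here's a minimal sketch of the vectorized bisection idea (not the exact code in this PR; the function name, signature, and bracket handling are illustrative):

import numpy as np

def vectorized_bisect(f, lower, upper, maxiter=100, tol=1e-8):
    # Element-wise bisection: f(lower) and f(upper) must bracket a root
    # (have opposite signs) for every element.
    lower = np.asarray(lower, dtype=float).copy()
    upper = np.asarray(upper, dtype=float).copy()
    f_lower = f(lower)
    for _ in range(maxiter):
        mid = (lower + upper) / 2.0
        f_mid = f(mid)
        # Where f(mid) shares the sign of f(lower), the root lies in
        # [mid, upper]; otherwise it lies in [lower, mid].
        same_sign = np.sign(f_mid) == np.sign(f_lower)
        lower = np.where(same_sign, mid, lower)
        f_lower = np.where(same_sign, f_mid, f_lower)
        upper = np.where(same_sign, upper, mid)
        if np.max(upper - lower) < tol:
            break
    return (lower + upper) / 2.0

Inverting the cdf then becomes a single batched call, e.g. vectorized_bisect(lambda X: model.cumulative_distribution(X) - U, lo, hi) with lo and hi chosen to bracket the fitted support, so the whole batch converges together instead of looping per element.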

(Further inline review comments on copulas/univariate/gaussian_kde.py and tests/unit/univariate/test_gaussian_kde.py.)
@csala (Contributor) commented Jan 6, 2021

> Good point re: the stability issues. Updated to use bisect and chandrupatla, both of which are equal to or better than brentq in terms of robustness (both are more stable than newton). Here are the running times (in seconds):
>
> Original: 30.22
> Bisect: 5.47
> Chandrupatla: 4.72
>
> Up to you whether you prefer the simple bisect method or Chandrupatla's algorithm... the latter is slightly faster but significantly more complex (and may be less robust, although I'm not aware of any issues).

I think both the bisect and chandrupatla options are good, but I'm getting slightly different times:

In [1]: import numpy as np
   ...: from copulas.univariate import GaussianKDE
   ...: from copulas.datasets import sample_univariate_bimodal
   ...: 
   ...: data = sample_univariate_bimodal()
   ...: model = GaussianKDE()
   ...: model.fit(data)
   ...: cdf = model.cumulative_distribution(data)

In [2]: %timeit model.percent_point_slow(cdf[0:10])
34.3 ms ± 299 µs per loop (mean ± std. dev. of 7 runs, 10 loops each)

In [3]: %timeit model.percent_point(cdf[0:10], method='bisect')
20 ms ± 83.5 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

In [4]: %timeit model.percent_point(cdf[0:10], method='chandrupatla')
9.23 ms ± 212 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)

In [5]: %timeit model.percent_point_slow(cdf)
3.35 s ± 4.37 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [6]: %timeit model.percent_point(cdf, method='bisect')
2.53 s ± 77.1 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [7]: %timeit model.percent_point(cdf, method='chandrupatla')
1.15 s ± 18.5 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [8]: %timeit model.percent_point_slow(np.concatenate([cdf] * 10))
33.8 s ± 97.9 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [9]: %timeit model.percent_point(np.concatenate([cdf] * 10), method='bisect')
12.9 s ± 48.9 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [10]: %timeit model.percent_point(np.concatenate([cdf] * 10), method='chandrupatla')
5.84 s ± 18.2 ms per loop (mean ± std. dev. of 7 runs, 1 loop each)

It seems like bisect takes about twice as long as chandrupatla, and chandrupatla takes roughly 1/3 to 1/6 of the time of the original approach.

On the other hand, both bisect and chandrupatla produce the same results as the original method, which is also the right output (it successfully inverts the cdf):

In [11]: np.allclose(data, model.percent_point_slow(cdf))
Out[11]: True

In [12]: np.allclose(data, model.percent_point(cdf, method='bisect'))
Out[12]: True

In [13]: np.allclose(data, model.percent_point(cdf, method='chandrupatla'))
Out[13]: True

Altogether, I think we could keep chandrupatla as the default option but also keep bisect as an alternative.
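As a rough sketch of what that dispatch could look like (the copulas.optimize function names, the _params['dataset'] access, and the bracket padding are assumptions for illustration, not the actual PR code):

import numpy as np
from copulas.optimize import bisect, chandrupatla  # hypothetical names in the new module

def percent_point(self, U, method='chandrupatla'):
    # Invert the cdf: for each u in U, solve cumulative_distribution(X) = u.
    optimizer = {'bisect': bisect, 'chandrupatla': chandrupatla}[method]
    # Illustrative bracket around the fitted support; real bounds would
    # be derived from the fitted dataset and bandwidth.
    dataset = np.asarray(self._params['dataset'])
    lower = np.full(U.shape, dataset.min() - 1.0)
    upper = np.full(U.shape, dataset.max() + 1.0)
    return optimizer(lambda X: self.cumulative_distribution(X) - U, lower, upper)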

Before we call it done I would add a few changes:

  1. Can we move the bisect and chandrupatla functions to a copulas.optimize module?
  2. Maybe we could simplify the code a bit and remove unused options, like return_iter, and use longer variable names, etc., to improve readability.
  3. We should add a few numerical tests to validate that the outputs of both the chandrupatla and bisect functions are right, and also numerical tests that validate that percent_point successfully inverts cumulative_distribution (this could actually be added for all the distributions?). See the sketch after this list.
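For point 3, such a round-trip test could look something like this (a sketch; the test name and tolerance are illustrative):

import numpy as np
from copulas.univariate import GaussianKDE
from copulas.datasets import sample_univariate_bimodal

def test_percent_point_inverts_cumulative_distribution():
    data = sample_univariate_bimodal()
    model = GaussianKDE()
    model.fit(data)
    cdf = model.cumulative_distribution(data)
    for method in ('bisect', 'chandrupatla'):
        # percent_point should recover the data from its own cdf values.
        np.testing.assert_allclose(
            model.percent_point(cdf, method=method), data, atol=1e-6)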

@csala (Contributor) commented Jan 6, 2021

Also, the build errors should be fixed after merging #203 to master and then merging master back into this branch.

@csala csala merged commit e70f689 into master Jan 22, 2021
@csala csala deleted the vectorize-kde branch January 22, 2021 10:28
Closes: Make gaussian_kde faster (#200)