Expanded derivatives #85
Conversation
Codecov Report
```diff
@@            Coverage Diff             @@
##           master      #85      +/-   ##
==========================================
+ Coverage   95.35%   95.37%   +0.02%
==========================================
  Files          18       19       +1
  Lines         968      995      +27
==========================================
+ Hits          923      949      +26
- Misses         45       46       +1
```
Continue to review full report at Codecov.
I'm not sure which functionality is desired, but I believe that if the pipe goes to finite difference, the methods should be equivalent up to the numerical error of the finite difference method. (If you pipe into a method with additional fits/regularization, they will be different.)
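To make the equivalence claim concrete, here is a minimal sketch (my own illustration, not code from this PR) comparing pysindy's built-in `FiniteDifference` with piping the same data through `derivative`'s plain finite difference. The `dxdt(x, t, kind="finite_difference", k=1)` call follows the `derivative` README; any mismatch between the two should appear only near the boundaries.

```python
import numpy as np
from derivative import dxdt
from pysindy.differentiation import FiniteDifference

t = np.linspace(0, 2 * np.pi, 200)
x = np.sin(t)

# pysindy's own finite difference (differentiation methods are callable on (x, t)).
x_dot_pysindy = FiniteDifference()(x.reshape(-1, 1), t).flatten()

# The same data piped through the derivative package's finite difference
# (k=1 -> one point on each side, i.e. a centered difference in the interior).
x_dot_derivative = dxdt(x, t, kind="finite_difference", k=1)

# Away from the boundaries the two should agree to numerical precision.
print(np.max(np.abs(x_dot_pysindy[2:-2] - x_dot_derivative[2:-2])))
```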
I am slightly worried about the maintenance overhead this implementation causes. Every change in the derivative package requires us to adopt those changes in pysindy too. I'd prefer to use the interface provided by the derivative package directly.
@Ohjeah, I see what you mean and I agree that the code would be more maintainable if we were to use the derivative package's own interface more directly.

I'm not sure I understand what this solution would look like. Are you envisioning that, rather than specifying a class,

```python
model = SINDy(differentiation_method=SpectralDerivative(...))
```

the user would supply some keywords?

```python
model = SINDy(derivative_kwargs={...})
```

Or maybe we could write a simple wrapper class for all of the derivative methods, similar to how we wrap optimizers:

```python
optimizer = SINDyOptimizer(self.optimizer, unbias=unbias)
```

Or were you thinking something along the lines of having the user pass in a ...?

I'm not sure it's a good enough reason to outweigh the maintainability concern you raised, but the nice thing about the current implementation is that it allows parameters of the differentiators to be included when performing sklearn-style cross-validation.

@andgoldschmidt, I think ... Once you've updated ...
I do agree that a wrapper for PySINDy is important, especially because cross-validation over the differentiation parameters is a great feature, honestly essential for users to truly integrate some of these derivative methods into the PySINDy framework. Also, a wrapper lets you create default args if that's appealing. Let me know if ...
I agree with you both. Wrapping is required to ensure the cross-validation functionality, and maybe also specific input validation and error handling. The wrapping should be flexible enough, though, to seamlessly pick up upgrades to the derivative package.
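For what it's worth, here is a minimal sketch (my own, not code from this pull request) of what such a flexible wrapper could look like, assuming `derivative` exposes the functional interface `dxdt(x, t, kind=..., axis=..., **kwargs)`. The class name `DerivativeWrapper` is hypothetical.

```python
from derivative import dxdt


class DerivativeWrapper:
    """Hypothetical thin wrapper: store keyword arguments and forward them to
    derivative.dxdt, so new methods and options added to the derivative package
    become available without code changes in pysindy."""

    def __init__(self, **kwargs):
        self.kwargs = kwargs

    def __call__(self, x, t):
        # axis=0: differentiate each column of x (one state variable per column)
        # with respect to the time array t.
        return dxdt(x, t, axis=0, **self.kwargs)


# Example: spline-based differentiation; the keyword arguments are forwarded verbatim.
method = DerivativeWrapper(kind="spline", s=1e-2)
```

Making something like this play nicely with sklearn grid search would additionally require forwarding unknown keyword arguments through `set_params`, along the lines of the example further down in this thread.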
**Derivative wrapper**

I'll take a stab at implementing a universal wrapper for the derivative methods.

**Smoothed finite differences**

The main reason I included this method in the initial release was that it was one Floris highlighted as being relatively simple, effective, and easy to implement. I agree that we could just leave it to the user to smooth their data before feeding it to a `FiniteDifference` object.
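As a point of reference for the smooth-then-differentiate pattern mentioned above, here is a minimal sketch (mine, not from the PR) using scipy's Savitzky-Golay filter as the smoother followed by a plain finite difference:

```python
import numpy as np
from scipy.signal import savgol_filter

t = np.linspace(0, 10, 500)
dt = t[1] - t[0]
x = np.sin(t) + 0.05 * np.random.default_rng(0).standard_normal(t.size)

# Step 1: smooth the noisy signal.
x_smooth = savgol_filter(x, window_length=25, polyorder=3)

# Step 2: apply an ordinary finite difference to the smoothed signal.
x_dot = np.gradient(x_smooth, dt)
```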
@briandesilva You can run a gridsearch on nested kwargs:

```python
from sklearn.base import BaseEstimator


class Model(BaseEstimator):
    def __init__(self, nested_kwargs=None):
        self.kwargs = {"nested_kwargs": nested_kwargs}

    def fit(self, x, y=None, **fit_params):
        print(self.kwargs)
        return self

    def score(self, x, y):
        return 0

    def set_params(self, **params):
        """Set the parameters of this estimator.

        Modification of the sklearn method to allow unknown kwargs. This allows using
        the full range of xgboost parameters that are not defined as member variables
        in sklearn grid search.

        Returns
        -------
        self
        """
        if not params:
            # Simple optimization to gain speed (inspect is slow)
            return self
        for key, value in params.items():
            if hasattr(self, key):
                setattr(self, key, value)
            else:
                # Keys that are not existing attributes are collected instead of
                # raising, which lets GridSearchCV vary the nested kwargs.
                self.kwargs[key] = value
        return self
```

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn import datasets
from sklearn import svm

X, y = datasets.load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.4, random_state=0)

from sklearn.model_selection import GridSearchCV

m = Model()
# Each grid point supplies a different dict of nested keyword arguments.
p = {"nested_kwargs": [{"a": 1}, {"a": 2}]}
grid = GridSearchCV(m, p)
print(grid)
grid.fit(X, y)
```

`Model` would be the derivative wrapper, and the nested kwargs would hold the parameters passed on to the underlying differentiation method.
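Translated to the situation discussed here, the parameter grid might look something like the following; the `kind`/`s`/`k` values are illustrative `derivative` keyword arguments, not options tested in this PR.

```python
# Hypothetical grid over keyword arguments forwarded to derivative.dxdt,
# following the nested-kwargs pattern from the example above.
param_grid = {
    "nested_kwargs": [
        {"kind": "finite_difference", "k": 1},
        {"kind": "spline", "s": 1e-2},
        {"kind": "spectral"},
    ]
}
```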
Okay, I finally got around to updating the implementation based on @Ohjeah's suggestion. I added unit tests and updated the derivative and sklearn example notebooks. The solution ended up being both lightweight and flexible enough to account for changes in the derivative package.
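For illustration, usage of the updated implementation might look roughly like the following; the class name `SINDyDerivative` and the assumption that its keyword arguments are forwarded to `derivative.dxdt` are my reading of the merged solution, not quoted from this thread.

```python
import numpy as np
from pysindy import SINDy
from pysindy.differentiation import SINDyDerivative  # assumed name of the merged wrapper

t = np.linspace(0, 10, 1000)
x = np.stack([np.sin(t), np.cos(t)], axis=1)

# Keyword arguments are passed straight through to derivative.dxdt, so any
# method added to the derivative package is available without new pysindy code.
model = SINDy(differentiation_method=SINDyDerivative(kind="spline", s=1e-2))
model.fit(x, t=t)
model.print()
```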
LGTM, we should also use an inter-sphinx link to the derivative.py documentation. |
Okay, I'm going to go ahead and merge this and create a new release. |
Expanded derivatives
Expands numerical differentiation capabilities by wrapping the methods in the derivative package. The PySINDy versions of each method have the same names as the objects in `derivative`, but with "Differentiator" appended to the end. I created an example notebook comparing all the differentiation options with and without noise. I also added some basic unit tests, but didn't make them stringent because `derivative` already has its own suite of tests.

For now I decided to leave in our original `FiniteDifference` and `SmoothedFiniteDifference` methods because `derivative` does not yet have a method that implements smoothed finite differences (apply smoothing, then apply a finite difference method).

See #58 for more context.
I'm curious whether @andgoldschmidt and @Ohjeah have any thoughts about this implementation.
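For concreteness, here is roughly how the naming convention described in this PR would read in user code. `SpectralDifferentiator` is an illustrative name built from the "Differentiator" suffix rule above, not necessarily an exact class name from the final code.

```python
import numpy as np
from pysindy import SINDy
# Illustrative import: derivative's "Spectral" method exposed with the
# "Differentiator" suffix described above (name assumed from the convention).
from pysindy.differentiation import SpectralDifferentiator

t = np.linspace(0, 2 * np.pi, 500)
x = np.stack([np.sin(t), np.cos(t)], axis=1)

model = SINDy(differentiation_method=SpectralDifferentiator())
model.fit(x, t=t)
model.print()
```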