sklearn api #12

gioxc88 · 2021-06-07T16:10:07Z

Would you consider the possibility of making it compatible with sklearn using fit and transform instead of smooth?
Is there a specific reason why you save the transformed data as an instance attribute? (this would be against the sklearn API)

I am thinking of doing it myself for a project I am working on but I wanted to ask you first if I missed anything obvious that would make this difficult or not possible.

Many thanks

cerlymarco · 2021-06-07T16:30:13Z

Hi,

All the Smoother classes compute 'smooth', which is the equivalent of a 'fit_transform' method. There is no way to operate 'fit' or 'transform' separately. For this reason, I prefer not to introduce this behavior, but you are free to create your own sklearn-style class as a wrapper around tsmoothie.

If you support the project, don't forget to leave a star ;-)

All the best

gioxc88 · 2021-06-09T10:12:38Z

Thanks for the answer. I am aware of this behavior, but I don't think you are entirely right.
For many of them you can in fact operate fit and transform separately.

There are many classes where you calculate X_base and then you fit a LinearRegression using X_base. Finally you use the fitted LinearRegression for predicting the smoothed data.
In particular, this applies to Polynomial, Gaussian, Spline and Binner.

That is by definition the separation between fit (where you calculate the X_base and fit the LinearRegression) and transform (where you use the LinearRegression for predicting the smoothed data). In this case, for example, you would only save the fitted LinearRegression as an instance attribute.
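A minimal sketch of that split, for illustration only. The class name is hypothetical, the basis is a plain polynomial on a [0, 1] time grid, and np.linalg.lstsq stands in for the LinearRegression step; none of this is tsmoothie's actual code. Only the fitted coefficients are stored, not the smoothed data.

```python
import numpy as np


class PolynomialSmootherSketch:
    """Hypothetical example; not part of tsmoothie or scikit-learn."""

    def __init__(self, degree=2):
        self.degree = degree

    def _basis(self, n_timesteps):
        # Design matrix with columns t**0, t**1, ..., t**degree on a [0, 1] grid.
        t = np.linspace(0.0, 1.0, n_timesteps)
        return np.vander(t, self.degree + 1, increasing=True)

    def fit(self, X, y=None):
        # X has shape (n_series, n_timesteps); each row is one series.
        X = np.asarray(X, dtype=float)
        X_base = self._basis(X.shape[1])
        # Least-squares coefficients, one column per series
        # (assumption: this replaces the LinearRegression fit).
        self.coef_ = np.linalg.lstsq(X_base, X.T, rcond=None)[0]
        return self

    def transform(self, X):
        # Evaluate the fitted polynomials on the time grid of X.
        # Only the length of X is used; the values come from coef_.
        X = np.asarray(X, dtype=float)
        return (self._basis(X.shape[1]) @ self.coef_).T
```

With this layout, fit(X).transform(X) gives the same result a single smooth call would, but the fitted state is a small coefficient array rather than a copy of the smoothed series.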

A similar pattern holds for Lowess, where the fitting process consists of calculating the weights (what you call betas) and the transform step is basically just (betas[..., 0] + betas[..., 1] * X).T
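To show the idea, here is a simplified single-series sketch. The function names are hypothetical, the kernel is a plain tricube with no robustness iterations, and the series is assumed to have at least 3 points; it is not how tsmoothie computes its betas. The fit step produces one local intercept and slope per time step, and the transform step only evaluates those lines.

```python
import numpy as np


def fit_local_lines(y, smooth_fraction=0.3):
    """Return betas of shape (n, 2): a local intercept and slope per time step."""
    y = np.asarray(y, dtype=float)
    n = y.shape[0]
    t = np.arange(n, dtype=float)
    design = np.column_stack([np.ones(n), t])
    # Number of neighbours used by each local fit (assumption: at least 3).
    k = min(n, max(3, int(np.ceil(smooth_fraction * n))))
    betas = np.empty((n, 2))
    for i in range(n):
        dist = np.abs(t - t[i])
        bandwidth = np.sort(dist)[k - 1]
        # Tricube kernel: weight 1 at the centre, 0 at and beyond the bandwidth.
        weights = np.clip(1.0 - (dist / bandwidth) ** 3, 0.0, None) ** 3
        root_w = np.sqrt(weights)
        # Weighted least squares via square-root weighting of rows.
        betas[i] = np.linalg.lstsq(
            design * root_w[:, None], y * root_w, rcond=None
        )[0]
    return betas


def transform_local_lines(betas):
    """Evaluate each local line at its own time step."""
    t = np.arange(betas.shape[0], dtype=float)
    return betas[..., 0] + betas[..., 1] * t
```

Here betas plays the role of the fitted state: once it is computed, producing the smoothed series needs nothing else.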

And finally, for Exponential and Convolutional, the fit step would consist of calculating the weights and the transform step would apply the convolution using the "fitted" weights.
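A small pure-Python sketch of that idea, assuming normalized weights of the form alpha**k over a fixed window. Both function names and the exact weighting scheme are illustrative assumptions and may differ from what tsmoothie does internally.

```python
def fit_exponential_weights(window_len, alpha):
    """'Fit' step: build normalized weights, most recent observation first."""
    raw = [alpha ** k for k in range(window_len)]
    total = sum(raw)
    return [w / total for w in raw]


def apply_weights(series, weights):
    """'Transform' step: weighted sum over each full trailing window."""
    m = len(weights)
    smoothed = []
    for end in range(m - 1, len(series)):
        # weights[0] multiplies the newest point in the window.
        smoothed.append(sum(weights[k] * series[end - k] for k in range(m)))
    return smoothed
```

The weights depend only on the hyperparameters, so they can be computed once and reused on any series of sufficient length.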

So I believe saying "There is no way to operate 'fit' or 'transform' separately" is inaccurate.

That being said, I love the package and I am not saying you should change the API to sklearn (I'll do it myself).
I am only suggesting that, in my opinion, some of the design patterns you used, like self._store_results, are not great and make it more difficult to create wrappers.

At the moment, creating a wrapper compliant with the sklearn API would literally require copying the code and pasting it into a different structure. This is not ideal, because every time tsmoothie is updated, the hypothetical external developer who wrote the wrapper would need to check, line by line, the consistency between tsmoothie and their wrapper.

Again just my 2 cents and congrats for the awesome work.

cerlymarco · 2021-06-11T08:26:28Z

Thanks for the suggestions, but I don't see them as applicable to all the available smoothers. For the same reason, I used _store_results as a common method for all the smoothers, so that the code is built in the same way (this is only a choice of mine).

I also think that a dummy sklearn wrapper can be built in this simple way (it also depends on what you are looking for):

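from tsmoothie.smoother import LowessSmoother


class SklearnLowess(LowessSmoother):

    def __init__(self, smooth_fraction, iterations, batch_size=None, copy=True):
        # Store the hyperparameters under the same names as the arguments.
        self.smooth_fraction = smooth_fraction
        self.iterations = iterations
        self.batch_size = batch_size
        self.copy = copy

    def fit(self, X, y=None):
        # Nothing is learned here; smoothing happens entirely in transform.
        return self

    def transform(self, X):
        # Run the smoother on X and return only the smoothed array.
        return self.smooth(X).smooth_data

    def fit_transform(self, X, y=None):
        # Same as transform, since fit is a no-op.
        return self.smooth(X).smooth_data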

here the full code

Bye

cerlymarco closed this as completed Jun 7, 2021
