API pairwise_distances will require explicit V/VI param if Y is given #16993

jnothman · 2020-04-22T01:00:03Z

Deprecation until version 0.25.

The current approach in _precompute_metric_params
(

scikit-learn/sklearn/metrics/pairwise.py

Lines 1429 to 1444 in f82a2cb

    
           def _precompute_metric_params(X, Y, metric=None, **kwds): 
        
               """Precompute data-derived metric parameters if not provided 
        
               """ 
        
               if metric == "seuclidean" and 'V' not in kwds: 
        
                   if X is Y: 
        
                       V = np.var(X, axis=0, ddof=1) 
        
                   else: 
        
                       V = np.var(np.vstack([X, Y]), axis=0, ddof=1) 
        
                   return {'V': V} 
        
               if metric == "mahalanobis" and 'VI' not in kwds: 
        
                   if X is Y: 
        
                       VI = np.linalg.inv(np.cov(X.T)).T 
        
                   else: 
        
                       VI = np.linalg.inv(np.cov(np.vstack([X, Y]).T)).T 
        
                   return {'VI': VI} 
        
               return {}

)
means that we may be applying a different metric at training and test
time. Ideally we'd have a framework for fitting a metric on some
specific training data, but in the meantime, this deprecation stops
users making mistakes.

Deprecation until version 0.25. The current approach in `_precompute_metric_params` (https://github.com/scikit-learn/scikit-learn/blob/f82a2cb33871a67b36150647ece1c7e56d3132bb/sklearn/metrics/pairwise.py#L1429-L1444) means that we may be applying a different metric at training and test time. Ideally we'd have a framework for fitting a metric on some specific training data, but in the meantime, this deprecation stops users making mistakes.

adrinjalali · 2020-04-22T08:22:52Z

Are there other metrics where we have a similar pattern?

jnothman · 2020-04-22T13:01:29Z

Only Gower in progress.

sklearn/metrics/pairwise.py

sklearn/metrics/tests/test_pairwise.py

Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com>

jnothman · 2020-04-27T12:46:05Z

Good to go, @thomasjpfan?

adrinjalali · 2020-04-27T19:55:31Z

tagging for inclusion #17010

…#16993) * API pairwise_distances will require explicit V/VI param if Y is given Deprecation until version 0.25. The current approach in `_precompute_metric_params` (https://github.com/scikit-learn/scikit-learn/blob/f82a2cb33871a67b36150647ece1c7e56d3132bb/sklearn/metrics/pairwise.py#L1429-L1444) means that we may be applying a different metric at training and test time. Ideally we'd have a framework for fitting a metric on some specific training data, but in the meantime, this deprecation stops users making mistakes. * DOC update what's new * Update sklearn/metrics/tests/test_pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/tests/test_pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> Co-authored-by: Thomas J Fan <thomasjpfan@gmail.com>

…scikit-learn#16993) * API pairwise_distances will require explicit V/VI param if Y is given Deprecation until version 0.25. The current approach in `_precompute_metric_params` (https://github.com/scikit-learn/scikit-learn/blob/f82a2cb33871a67b36150647ece1c7e56d3132bb/sklearn/metrics/pairwise.py#L1429-L1444) means that we may be applying a different metric at training and test time. Ideally we'd have a framework for fitting a metric on some specific training data, but in the meantime, this deprecation stops users making mistakes. * DOC update what's new * Update sklearn/metrics/tests/test_pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> * Update sklearn/metrics/tests/test_pairwise.py Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com> Co-authored-by: Thomas J Fan <thomasjpfan@gmail.com>

github-actions bot added the module:metrics label Apr 22, 2020

DOC update what's new

843fb29

adrinjalali approved these changes Apr 22, 2020

View reviewed changes

jnothman added the Waiting for Reviewer label Apr 25, 2020

thomasjpfan reviewed Apr 26, 2020

View reviewed changes

sklearn/metrics/pairwise.py Outdated Show resolved Hide resolved

sklearn/metrics/pairwise.py Outdated Show resolved Hide resolved

sklearn/metrics/tests/test_pairwise.py Outdated Show resolved Hide resolved

sklearn/metrics/tests/test_pairwise.py Show resolved Hide resolved

jnothman and others added 4 commits April 27, 2020 17:48

Update sklearn/metrics/tests/test_pairwise.py

35ca3d0

Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com>

Update sklearn/metrics/pairwise.py

97a0aea

Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com>

Update sklearn/metrics/pairwise.py

11cfccc

Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com>

Update sklearn/metrics/tests/test_pairwise.py

22e1fc7

Co-Authored-By: Thomas J Fan <thomasjpfan@gmail.com>

Merge remote-tracking branch 'upstream/master' into pr/16993

fbc2c12

thomasjpfan approved these changes Apr 27, 2020

View reviewed changes

thomasjpfan merged commit 5b2c931 into scikit-learn:master Apr 27, 2020

nyanp mentioned this pull request Apr 26, 2022

bug(KNN FIT): Neighbors metric_params expecting 'VI', not 'V' nyanp/optiver-realized-volatility-prediction#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

API pairwise_distances will require explicit V/VI param if Y is given #16993

API pairwise_distances will require explicit V/VI param if Y is given #16993

Uh oh!

jnothman commented Apr 22, 2020

Uh oh!

adrinjalali commented Apr 22, 2020

Uh oh!

jnothman commented Apr 22, 2020 via email

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jnothman commented Apr 27, 2020 •

edited

Loading

Uh oh!

adrinjalali commented Apr 27, 2020

Uh oh!

Uh oh!

	def _precompute_metric_params(X, Y, metric=None, **kwds):
	"""Precompute data-derived metric parameters if not provided
	"""
	if metric == "seuclidean" and 'V' not in kwds:
	if X is Y:
	V = np.var(X, axis=0, ddof=1)
	else:
	V = np.var(np.vstack([X, Y]), axis=0, ddof=1)
	return {'V': V}
	if metric == "mahalanobis" and 'VI' not in kwds:
	if X is Y:
	VI = np.linalg.inv(np.cov(X.T)).T
	else:
	VI = np.linalg.inv(np.cov(np.vstack([X, Y]).T)).T
	return {'VI': VI}
	return {}

Uh oh!

API pairwise_distances will require explicit V/VI param if Y is given #16993

API pairwise_distances will require explicit V/VI param if Y is given #16993

Uh oh!

Conversation

jnothman commented Apr 22, 2020

Uh oh!

adrinjalali commented Apr 22, 2020

Uh oh!

jnothman commented Apr 22, 2020 via email

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jnothman commented Apr 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

adrinjalali commented Apr 27, 2020

Uh oh!

Uh oh!

jnothman commented Apr 27, 2020 •

edited

Loading