KernelDensity docstring #3270

mblondel · 2014-06-12T08:27:34Z

I had trouble understanding how the parameters related to the KD-tree and Ball tree affect density estimation.

In my understanding, KDE uses the average evaluation of kernels centered on every training point so I didn't quite understand how KD-tree and Ball-tree are useful (I would understand if it used, say, the average of the k nearest neighbors only).
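
For reference, the brute-force estimate described above can be sketched in a few lines. This is only an illustration under stated assumptions (1-D data, a Gaussian kernel, and a hypothetical helper name `gaussian_kde_brute`); it is not scikit-learn's implementation. It also shows why a spatial tree can help: a training point many bandwidths away contributes almost nothing, so whole groups of distant points can be bounded rather than evaluated one by one.

```python
import math


def gaussian_kde_brute(train, query, bandwidth):
    """Density at `query`: the mean of Gaussian kernels centered on each training point.

    Hypothetical helper for illustration only; 1-D inputs are assumed.
    """
    norm = 1.0 / (bandwidth * math.sqrt(2.0 * math.pi))
    total = 0.0
    for x in train:
        u = (query - x) / bandwidth
        total += norm * math.exp(-0.5 * u * u)
    return total / len(train)


train = [-1.0, 0.0, 0.5, 2.0]  # made-up training points
density_at_zero = gaussian_kde_brute(train, 0.0, bandwidth=0.5)

# A point 10 bandwidths away scales the kernel by exp(-50): effectively zero.
far_weight = math.exp(-0.5 * 10.0 ** 2)
```

Every training point is visited for every query here, so the cost grows with the product of the two set sizes.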

The docstring says that we can specify tolerance options rtol and atol. With respect to what stopping criterion are these constants used?

Regarding the breadth_first option, the docstring says "use a breadth-first approach to the problem". I didn't understand what "problem" it refers to.

What is the practical impact of the tree-related parameters? Do they only affect speed or can they also affect quality of estimation?

Sorry for the naive questions but it would help my understanding if we could add a couple of words to clarify the docstring.

BTW, thanks @jakevdp for this great module!


mblondel · 2014-06-13T02:10:24Z

Just realized that Jake had written a nice blog post answering most of my questions.
http://jakevdp.github.io/blog/2013/12/01/kernel-density-estimation/

Would be nice to improve the documentation based on this blog post.

jakevdp · 2014-06-13T15:48:14Z

> Would be nice to improve the documentation based on this blog post.

Yes, I agree! Part of the reason this info is not in the doc string is that I hadn't explored it in detail yet: the blog post was the result of me trying to figure that out 😀

Hasil-Sharma · 2014-06-21T10:53:56Z

Hi, is this issue open for resolving?

mblondel · 2014-06-23T03:42:51Z

@Hasil-Sharma All issues are open for resolving :)

Winterflower · 2014-08-31T10:36:24Z

Working on improving the docs based on the blog post mentioned above.

jakevdp · 2014-08-31T13:57:24Z

Awesome - thanks @Winterflower

jakevdp · 2014-08-31T13:59:02Z

One thing I'd thought of here: the default parameter value asks for exact results, which is basically the slowest possible algorithm. Most users will not likely dig into the doc strings to figure this out... perhaps we should change it to use some reasonable error threshold as the default?
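
A minimal sketch of what opting into an error threshold looks like from the user's side, assuming a scikit-learn installation where `KernelDensity` accepts `rtol` and `atol`. The data, bandwidth, and the `rtol=1e-2` value are arbitrary choices for illustration, not recommended defaults.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

rng = np.random.default_rng(0)
X = rng.normal(size=(2000, 2))      # illustrative training data
X_query = rng.normal(size=(50, 2))  # illustrative query points

# rtol=0 and atol=0 request exact kernel sums (the slow path discussed above).
exact = KernelDensity(kernel="gaussian", bandwidth=0.5, rtol=0.0, atol=0.0).fit(X)

# A nonzero rtol allows the tree to stop refining once its bounds are tight enough.
approx = KernelDensity(kernel="gaussian", bandwidth=0.5, rtol=1e-2).fit(X)

# score_samples returns log-densities, so exponentiate before comparing.
dens_exact = np.exp(exact.score_samples(X_query))
dens_approx = np.exp(approx.score_samples(X_query))
max_rel_err = np.max(np.abs(dens_approx - dens_exact) / dens_exact)
```

Comparing `max_rel_err` with the chosen `rtol`, and timing both `score_samples` calls, gives a rough feel for the speed versus accuracy trade-off on a given dataset.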

loldja · 2017-03-04T18:17:59Z

🤔 I'm working on this

amueller · 2018-08-21T19:40:51Z

Interestingly enough, the BinaryTree.kernel_density function has a default rtol of 1E-8 but documents 0...

amueller · 2018-08-21T19:41:54Z

I think we actually might need @jakevdp on this

reshamas · 2021-07-08T17:12:29Z

Note: This file (https://github.com/scikit-learn/scikit-learn/blob/506b12b2761ad88039114dec1c6c4fcec4a7a021/sklearn/neighbors/_binary_tree.pxi) has

        rtol : float, default=1e-8
            Specify the desired relative tolerance of the result.
            If the true result is `K_true`, then the returned result `K_ret`
            satisfies ``abs(K_true - K_ret) < atol + rtol * K_ret``
            The default is `1e-8` (i.e. machine precision).

and log_rtol is used after rtol definition.
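
The inequality quoted in that docstring can be written out as a small check. This is a sketch of the documented criterion only: `within_tolerance` is a hypothetical helper, not a function in scikit-learn, and the sample values are made up.

```python
def within_tolerance(k_true, k_ret, atol=0.0, rtol=1e-8):
    """Return True if abs(k_true - k_ret) < atol + rtol * k_ret.

    Hypothetical helper mirroring the docstring's stated bound.
    """
    return abs(k_true - k_ret) < atol + rtol * k_ret


# A returned value off by about 5e-9 passes with the documented default rtol.
close_enough = within_tolerance(1.0, 1.0 + 5e-9)

# A returned value off by 1e-3 fails unless the tolerances are loosened.
too_far = within_tolerance(1.0, 1.001)
loosened = within_tolerance(1.0, 1.001, rtol=1e-2)
```

Note that the bound is relative to the returned value `K_ret`, with `atol` acting as an absolute floor when the density is very small.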

adrinjalali · 2024-04-18T08:18:21Z

Docstrings have evolved and improved. Closing this, happy to have a new issue if something's still unclear.

mblondel added the Documentation label Jun 12, 2014

amueller added the Easy and Need Contributor labels Oct 27, 2016

amueller added the Sprint label Mar 3, 2017

loldja mentioned this issue Mar 5, 2017

changed default value rtol in docstring to reflect source code #8533

Closed

lesteve added help wanted and removed Need Contributor labels Oct 18, 2017

amueller removed the Easy, Sprint, and help wanted labels Sep 29, 2018

cmarmo added the module:neighbors label Dec 6, 2021

adrinjalali closed this as completed Apr 18, 2024
