Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kernel Density Estimation (kde) #1944

Closed
iglesias opened this issue Mar 6, 2014 · 9 comments
Closed

Kernel Density Estimation (kde) #1944

iglesias opened this issue Mar 6, 2014 · 9 comments

Comments

@iglesias
Copy link
Collaborator

iglesias commented Mar 6, 2014

This is an entrance task for the fundamental machine learning algorithms GSoC project http://www.shogun-toolbox.org/page/Events/gsoc2014_ideas#fundamental.

  • Implement a KDE class inside the distributions directory that performs kernel density estimation. This algorithm depends on the choice of a kernel and a nearest neighbours search algorithm.

Scikit-learn has a nice implementation of this algorithm, so feel free to draw inspiration from it: github.com/scikit-learn/scikit-learn/blob/master/sklearn/neighbors/kde.py

This task is fun because the algorithm is not particularly hard to implement and it is possible to make good-looking examples (ask @iglesias if you need an idea for this once the main algorithm and unit test are implemented).

@iglesias
Copy link
Collaborator Author

iglesias commented Mar 6, 2014

Feel free to write your name below if you are working on this task! This way we avoid wasting manpower on the same algorithm.

@Saurabh7
Copy link
Contributor

Saurabh7 commented Mar 7, 2014

for reference, working on this one. :)

@SanchitAggarwal
Copy link

I am attempting this :)

@Saurabh7
Copy link
Contributor

Saurabh7 commented Mar 8, 2014

@iglesias i figured i needed help on this:
the scikit one computes log of density estimation ( not by default though) and uses log of the kernel for computation is that what would be expected here?

@karlnapf
Copy link
Member

karlnapf commented Mar 8, 2014

densities should always be evaluated in log-domain to avoid numerical problems

@cassiogreco
Copy link

I will attempt this as well

@arman-z
Copy link

arman-z commented Mar 26, 2014

Maybe I'm too late. But I'm also try to impelement it.

@vigsterkr
Copy link
Member

@iglesias @mazumdarparijat this is done, or?

@iglesias
Copy link
Collaborator Author

#2383

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

7 participants