Fast NMF algorithm for dense and sparse data #896

mblondel · 2012-06-08T08:30:50Z

Here's an algorithm which I think would be a good candidate for inclusion in scikit-learn:

http://www.cs.utexas.edu/~cjhsieh/nmf/

ogrisel · 2012-06-08T19:49:46Z

I would rather like an algorithm that scales with n_samples rather than n_features. Maybe an Averaged SGD optimization of the NMF cost function + positive projections?

mblondel · 2012-06-09T06:24:15Z

Their algorithm didn't gave me the impression that it doesn't scale wrt n_samples (the datasets they use for their experiments are pretty large). And variable selection seems to help accelerate convergence a lot.

BTW, the tricks necessary for efficient implementation of averaging in the sparse case may not be applicable if there's a projection step (to be verified).

ogrisel · 2012-06-09T11:23:16Z

Ok interesting. The code seems simple enough to implement too.

mblondel · 2014-08-14T15:02:53Z

I added a preliminary implementation of this method here:
https://gist.github.com/mblondel/09648344984565f9477a

A difference is that my implementation uses cyclic coordinate selection instead of greedy.

CC @vene @larsmans

mblondel · 2014-08-17T14:48:30Z

I obtained a 20x speed up by numba-ing the most computationally expensive part. Now computing the NMF on the full News20 dataset takes 10 seconds.

https://gist.github.com/mblondel/09648344984565f9477a

amueller · 2015-08-28T15:23:00Z

should we close this in favor of #4811?

mblondel · 2015-08-29T12:43:34Z

Let's close it when #4852 is merged :)

amueller · 2015-09-22T16:26:16Z

#4852 is merged now :)

mblondel mentioned this issue Nov 11, 2012

NMF for Kullback-Leibler divergence #1348

Closed

TomDLT mentioned this issue Jun 3, 2015

Discussion about adding NMF methods #4811

Closed

amueller closed this as completed Sep 22, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fast NMF algorithm for dense and sparse data #896

Fast NMF algorithm for dense and sparse data #896

mblondel commented Jun 8, 2012

ogrisel commented Jun 8, 2012

mblondel commented Jun 9, 2012

ogrisel commented Jun 9, 2012

mblondel commented Aug 14, 2014

mblondel commented Aug 17, 2014

amueller commented Aug 28, 2015

mblondel commented Aug 29, 2015

amueller commented Sep 22, 2015

Fast NMF algorithm for dense and sparse data #896

Fast NMF algorithm for dense and sparse data #896

Comments

mblondel commented Jun 8, 2012

ogrisel commented Jun 8, 2012

mblondel commented Jun 9, 2012

ogrisel commented Jun 9, 2012

mblondel commented Aug 14, 2014

mblondel commented Aug 17, 2014

amueller commented Aug 28, 2015

mblondel commented Aug 29, 2015

amueller commented Sep 22, 2015