Simplify spectral clustering solver logic #14713

amueller · 2019-08-21T18:55:36Z

We need to simplify the logic for selecting a solver in spectral_clustering.
See discussion here:
#10715 (comment)
#14647 (comment)
#10720 (comment)


alexshacked · 2019-09-24T12:43:19Z

Hi @amueller. I would like to take a shot at this. I can see it's demanding but would still like to take a look. Is it OK, or is someone already looking at it?

amueller · 2019-09-25T17:36:54Z

@alexshacked please go for it!

alexshacked · 2019-09-25T19:18:10Z

Thank you. I will have a design based on #10715 (comment), #14647 (comment), #10720 (comment) in a couple of days.

alexshacked · 2019-10-04T22:56:16Z

After reading the issues and PRs referenced here and studying the spectral clustering implementation, I understand that the goal of this enhancement is to refactor the logic for choosing an eigen solver in the function spectral_embedding(), located in manifold/spectral_embedding.py.
I opened a [WIP] PR #15136, where I show an initial suggestion for refactoring spectral_embedding(). My intention is just to present an idea for simplifying the logic that chooses the solver. It felt easier to write code than to write a design document. I hope to get feedback that will focus my next steps.
I also intend to add more regression tests, but I plan to work on the tests only after the refactoring goal becomes clearer to me.

alexshacked · 2019-10-07T09:44:52Z

After we finish the refactoring and the code is clear, there are a couple of candidates for improving the algorithm. I found them by reading the issues referenced here. For now I have 2 ideas based on @lobpcg's suggestions.

1. Always call scipy.linalg.eigh() if n_nodes < 5 * n_components, that is, if the number of samples is smaller than 5 times the number of requested eigenvectors.
   original comment: Amg arpack workaround fix #14647 (comment)
2. When calling arpack with shift-invert, use a sigma value of -1e-5 instead of 1.0, and do not multiply the laplacian by -1 just before calling arpack.
   original comment: Amg arpack workaround fix #14647 (comment)
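
To make the two ideas concrete, here is a rough sketch of how they could fit together. It is not the code in spectral_embedding() or in PR #15136. The function name `smallest_eigenpairs` and its signature are hypothetical, and it assumes the input is a symmetric positive semi-definite graph Laplacian, so that shifting by a small negative sigma keeps the shifted matrix invertible.

```python
import numpy as np
from scipy import sparse
from scipy.linalg import eigh
from scipy.sparse.linalg import eigsh


def smallest_eigenpairs(laplacian, n_components, sigma=-1e-5):
    """Hypothetical helper: smallest eigenpairs of a symmetric PSD Laplacian.

    Illustrates the two suggestions only; not scikit-learn's implementation.
    """
    n_nodes = laplacian.shape[0]

    # Idea 1: for small problems, skip the iterative solver and use a
    # dense solver, which returns eigenvalues in ascending order.
    if n_nodes < 5 * n_components:
        if sparse.issparse(laplacian):
            dense = laplacian.toarray()
        else:
            dense = np.asarray(laplacian)
        vals, vecs = eigh(dense)
        return vals[:n_components], vecs[:, :n_components]

    # Idea 2: ARPACK in shift-invert mode with sigma slightly below zero.
    # The Laplacian is passed as-is (no multiplication by -1); the
    # eigenvalues closest to sigma are the smallest ones.
    vals, vecs = eigsh(laplacian, k=n_components, sigma=sigma, which="LM")
    order = np.argsort(vals)
    return vals[order], vecs[:, order]


# Usage on the Laplacian of a path graph with 50 nodes.
n = 50
adjacency = sparse.diags([np.ones(n - 1), np.ones(n - 1)], [-1, 1], format="csc")
degrees = np.asarray(adjacency.sum(axis=1)).ravel()
path_laplacian = sparse.diags(degrees, format="csc") - adjacency
eigenvalues, eigenvectors = smallest_eigenpairs(path_laplacian, n_components=3)
```

With 50 nodes and 3 components the ARPACK branch is taken; asking for 3 components of an 8-node graph would take the dense branch instead. Eigenvector signs are not fixed by either solver, so any comparison between branches should be made on eigenvalues or on residuals.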

@amueller what do you think?


amueller added Enhancement help wanted labels Aug 21, 2019

amueller mentioned this issue Aug 21, 2019

Amg arpack workaround fix #14647

Merged

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 4, 2019

ENH refactored logic for chosing eigen solver (scikit-learn#14713)

790e9e9

alexshacked mentioned this issue Oct 4, 2019

[WIP] Refactoring logic for chosing eigen solver in spectral clustering (#14713) #15136

Open

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 4, 2019

ENH logic for eigen solver - PEP8 issues (scikit-learn#14713)

a4ff2d2

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 4, 2019

ENH logic for eigen solver - PEP8 issues 2 (scikit-learn#14713)

f6ff7a0

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 15, 2019

ENH logic for eigen chossing solver - improve comment (scikit-learn#1…

c0e61ab

…4713)

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 15, 2019

ENH logic for eigen chossing solver - shorten comment for PIP (scikit…

ef6fba9

…-learn#14713)

alexshacked pushed a commit to alexshacked/scikit-learn that referenced this issue Oct 15, 2019

ENH logic for eigen chossing solver - shorten comment (scikit-learn#1…

7a83d40

…4713)

cmarmo added module:manifold and removed help wanted labels Jan 21, 2021

lobpcg mentioned this issue Nov 3, 2021

ENH support float32 in SpectralEmbedding for LOBPCG and PyAMG solvers #21534

Merged

lobpcg mentioned this issue Dec 17, 2021

eigen_tol in _spectral_embedding.py does not propagate to solvers other than arpack #21243

Closed
