better description of the hierarchical clustering parameter #9171

raamana · 2018-08-21T19:47:25Z

Clarifies that t could be an integer specifying the max number of clusters under maxclust* criteria

raamana · 2018-08-22T19:27:21Z

the reasons pypy3 failed with PR seems to be for reasons completely unrelated to my changes - any idea whats going on?

jeffyancey · 2018-08-23T17:44:33Z

scipy/cluster/hierarchy.py

@@ -2471,8 +2471,11 @@ def fcluster(Z, t, criterion='inconsistent', depth=2, R=None, monocrit=None):
    Z : ndarray
        The hierarchical clustering encoded with the matrix returned
        by the `linkage` function.
-    t : float
-        The threshold to apply when forming flat clusters.
+    t : float or int


This could really be any numeric that can be safely cast to int and float , correct? A boolean would work here as well. I like your new comment below, but am unsure if changing to "float or int" is an improvement.

its more of a semantic hint to further reinforce the point that it is referring to number of clusters, an integer.. Making it explicit, speaking in python terms!. Without that hint, people may supply 2.0, thinking they are expected a float.

in fact, it might even be better to rename to t to t_or_num_clust

in other places referring to kmeans, scipy already uses k_or_guess

the generic term to use for float or int is scalar

I take your point @raamana, I think scalar is the right way to go

sure - what about the renaming the variable to t_or_num_clust?

you cannot rename variables in the signature, that breaks backwards compatibility

Sure. revised the docs now.

rgommers · 2018-08-23T19:46:57Z

Merged, thanks @raamana, @jeffyancey

raamana · 2018-08-23T19:49:43Z

Yay! thanks. So glad to be able to add a few characters into the mighty scipy codebase :)

raamana added 3 commits August 21, 2018 15:43

better description of the parameter

b7a0fe0

sharpening descriptions

b4c2fd7

updating fcluster as well

e322cd9

rgommers added scipy.cluster Documentation Issues related to the SciPy documentation. Also check https://github.com/scipy/scipy.org labels Aug 21, 2018

jeffyancey reviewed Aug 23, 2018

View reviewed changes

more succinct description of the data type

b11ca98

jeffyancey approved these changes Aug 23, 2018

View reviewed changes

rgommers merged commit 04f6392 into scipy:master Aug 23, 2018

rgommers added this to the 1.2.0 milestone Aug 23, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

better description of the hierarchical clustering parameter #9171

better description of the hierarchical clustering parameter #9171

raamana commented Aug 21, 2018 •

edited

Loading

raamana commented Aug 22, 2018

jeffyancey Aug 23, 2018

raamana Aug 23, 2018

raamana Aug 23, 2018

rgommers Aug 23, 2018 •

edited

Loading

jeffyancey Aug 23, 2018

raamana Aug 23, 2018

rgommers Aug 23, 2018

raamana Aug 23, 2018

rgommers commented Aug 23, 2018

raamana commented Aug 23, 2018

better description of the hierarchical clustering parameter #9171

better description of the hierarchical clustering parameter #9171

Conversation

raamana commented Aug 21, 2018 • edited Loading

raamana commented Aug 22, 2018

jeffyancey Aug 23, 2018

Choose a reason for hiding this comment

raamana Aug 23, 2018

Choose a reason for hiding this comment

raamana Aug 23, 2018

Choose a reason for hiding this comment

rgommers Aug 23, 2018 • edited Loading

Choose a reason for hiding this comment

jeffyancey Aug 23, 2018

Choose a reason for hiding this comment

raamana Aug 23, 2018

Choose a reason for hiding this comment

rgommers Aug 23, 2018

Choose a reason for hiding this comment

raamana Aug 23, 2018

Choose a reason for hiding this comment

rgommers commented Aug 23, 2018

raamana commented Aug 23, 2018

raamana commented Aug 21, 2018 •

edited

Loading

rgommers Aug 23, 2018 •

edited

Loading