- Fixed an issue in multiprocessing code.
- Multiprocessing (by loisaidasam)
- Python 3 support
- Split one big file into smaller, more logical sub-modules
- Fixed
- Documentation update.
- Fixed bug #1727558
- Some more unit-tests
- ValueError changed to ClusteringError where appropriate
- Fixed bug #1604859 (thanks to Willi Richert for reporting it)
- Applied patch [1535137] (thanks ajaksu)
--> Topology output supported
--> data and raw_data are now properties.
- KMeans Clustering implemented for simple numeric tuples.
Data in the form [(1,1), (2,1), (5,3), ...]
can be clustered.
>>> from cluster import KMeansClustering
>>> cl = KMeansClustering([(1,1), (2,1), (5,3), ...])
>>> clusters = cl.getclusters(2)
The "getclusters" method takes the number of clusters you would like to
have as its parameter.
Only numeric values are supported in the tuples. The reason is that the
"centroid" method I use essentially returns a tuple of floats, so any
other kind of metadata is lost. Once I figure out how to recode that
method, other types should be possible.
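To illustrate why only numeric tuples work, here is a minimal sketch of a component-wise centroid. The function below is a hypothetical stand-in written for this note, not the library's actual "centroid" implementation:

```python
def centroid(points):
    """Return the component-wise mean of equal-length numeric tuples.

    Hypothetical helper for illustration only; the library's own
    centroid code may differ.
    """
    if not points:
        raise ValueError("centroid() needs at least one point")
    count = float(len(points))
    # zip(*points) groups all first components, then all second ones, ...
    return tuple(sum(axis) / count for axis in zip(*points))


print(centroid([(1, 1), (3, 3)]))  # -> (2.0, 2.0)
```

Because every component is summed and divided, a non-numeric element (a label, for example) has no meaningful mean, which is why such metadata cannot survive the computation.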
- Optimized calculation of the hierarchical clustering by exploiting the fact
that the generated matrix is symmetrical.
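The idea behind this optimization can be sketched as follows. This is not the library's code; "distance_matrix" is a hypothetical helper, and it assumes that distfunc is symmetric and that the distance of an item to itself is 0:

```python
def distance_matrix(items, distfunc):
    """Build a full distance matrix while calling distfunc once per pair.

    Assumes distfunc(a, b) == distfunc(b, a) and distfunc(a, a) == 0.
    """
    n = len(items)
    matrix = [[0] * n for _ in range(n)]
    for i in range(n):
        # Only visit the upper triangle; mirror each value into the lower one.
        for j in range(i + 1, n):
            d = distfunc(items[i], items[j])
            matrix[i][j] = d
            matrix[j][i] = d
    return matrix


print(distance_matrix([1, 4, 9], lambda a, b: abs(a - b)))
# -> [[0, 3, 8], [3, 0, 5], [8, 5, 0]]
```

For n items this makes n*(n-1)/2 distance calls instead of n*n.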
- Implemented complete-, average-, and uclus-linkage methods. You can select
one by specifying it in the constructor, for example:
cl = HierarchicalClustering(data, distfunc, linkage='uclus')
or by setting it before starting the clustering process:
cl = HierarchicalClustering(data, distfunc)
cl.setLinkageMethod('uclus')
cl.cluster()
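For readers unfamiliar with these linkage names, here is a rough sketch of how the distance between two clusters could be derived from pairwise item distances. The function names are hypothetical, and treating "uclus" as the median of the pairwise distances is an assumption of this sketch, not a statement about the library's internals:

```python
from statistics import mean, median


def pairwise(a, b, distfunc):
    """All distances between an item of cluster a and an item of cluster b."""
    return [distfunc(x, y) for x in a for y in b]


def complete_linkage(a, b, distfunc):
    # Largest pairwise distance between the two clusters.
    return max(pairwise(a, b, distfunc))


def average_linkage(a, b, distfunc):
    # Mean of all pairwise distances.
    return mean(pairwise(a, b, distfunc))


def uclus_linkage(a, b, distfunc):
    # Assumption: median of all pairwise distances.
    return median(pairwise(a, b, distfunc))


dist = lambda x, y: abs(x - y)
print(complete_linkage([1, 2, 4], [10], dist))  # -> 9
print(uclus_linkage([1, 2, 4], [10], dist))     # -> 8
```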
- Clustering is not executed on object creation, but on the first call of
"getlevel". You can force the creation of the clusters by calling the
"cluster" method as shown above.
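The deferred behaviour described in this entry can be pictured with a small stand-in class. "LazyClusterer" is hypothetical and its "clustering" is just a sort; it only shows the pattern of doing the expensive work on first use:

```python
class LazyClusterer(object):
    """Hypothetical stand-in showing clustering deferred until first use."""

    def __init__(self, data):
        self._data = list(data)
        self._result = None   # nothing is computed on construction
        self.runs = 0         # counts how often cluster() did real work

    def cluster(self):
        # Placeholder for the expensive clustering step.
        self.runs += 1
        self._result = sorted(self._data)

    def getlevel(self, threshold):
        # The first call triggers the work; later calls reuse the result.
        if self._result is None:
            self.cluster()
        return [x for x in self._result if x <= threshold]


cl = LazyClusterer([3, 1, 2])
print(cl.runs)         # -> 0
print(cl.getlevel(2))  # -> [1, 2]
print(cl.runs)         # -> 1
```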