Speed up post-clustering processing #32

mhhennig · 2018-05-25T18:53:54Z

Building the table of clusters after clustering takes a long time, often longer than the clustering itself. This seems unnecessary and needs checking. The relevant code is the last part of the CombinedClustering method.

frozenblit · 2018-05-30T15:16:49Z

Have you tried using spikes.groupby('cl')? Then you can loop over the tuples (cluster_number, spikes within that cluster) and use all the pandas machinery to efficiently calculate the grouped means and amplitudes. In my experience that was much faster than iterating over the cluster labels.
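Something along these lines, as a rough sketch only. Apart from the `cl` column, everything here is an assumption for illustration: the toy `spikes` DataFrame, the `x`, `y` and `Amplitude` column names, the `Size` column and the `build_cluster_table` helper are hypothetical and not taken from the actual code.

```python
import pandas as pd

# Toy stand-in for the spikes table. Only the 'cl' (cluster label) column
# name comes from the discussion; the other columns are assumed.
spikes = pd.DataFrame({
    "cl": [0, 0, 1, 1, 1, 2],
    "x": [1.0, 3.0, 10.0, 12.0, 14.0, 5.0],
    "y": [2.0, 4.0, 20.0, 22.0, 24.0, 6.0],
    "Amplitude": [5.0, 7.0, 1.0, 2.0, 3.0, 9.0],
})


def build_cluster_table(spikes):
    """Build one row per cluster with mean position, mean amplitude and size."""
    grouped = spikes.groupby("cl")
    # Vectorised per-cluster means, computed in a single pass over the groups.
    clusters = grouped[["x", "y", "Amplitude"]].mean()
    # Number of spikes in each cluster, aligned on the cluster label index.
    clusters["Size"] = grouped.size()
    return clusters


clusters = build_cluster_table(spikes)

# The explicit loop over (cluster_number, spikes within that cluster) tuples
# is also available when a per-cluster computation has no built-in aggregate.
sizes_from_loop = {cl: len(group) for cl, group in spikes.groupby("cl")}
```

The point is that the grouping is done once, rather than filtering the whole table again with something like `spikes[spikes.cl == c]` for every cluster label.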

mhhennig · 2018-05-31T21:50:55Z

Indeed, wow, thank you Fernando! This is now much faster, and as a side effect, DBSCAN now works flawlessly.

frozenblit · 2018-06-01T07:16:27Z

Pandas is magical ;)

martinosorb · 2018-06-01T08:28:07Z

Ah, this is my fault, I don't understand Pandas very well. Thanks Fernando!

mhhennig added the enhancement label May 25, 2018

mhhennig self-assigned this May 25, 2018

mhhennig closed this as completed May 31, 2018
