New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generalizing support for categorical data #140

jbednar opened this Issue Apr 14, 2016 · 0 comments


None yet
3 participants
Copy link

jbednar commented Apr 14, 2016

Right now, we offer the count_cat reduction operator, which effectively does the count reduction separately per pixel. Most of the other reduction operators would also be meaningful to do categorically, and it would be a pain to implement a categorical version of each one as it is needed.

Instead, we should be able to implement a general-purpose categorical operator, similar to the summary operator, which takes a scalar reduction operator and applies it per category. @jcrist suggests a syntax like by('category_column', mean('some_other_column')), which seems clear to me as well. I think we should do this as soon as we need to apply any operator other than count to categories, since making a general version is probably not much more difficult than copying count_cat and changing the counting bits.

@jbednar jbednar added the wishlist label Apr 15, 2016

@jbednar jbednar added ready and removed wishlist labels Sep 8, 2016

@jbednar jbednar assigned jbcrail and unassigned jcrist Apr 11, 2017

@jbcrail jbcrail added this to the 0.6.0 milestone May 12, 2017

@jbednar jbednar removed this from the 0.6.0 milestone Aug 18, 2017

@jbednar jbednar referenced this issue Dec 19, 2018


Datashader internals to-do list #672

4 of 6 tasks complete
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment