xray::distribution on character vector with thousands of unique values #1

bbrewington · 2017-11-22T04:07:56Z

I ran xray::distributions on a data frame with a couple columns, address (3185 unique values) and cfsm (4558 unique values), and it overwhelmed the plotting device. Maybe build in a way to only pick the top N elements by occurance, and lump the rest together into "other"? Or skip the variable altogether.

sicarul · 2017-11-22T04:36:35Z

I thought i handled that case 🤦‍♂️ thanks for the report, i'll fix it.

sicarul · 2017-11-22T05:03:48Z

Fixed with Commit c67e34b
I'll close the ticket now, let me know if you find any more bugs 🐛

Thanks!

bbrewington · 2017-11-22T17:51:39Z

Awesome, and will do. I REALLY like this package. Pretty simple, but automates some essential data quality processes and I'll be using it all the time. Wonder if this will make it into the tidyverse :)

sicarul closed this as completed Nov 22, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

xray::distribution on character vector with thousands of unique values #1

xray::distribution on character vector with thousands of unique values #1

bbrewington commented Nov 22, 2017

sicarul commented Nov 22, 2017

sicarul commented Nov 22, 2017

bbrewington commented Nov 22, 2017

xray::distribution on character vector with thousands of unique values #1

xray::distribution on character vector with thousands of unique values #1

Comments

bbrewington commented Nov 22, 2017

sicarul commented Nov 22, 2017

sicarul commented Nov 22, 2017

bbrewington commented Nov 22, 2017