Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

xray::distribution on character vector with thousands of unique values #1

Closed
bbrewington opened this issue Nov 22, 2017 · 3 comments
Closed

Comments

@bbrewington
Copy link

I ran xray::distributions on a data frame with a couple columns, address (3185 unique values) and cfsm (4558 unique values), and it overwhelmed the plotting device. Maybe build in a way to only pick the top N elements by occurance, and lump the rest together into "other"? Or skip the variable altogether.

xray_distributions_issue

@sicarul
Copy link
Owner

sicarul commented Nov 22, 2017

I thought i handled that case 🤦‍♂️ thanks for the report, i'll fix it.

@sicarul
Copy link
Owner

sicarul commented Nov 22, 2017

Fixed with Commit c67e34b
I'll close the ticket now, let me know if you find any more bugs 🐛

Thanks!

@sicarul sicarul closed this as completed Nov 22, 2017
@bbrewington
Copy link
Author

Awesome, and will do. I REALLY like this package. Pretty simple, but automates some essential data quality processes and I'll be using it all the time. Wonder if this will make it into the tidyverse :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants