Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use a better histogram binning algorithm by default #214

Closed
mjskay opened this issue Feb 18, 2024 · 1 comment
Closed

Use a better histogram binning algorithm by default #214

mjskay opened this issue Feb 18, 2024 · 1 comment

Comments

@mjskay
Copy link
Owner

mjskay commented Feb 18, 2024

Sturges is too conservative (usually not enough bins). Possibilities:

  • FD will do lots of bins (maybe too many?).
  • Scott will generally do more than Sturges and less than FD, where Scott is supposed to be optimal for normal distributions. This makes it an attractive option.
  • KernSmooth::dpih is an extension of Scott that should work well on non-normal distributions too. The KernSmooth license is permissive so one option is to look into creating a weighted version of it (we already have weighted versions of the other bin selection algorithms).
@mjskay
Copy link
Owner Author

mjskay commented Feb 24, 2024

I'd say, Scott for now and if I (or someone else) wanders along and wants to make a weighted variant of KernSmooth::dpi, we'll try that.

@mjskay mjskay closed this as completed in 8e801e9 Feb 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant