Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

USA births dataset? #1

Closed
jakevdp opened this issue Jun 10, 2015 · 3 comments
Closed

USA births dataset? #1

jakevdp opened this issue Jun 10, 2015 · 3 comments

Comments

@jakevdp
Copy link

jakevdp commented Jun 10, 2015

Any interest in adding this dataset to the repo? I just put it together yesterday: https://github.com/jakevdp/data-CDCbirths
It's been analyzed in the past by Andrew Gelman & his team, and I think allows for some interesting possibilities!

@mwaskom
Copy link
Owner

mwaskom commented Jun 12, 2015

This is a cool dataset, although I'm not sure it shows off any of the things seaborn is particularly good at doing, since most of the plots you'd make with it are single timeseries traces. Unless you had something in particular in mind? I'd love to replace the tips dataset with something that is actually somewhat scientifically interesting!

I also think it would be nice to have a more general purpose PyData dataset repository, seaborn-data isn't quite that, but I'd be happy to point load_dataset at it (the current system was just a quick hack).

@jakevdp
Copy link
Author

jakevdp commented Jun 12, 2015

That's a good idea! Note that there's already something like this in the broader ML community: http://mldata.org/
Scikit-learn has tools to access this data: http://scikit-learn.org/stable/datasets/mldata.html

@jakevdp
Copy link
Author

jakevdp commented Jun 12, 2015

I think you're right that this data does not necessarily take advantage of Seaborn's strengths, and in that light it doesn't make much sense to add it here. Closing the issue!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants