Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DataFrame dream / long term future #6

Open
maartenbreddels opened this issue Sep 2, 2019 · 0 comments
Open

DataFrame dream / long term future #6

maartenbreddels opened this issue Sep 2, 2019 · 0 comments

Comments

@maartenbreddels
Copy link

What does the long term future look like in the Python/PyData landscape (say 2025)? What would be the ideal 'dream' dataframe library? E.g, what are the issues we need to tackle?

For instance, vaex solves most of the 2017 issues mentioned by Wes: https://wesmckinney.com/blog/apache-arrow-pandas-internals/

Also think about:

  • Sizes of datasets (e.g rows and/or column counts), compared to current hardware+Moore's Law.
  • Kinds of data, more unstructured?
  • Expectation on the hardware, more cores, more GPU?
  • Distributed vs cloud vs single computer
  • API, (e.g. expose laziness or not?)

Are we going in the right direction, also taking into account the convergence/divergence of dataframe libraries?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant