Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

__repr__ and __str__ have inefficient implementations #40

Closed
devin-petersohn opened this issue Jul 12, 2018 · 1 comment
Closed

__repr__ and __str__ have inefficient implementations #40

devin-petersohn opened this issue Jul 12, 2018 · 1 comment
Labels
Performance 🚀 Performance related issues and pull requests.

Comments

@devin-petersohn
Copy link
Collaborator

Describe the problem

Printing large dataframes is slow.

Also see #37. We are concatenating full columns, which is very slow for large dataframes. A solution similar to #39 will need to be implemented.

Source code / logs

import modin.pandas as pd

df = pd.read_csv(...)
print(df)
@devin-petersohn devin-petersohn added the Performance 🚀 Performance related issues and pull requests. label Jul 12, 2018
@kunalgosar
Copy link
Collaborator

Closed via #41

naren-ponder pushed a commit to naren-ponder/modin that referenced this issue Jan 11, 2023
…to-numpy

Fix to_numpy by making it like to_pandas.
mvashishtha pushed a commit to mvashishtha/modin that referenced this issue May 10, 2023
…ely w/in pushdown code ) (modin-project#40) [no upstream]

* Simplify the masking approach to .loc

* Remove iloc hack code for loc support
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Performance 🚀 Performance related issues and pull requests.
Projects
None yet
Development

No branches or pull requests

2 participants