Converting pandas index takes very long, add in arrow_table. #41

Merged
merged 2 commits into master from performance-fix on Jul 24, 2023

justinGilmer

This fixes a small regression in the arrow_to_dataframe method: setting the time column as the pandas index during conversion causes a large slowdown. The cost comes from pandas, not from our code, but our use cases rely on pandas dataframes, so it still affects us.

If we need to set the index, we can do that at a different point in the workflow, not while converting the data we get back from arrow. Setting it during conversion makes the arrow method appear very slow when it is not.

@justinGilmer justinGilmer merged commit 56f75fe into master Jul 24, 2023
15 checks passed
@justinGilmer justinGilmer deleted the performance-fix branch July 24, 2023 16:34