forked from pandas-dev/pandas
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
PERF: StataReader is slow due to excessive lookups for missing vales
Previous versions of StataReader did not correctly check for missing values. This was fixed in a previous PR, but these checks had previously been implemented on a value-by-value basis. This has now been changed to a vectorized version that is orders of magnitude faster. Adds support for preserving Stata missing values when importing data using the convert_missing argument. When True, missing values will be returned as StataMissingValue objects. Test cases for missing value conversion have been included. Additionally, a benchmark was added to monitor performance issues in the future. Closes pandas-dev#8040
- Loading branch information
Showing
8 changed files
with
224 additions
and
41 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Binary file not shown.
Binary file not shown.
Binary file not shown.
Oops, something went wrong.