-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Handle incoming Object dtype data #1645
Changes from 10 commits
2bb098f
3063a7b
4a4f230
e07dce8
8ab16a2
11e2cb3
3dd24da
903e2da
9893807
1c63a49
97edf31
4827ca0
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -93,7 +93,7 @@ | |
|
||
DEFAULT_TYPE = Unknown | ||
|
||
INFERENCE_SAMPLE_SIZE = 100000 | ||
INFERENCE_SAMPLE_SIZE = 100_000 | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. We might have to keep this at 100,000 for the time being. Reducing this exposes issues with larger datasets like |
||
|
||
|
||
class TypeSystem(object): | ||
|
@@ -383,6 +383,8 @@ def get_inference_matches(types_to_check, series, type_matches=[]): | |
Categorical in type_matches or Double in type_matches | ||
) and IntegerNullable in type_matches: | ||
best_match = IntegerNullable | ||
elif Categorical in type_matches and Double in type_matches: | ||
best_match = Double | ||
else: | ||
best_match = type_matches[0] | ||
best_depth = self._get_depth(best_match) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This was the cause of the current LG ww perf issue