Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[C++] Add fixed_size_binary support to compute functions #29496

Closed
asfimport opened this issue Sep 3, 2021 · 7 comments
Closed

[C++] Add fixed_size_binary support to compute functions #29496

asfimport opened this issue Sep 3, 2021 · 7 comments
Assignees
Milestone

Comments

@asfimport
Copy link
Collaborator

asfimport commented Sep 3, 2021

The following compute functions do not support fixed_size_binary but do support binary:

  • binary_length
  • binary_replace_slice
  • count_substring
  • find_substring
  • find_substring_regex
  • equal
  • greater
  • greater_equal
  • less
  • less_equal

Reporter: Weston Pace / @westonpace
Assignee: David Li / @lidavidm

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-13878. Please see the migration documentation for further details.

@asfimport
Copy link
Collaborator Author

Eduardo Ponce / @edponce:
Can we break these into individual issues (if not already)?

@asfimport
Copy link
Collaborator Author

Eduardo Ponce / @edponce:
FixedSizeBinaryType serves as the base class for DecimalTypes, then this would allow casting decimals to binary and applying string transforms. Is there any use case where it is used to represent strings for which the substring kernels make sense? Well, maybe there is an application that uses fixed size strings for some data. Based on this, then all the other string kernels in ARROW-13879 also need to be updated to support FixedSizeBinaryTypes.

@asfimport
Copy link
Collaborator Author

Eduardo Ponce / @edponce:
I would have made the distinction that kernels that are named with string or unambiguously operate on strings should only support string types. Nevertheless, in the software community the terms binary strings are common, so it would be difficult to provide a consistent nomenclature.

@asfimport
Copy link
Collaborator Author

Eduardo Ponce / @edponce:
Also, should string kernels prefixed with ascii or utf8 need to support binary types as well? or at least the ASCII ones?

@asfimport
Copy link
Collaborator Author

David Li / @lidavidm:
Decimal is not relevant here. ascii/utf8 should never support binary, they may rely on invariants which do not hold - cast first.

@asfimport
Copy link
Collaborator Author

Eduardo Ponce / @edponce:
@lidavidm Thanks for the clarifications.

@asfimport
Copy link
Collaborator Author

Antoine Pitrou / @pitrou:
Issue resolved by pull request 11230
#11230

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants