[SPARK-47566][SQL] Support SubstringIndex function to work with collated strings #45725
Closed
miland-db wants to merge 45 commits into apache:master from miland-db:miland-db/substringIndex-stringLocate
Commits
d2e75fe  Add find method with collation supported (miland-db)
2097a66  Add SubstringIndex support for collated strings (miland-db)
b3bd34a  Improve unit tests and fix bugs (miland-db)
15c5491  Fix bug with the rfind on collated strings (miland-db)
5925763  Merge branch 'master' into miland-db/substringIndex-stringLocate (miland-db)
34ee8af  Resolve merge problems with master (miland-db)
2f8f13d  improve scala style (miland-db)
5538b07  Add tests to UTF8StringWithCollationSuite (miland-db)
ee6c67d  Improve tests and fix bug with collatedFind (miland-db)
1bfb027  Fix Java linter error (miland-db)
d2616f6  Remove repeated code for getting collationId (miland-db)
69fa05c  Merge branch 'master' into substringIndex-stringLocate (miland-db)
6d380ce  Improve method naming for collation aware methods, and remove lowerca… (miland-db)
f28eac9  Merge branch 'master' into substringIndex-stringLocate (miland-db)
5c9865a  Improve scala/java style (miland-db)
4837159  Update getStringSearch naming (miland-db)
027ab04  Add doc comment (miland-db)
208042b  Remove unrelated change: blank line in UTF8StringWithCollationSuite.java (miland-db)
16f5b15  Merge branch 'master' into substringIndex-stringLocate (miland-db)
39827bf  Merge latest master and add SubstringIndex to CollationTypeCasts tran… (miland-db)
cc453b1  Add empty lines between imports (miland-db)
8831848  Handle all collationIds in getStringSearch (miland-db)
dccd63d  Merge branch 'master' into substringIndex-stringLocate (miland-db)
2765e6b  Add code for SubstringIndex to CollationSupport (miland-db)
9ff4b50  Add SubstringIndex functionality and fix errors (miland-db)
ae2a572  Fix java line length (miland-db)
e7db0e9  Remove unused import (miland-db)
71ed00c  Add SubstringIndex to CollationTypeCasts (miland-db)
eefa77b  Refactor tests (miland-db)
bf6bb2a  Merge branch 'master' into substringIndex-stringLocate (miland-db)
c9e8788  Rename methods in CollationSupport to be as in UTF8String (miland-db)
02c927a  Merge branch 'master' into substringIndex-stringLocate (miland-db)
1d0531b  Merge branch 'master' into substringIndex-stringLocate (miland-db)
f5760fc  Merge branch 'master' into substringIndex-stringLocate (miland-db)
59953e9  Added tests (1 failing) (miland-db)
5e81c3c  Merge branch 'master' into substringIndex-stringLocate (miland-db)
9dbc55d  Added new tests with variable length character. (miland-db)
6102e49  Merge branch 'master' into substringIndex-stringLocate (miland-db)
ba1be71  Add tests and sync with master (miland-db)
2e6db7a  Fix Case-variable character length bug (miland-db)
c9f98d1  Add new tests (miland-db)
ae1bcf6  Fix java linter (miland-db)
ea76298  Merge branch 'master' into substringIndex-stringLocate (miland-db)
626e36c  Merge branch 'master' into substringIndex-stringLocate (miland-db)
437de71  Add more test cases with case variable character length (miland-db)
The class CollationAwareUTF8String is getting bigger. Shall we move it to its own file?
Maybe in the next PR. We will consider this option.
Agreed, we should do this in #45820.