-
Notifications
You must be signed in to change notification settings - Fork 380
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] When the regexExpr of the split function is an empty character or a non-ASCII character, fall back to Valina Spark #6127
Comments
This is because the current split only supports splitting delemiter, while spark supports regexp. If necessary, I can add a PR to fix this problem, because Velox already supports regexp_split. |
@jackylee-ch Please fix it. |
hey @mcdull-zhang i am working on fixing split issue, there is a fix PR in velox facebookincubator/velox#10248, once the PR ready, i will update gluten part. |
hey @jackylee-ch |
Great job, wait for your fix. |
Backend
VL (Velox)
Bug description
empty character
non-ASCII character
Spark version
Spark-3.2.x
Spark configurations
No response
System information
No response
Relevant logs
java.lang.RuntimeException: Exception: VeloxRuntimeError Error Source: RUNTIME Error Code: INVALID_STATE Reason: (0 vs. 1) split only supports only single-character pattern Retriable: False Expression: patternString.size() == 1 Context: split(n0_0, :VARCHAR) Top-Level Context: Same as context. Function: apply File: ep/build-velox/build/velox_ep/velox/functions/sparksql/SplitFunctions.cpp
The text was updated successfully, but these errors were encountered: