[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function #26667

lihaosky · 2025-06-10T21:44:49Z

What is the purpose of the change

Rewrite ml_evaluate table function scan to ml_predict table function scan and LogicalAggreate

Brief change log

Rewrite ml_evaluate table function scan to ml_predict table function scan and LogicalAggreate

Verifying this change

Unit test

Does this pull request potentially affect one of the following parts:

Dependencies (does it add or upgrade a dependency): (no)
The public API, i.e., is any changed class annotated with @Public(Evolving): (yes)
The serializers: (no)
The runtime per-record code paths (performance sensitive): (no)
Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (no)
The S3 file system connector: (no)

Documentation

Does this pull request introduce a new feature? (yes)
If yes, how is the feature documented? (JavaDocs)

flinkbot · 2025-06-10T21:52:24Z

CI report:

e301fb6 UNKNOWN
93d7430 Azure: SUCCESS

Bot commands

The @flinkbot bot supports the following commands:

@flinkbot run azure re-run the last Azure build

davidradl · 2025-07-01T13:22:35Z

flink-table/flink-table-common/src/main/java/org/apache/flink/table/ml/TaskType.java

    }

    public static boolean isValidTaskType(String name) {
        return Arrays.stream(values()).anyMatch(taskType -> taskType.name.equals(name));
    }
+
+    public static Optional<RuntimeException> throwOrReturnInvalidTaskType(


I think it would make more sense to call this method validateTaskType.

davidradl · 2025-07-01T13:25:49Z

...ava/org/apache/flink/table/planner/plan/rules/logical/ExpandMLEvaluateTableFunctionRule.java

+        String task = null;
+        if (taskNode instanceof RexLiteral) {
+            task = ((RexLiteral) taskNode).getValueAs(NlsString.class).getValue();
+            if (task == null || task.isEmpty()) {


we do not need to if (task == null if we are going to set it to null

davidradl · 2025-07-01T13:27:11Z

...ava/org/apache/flink/table/planner/plan/rules/logical/ExpandMLEvaluateTableFunctionRule.java

+                task = null;
+            }
+        }
+        if (task == null) {


this message is not accurate as we could have a task but it not be a RexLiteral

davidradl · 2025-07-01T13:27:49Z

...ava/org/apache/flink/table/planner/plan/rules/logical/ExpandMLEvaluateTableFunctionRule.java

+    private static String getTask(RexCall rexCall) {
+        final RexNode taskNode = rexCall.getOperands().get(4);
+        String task = null;
+        if (taskNode instanceof RexLiteral) {


It would be good to include a comment as to why we need a RexLiteral here.

I wonder if it would be better to have one method

String task = getRexLiteralFrom(rexCall.getOperands().get(4), true);
then all the checking and validation is done in one place.

Also I suggest removing the boolean, and always thowing an exception from the method, that the caller can catch and return as required.

davidradl · 2025-07-01T13:35:27Z

...ava/org/apache/flink/table/planner/plan/rules/logical/ExpandMLEvaluateTableFunctionRule.java

+                                                if (!(scan.getCall() instanceof RexCall)) {
+                                                    return false;
+                                                }
+                                                RexCall call = (RexCall) scan.getCall();


can't call be declared as a final also?

davidradl · 2025-07-01T13:41:59Z

...nk-table-common/src/test/java/org/apache/flink/table/factories/TestModelProviderFactory.java

@@ -77,6 +76,7 @@ public Set<ConfigOption<?>> requiredOptions() {
    public Set<ConfigOption<?>> optionalOptions() {
        Set<ConfigOption<?>> options = new HashSet<>();
        options.add(MODEL_VERSION);
+        options.add(TASK);


I am curious about this test, you have added task as a config option to a test model provider factory. I notice that
OpenAIModelProviderFactory implements ModelProviderFactory should this not pickup the task in the same way as the test here?

airlock-confluentinc bot force-pushed the model-evaluate-rewrite branch from de7bce9 to b0cc113 Compare June 10, 2025 21:49

airlock-confluentinc bot force-pushed the model-evaluate-rewrite branch 2 times, most recently from 10b36e8 to 84452ad Compare June 16, 2025 19:20

[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function

e301fb6

airlock-confluentinc bot force-pushed the model-evaluate-rewrite branch from 84452ad to e301fb6 Compare June 17, 2025 02:38

lihaosky added 2 commits June 16, 2025 19:39

fix

402ea01

fix

93d7430

davidradl reviewed Jul 1, 2025

View reviewed changes

github-actions bot added community-reviewed and removed community-reviewed labels Jul 1, 2025

github-actions bot added community-reviewed and removed community-reviewed labels Jul 5, 2025

github-actions bot added community-reviewed and removed community-reviewed labels Jul 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function #26667

[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function #26667

lihaosky commented Jun 10, 2025 •

edited

Loading

Uh oh!

flinkbot commented Jun 10, 2025 •

edited

Loading

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

davidradl Jul 1, 2025

Uh oh!

Uh oh!

[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function #26667

Are you sure you want to change the base?

[FLINK-37795] rewrite ml_evaluate to ml_predict and aggregate function #26667

Conversation

lihaosky commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What is the purpose of the change

Brief change log

Verifying this change

Does this pull request potentially affect one of the following parts:

Documentation

Uh oh!

flinkbot commented Jun 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CI report:

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidradl Jul 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

lihaosky commented Jun 10, 2025 •

edited

Loading

flinkbot commented Jun 10, 2025 •

edited

Loading