Support multiple planner output dialects #24

Closed
ryannedolan opened this issue May 31, 2023 · 11 comments
Labels: good first issue (Good for newcomers)

Comments

@ryannedolan
Collaborator

The planner is currently hard-coded to output MySQL dialect, which Flink natively understands. In order to target additional runtimes, we need a way to generate SQL in other dialects. N.B. the planner is largely dialect-agnostic already, until you ask for the plan as a String.
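For context, the dialect-agnostic part is standard Calcite behavior: a RelNode plan carries no dialect, and the dialect only matters when the plan is rendered as SQL text. A minimal sketch using plain Calcite (not Hoptimator code; assumes a recent Calcite version where RelToSqlConverter.visitRoot is available):

import org.apache.calcite.rel.RelNode;
import org.apache.calcite.rel.rel2sql.RelToSqlConverter;
import org.apache.calcite.sql.SqlDialect;
import org.apache.calcite.sql.SqlNode;

// Render the same dialect-agnostic plan as SQL text in whichever dialect is requested.
static String toSql(RelNode plan, SqlDialect dialect) {
  SqlNode statement = new RelToSqlConverter(dialect).visitRoot(plan).asStatement();
  return statement.toSqlString(dialect).getSql();
}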

@ryannedolan added the good first issue label on May 31, 2023
@HoganEdwardChu
Contributor

Hi, I don't mean to disturb you; I've been following this with interest. Is this the line you mean?

SqlDialect OUTPUT_DIALECT = MysqlSqlDialect.DEFAULT; // closely resembles Flink SQL

@ryannedolan
Collaborator Author

Is this the line you mean?

Yep. It looks like the hard-coded dialect is only used in a couple of methods that return SQL strings. It might make sense to change those to return the underlying ScriptImplementer, which is dialect-agnostic. Then callers could do rel.query(...).sql(dialect) and get whatever dialect they want.
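To illustrate the call shape being suggested (ScriptImplementer, query(), and sql() here are assumptions about a possible API, not existing Hoptimator signatures):

// Hypothetical: the planner returns the dialect-agnostic ScriptImplementer,
// and the caller chooses the output dialect at the last moment.
ScriptImplementer script = rel.query(...);
String flinkSql = script.sql(MysqlSqlDialect.DEFAULT);   // closely resembles Flink SQL
String ansiSql = script.sql(AnsiSqlDialect.DEFAULT);     // or any other Calcite dialect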

@HoganEdwardChu
Contributor

HoganEdwardChu commented Jun 5, 2023

Oh, do you mean this way: https://github.com/HoganEdwardChu/Hoptimator/pull/1/files? I haven't gone through all the code yet, so I don't know the specific details and it might seem lacking (sorry about that). But then I think we also have to get the input dialect.

@ryannedolan
Collaborator Author

this way?

Exactly, though I'm not sure where to go from there. One way would be to add a dialect parameter to pipeline(), such that the operator can generate pipelines in whatever dialect it wants. For now, the operator can just use the MySQL dialect.

Looking ahead, we'd like to be able to implement alternative runtimes, e.g. using Spark SQL instead of Flink SQL. Ideally, those changes would be in the operator, not the planner. So this would be a step in that direction.
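A rough sketch of that direction (pipeline() and the Pipeline type are assumptions based on this discussion, not the actual signatures):

// The operator picks the output dialect; the planner stays dialect-agnostic.
Pipeline flinkPipeline = plan.pipeline(MysqlSqlDialect.DEFAULT);   // closely resembles Flink SQL
// A future Spark-based operator could instead ask for Spark SQL:
Pipeline sparkPipeline = plan.pipeline(SparkSqlDialect.DEFAULT);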

@HoganEdwardChu
Contributor

HoganEdwardChu commented Jun 5, 2023

Thanks for the detailed and contextual explanation. I'll think about what you said as well (and if I find a good direction, I'll try it and share it here!). I'd also like to try implementing an alternative runtime using Spark SQL instead of Flink SQL (the operator part).

@ryannedolan
Collaborator Author

I haven't given much thought to how we'd support multiple runtimes in the operator. There is a Flink adapter, which is really the only place in the control plane where we assume the SQL will run on Flink: https://github.com/linkedin/Hoptimator/blob/main/hoptimator-flink-adapter/src/main/java/com/linkedin/hoptimator/operator/flink/FlinkSqlJobReconciler.java#L57

As you can see, the flink adapter just passes the SQL from SqlJob to a FlinkDeployment (which is handled by the external flink-kubernetes-operator). Theoretically, we could do something similar for Spark or any other SQL runtime. I'm not sure how valuable that would be, but it seems possible.
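Just to make that pattern concrete, here is a conceptual sketch of what a Spark equivalent could look like; every name below is a placeholder for illustration, not a real Hoptimator or Spark-operator class:

// Placeholder types, for illustration only.
interface SqlJobSpec { String sql(); }                    // carries the pipeline SQL
interface SparkAppSpec { void setSqlText(String sql); }   // hypothetical Spark CRD spec

// Same idea as the Flink adapter: pass the planner's SQL through unchanged and
// let an external operator (here, a Spark operator) actually run it.
static void reconcile(SqlJobSpec job, SparkAppSpec app) {
  app.setSqlText(job.sql());
}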

@HoganEdwardChu
Contributor

I'll have to think about it a bit. If you have a strong opinion on whether it's better to put the dialect in the pipeline, please go ahead. I'll keep trying on my end. I think building another adapter should be postponed for now; the most important thing is to handle the Flink job well.

@HoganEdwardChu
Contributor

HoganEdwardChu commented Jun 7, 2023

@ryannedolan
Collaborator Author

adding sqlDialect to pipeline

Makes sense to me!

@ryannedolan
Collaborator Author

Thanks @HoganEdwardChu for #31

@HoganEdwardChu
Contributor

Thanks!!!!!

I am trying to add another SQL runtime, like Spark. If there is any valuable progress, I will open a PR! @ryannedolan
