Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature][Flink] Introducing the INSERT OVERWRITE statement for mixed-streaming format tables. #4

Open
2 of 3 tasks
YesOrNo828 opened this issue Jul 19, 2022 · 2 comments
Assignees
Labels
good first issue Good for newcomers module:mixed-flink Flink moduel for Mixed Format

Comments

@YesOrNo828
Copy link
Contributor

YesOrNo828 commented Jul 19, 2022

Search before asking

  • I have searched in the issues and found no similar issues.

What would you like to be improved?

Currently, the insert overwrite statement is supported for mixed-streaming format tables without primary key specification. In order to meet the batch processing capability of the Flink engine on keyed tables.

Mixed-streaming format tables should include mixed-iceberg and mixed-hive format tables.

INSERT OVERWRITE [catalog_name.][db_name.]table_name [column_list] select_statement

column_list:
  (col_name1 [, column_name2, ...])

OVERWRITE

INSERT OVERWRITE will overwrite any existing data in the table or partition. Otherwise, new data is appended.

COLUMN LIST

Given a table T(a INT, b INT, c INT), Flink supports INSERT INTO T(c, b) SELECT x, y FROM S. The expectation is that ‘x’ is written to column ‘c’ and ‘y’ is written to column ‘b’ and ‘a’ is set to NULL (assuming column ‘a’ is nullable).

How should we improve?

Flink API should implement the interface: SupportsOverwrite;

This feature only works in flink batch runtime mode.

Affected Flink versions: flink1.12/flink1.14/flink1.15.

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Subtasks

No response

Code of Conduct

@YesOrNo828 YesOrNo828 added the module:mixed-flink Flink moduel for Mixed Format label Jul 19, 2022
@YesOrNo828 YesOrNo828 added the good first issue Good for newcomers label Aug 11, 2022
shidayang referenced this issue in shidayang/arctic Nov 25, 2022
shidayang referenced this issue in shidayang/arctic Jan 29, 2023
Adapt new Transaction model
@YesOrNo828 YesOrNo828 changed the title Flink: support insert overwrite arctic table which has a primary key definition [Improvement][Flink] Introducing the INSERT OVERWRITE statement for mixed-iceberg format tables. Apr 24, 2023
@YesOrNo828 YesOrNo828 changed the title [Improvement][Flink] Introducing the INSERT OVERWRITE statement for mixed-iceberg format tables. [Feature][Flink] Introducing the INSERT OVERWRITE statement for mixed-iceberg format tables. Apr 24, 2023
@YesOrNo828 YesOrNo828 changed the title [Feature][Flink] Introducing the INSERT OVERWRITE statement for mixed-iceberg format tables. [Feature][Flink] Introducing the INSERT OVERWRITE statement for mixed-streaming format tables. Apr 26, 2023
@czy006
Copy link
Contributor

czy006 commented Mar 20, 2024

@xujiangfeng001 I wonder if the work is still moving forward?

@xujiangfeng001
Copy link
Contributor

@xujiangfeng001 I wonder if the work is still moving forward?

Hi @czy006 , I'm very sorry, I don't have time to continue advancing this issue recently. Can you help me push it forward ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers module:mixed-flink Flink moduel for Mixed Format
Projects
None yet
Development

No branches or pull requests

3 participants