[FLINK-30479][doc] Document flink-connector-files for local execution #21549

liuml07 · 2022-12-22T00:52:13Z

What is the purpose of the change

The file system SQL connector itself is included in Flink and does not require an additional dependency. However, a user who uses the filesystem connector for local execution, for example when running a Flink job in the IDE, needs to add the dependency explicitly. Otherwise, the user gets a validation exception: Cannot discover a connector using option: 'connector'='filesystem'. This is confusing and should be documented.

Brief change log

The scope of the files connector dependency should be provided, because the connector should not be packaged into the job JAR file. For that reason we do not use the sql_download_table shortcode, e.g. {{< sql_download_table "files" >}}. That shortcode also renders text saying the dependencies are required for the SQL Client with SQL JAR bundles, which does not apply to the files connector because it is already shipped in the /lib directory.
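For illustration, a minimal sketch of the kind of Maven dependency this refers to. The artifact name comes from the PR title and the scope from the paragraph above; the `${flink.version}` property is an assumption about the project's POM, not something taken from this change.

```xml
<!-- Sketch only: files connector for local (IDE) execution. -->
<!-- ${flink.version} is an assumed Maven property; substitute the Flink version your job targets. -->
<dependency>
  <groupId>org.apache.flink</groupId>
  <artifactId>flink-connector-files</artifactId>
  <version>${flink.version}</version>
  <!-- provided: needed at compile and local run time, but not packaged into the job JAR -->
  <scope>provided</scope>
</dependency>
```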

Verifying this change

This is a doc change, and I have verified locally that it renders correctly.

Does this pull request potentially affect one of the following parts:

Dependencies (does it add or upgrade a dependency): (yes / no)
The public API, i.e., is any changed class annotated with @Public(Evolving): (yes / no)
The serializers: (yes / no / don't know)
The runtime per-record code paths (performance sensitive): (yes / no / don't know)
Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (yes / no / don't know)
The S3 file system connector: (yes / no / don't know)

Documentation

Does this pull request introduce a new feature? (yes / no)
If yes, how is the feature documented? (not applicable / docs / JavaDocs / not documented)

liuml07 · 2022-12-22T00:52:46Z

flinkbot · 2022-12-22T00:55:35Z

CI report:

1ace9fc Azure: SUCCESS

Bot commands

The @flinkbot bot supports the following commands:

@flinkbot run azure re-run the last Azure build

liuml07 · 2022-12-22T00:56:55Z

docs/content/docs/connectors/table/filesystem.md

+NOTE: If you use the filesystem connector for [local execution]({{< ref "docs/dev/dataset/local_execution" >}}),
+for e.g. running Flink job in your IDE, you will need to add dependency.
+
+```xml


The PR description explains why this does not use the sql_download_table shortcode, e.g. {{< sql_download_table "files" >}}.

liuml07 · 2022-12-22T04:20:47Z

CC: @twalthr @fapaul

MartijnVisser

I don't think this is necessarily the best method. I don't think this should be resolved per connector, but there is more value in documenting centrally what's needed to run things locally. If we do it per connector, you need to do it for all connectors, but also for things like running things that require Hadoop etc. WDYT?

liuml07 · 2022-12-22T09:48:41Z

Yeah, documenting centrally sounds good.

Maybe I'm limited by how I build jobs with connectors: I shade all connectors (and their dependencies) into the uber job JAR. For other connectors (e.g. Kafka), I add the dependency to the Flink job following the Maven snippet in each connector's doc page. That works for both local execution (IDE) and remote deployment. The filesystem connector is a bit special because it is already in the Flink distribution (so there is no need to shade it) but is not available for local execution. Adding a "provided" scope dependency for this connector solves my problem. I haven't found any other connector whose dependency needs to change for local execution.

I'm wondering where a good central place would be. There is a short guide for setting Hadoop dependencies for local execution. Do you think it's a good idea to write a new section in the Connectors and Formats page or the Advanced Configuration Topics page?

[FLINK-30479][doc] Document flink-connector-files for local execution

1ace9fc

liuml07 force-pushed the doc-connector-files branch from 3d0a250 to 1ace9fc Compare December 22, 2022 00:54

flinkbot added the component=Documentation label Dec 22, 2022

MartijnVisser requested changes Dec 22, 2022

liuml07 closed this Sep 4, 2024

liuml07 deleted the doc-connector-files branch September 4, 2024 00:33
