Enable the pipelines for Flink non-local execution modes as well #893

Open
chandrashekar-s opened this issue Nov 28, 2023 · 1 comment
Labels
enhancement New feature or request P3:TBE An issue that is not evaluated and/or planned for implementation yet.

Comments

@chandrashekar-s
Collaborator

Currently the pipelines are validated and tested only for the Flink local execution mode. Validate them, and make any necessary changes, for the non-local execution modes as well.

@bashir2 bashir2 added P3:TBE An issue that is not evaluated and/or planned for implementation yet. enhancement New feature or request labels Dec 5, 2023
@chandrashekar-s
Collaborator Author

chandrashekar-s commented Feb 15, 2024

The following improvements have been made for the Flink local execution mode:

  1. The Flink configuration file is auto-generated with appropriate values. The parameters are determined on a best-effort basis so that the pipelines do not fail even under high load. Refer here for details.
  2. The number of threads (parallelism) defaults to the number of cores on the machine, but can be overridden over here. In local mode, by default only one worker is created per pipeline, and that single worker provides all the parallelism. In non-local mode, however, the cluster can distribute the load across workers (TaskManagers) to achieve the needed parallelism.
  3. The Parquet row group size is now configurable, so that the pipeline does not consume too much heap memory; the changes can be found here.
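To make the three points above concrete, here is a minimal sketch of the idea; it is not the project's actual code. The class and method names are hypothetical, the values are illustrative, and only the two Flink keys (`parallelism.default`, `taskmanager.numberOfTaskSlots`) are standard Flink options. The heap estimate assumes each open Parquet writer buffers roughly one row group before flushing.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Illustrative only: names and values here are assumptions, not the pipeline's real code.
public class LocalFlinkConfSketch {

  // A non-positive request means "not set": fall back to the number of cores.
  static int resolveParallelism(int requested) {
    return requested > 0 ? requested : Runtime.getRuntime().availableProcessors();
  }

  // Rough upper bound on heap held by Parquet writers, assuming every open
  // writer buffers about one full row group before it is flushed to disk.
  static long estimateParquetBufferBytes(int parallelism, int openWritersPerThread, long rowGroupBytes) {
    long writers = Math.multiplyExact((long) parallelism, (long) openWritersPerThread);
    return Math.multiplyExact(writers, rowGroupBytes);
  }

  // Renders a tiny Flink configuration in "key: value" form.
  // In local mode a single worker supplies all slots, so slots == parallelism here.
  static String renderConf(int parallelism) {
    Map<String, String> conf = new LinkedHashMap<>();
    conf.put("parallelism.default", String.valueOf(parallelism));
    conf.put("taskmanager.numberOfTaskSlots", String.valueOf(parallelism));
    StringBuilder sb = new StringBuilder();
    for (Map.Entry<String, String> e : conf.entrySet()) {
      sb.append(e.getKey()).append(": ").append(e.getValue()).append('\n');
    }
    return sb.toString();
  }

  public static void main(String[] args) {
    int parallelism = resolveParallelism(0); // 0 = no override, use all cores
    System.out.print(renderConf(parallelism));
    long rowGroupBytes = 32L * 1024 * 1024; // illustrative 32 MiB row group
    long bytes = estimateParquetBufferBytes(parallelism, 2, rowGroupBytes);
    System.out.println("Estimated Parquet buffer bytes: " + bytes);
  }
}
```

The estimate shows why a smaller row group helps in local mode: buffered bytes grow linearly with both the thread count and the row group size, and all of it lands on one worker's heap.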

Since resources are more abundant in the non-local execution mode, these properties can be fine-tuned for it. A few changes might be needed to suit the needs of the cluster.
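As a rough sizing aid for a cluster, the sketch below computes how many TaskManagers are needed to host a given parallelism, assuming each parallel instance occupies one task slot and every TaskManager offers the same number of slots. The class and method names are hypothetical and the numbers are only examples.

```java
// Illustrative only: a back-of-the-envelope sizing helper, not project code.
public class ClusterSlotsSketch {

  // Ceiling division: the smallest TaskManager count whose total slots
  // cover the requested parallelism.
  static int taskManagersNeeded(int parallelism, int slotsPerTaskManager) {
    if (parallelism <= 0 || slotsPerTaskManager <= 0) {
      throw new IllegalArgumentException("parallelism and slots must be positive");
    }
    return (parallelism + slotsPerTaskManager - 1) / slotsPerTaskManager;
  }

  public static void main(String[] args) {
    // 10 parallel instances on TaskManagers with 4 slots each need 3 TaskManagers.
    System.out.println(taskManagersNeeded(10, 4));
  }
}
```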
