@HuangXingBo
Contributor

What is the purpose of the change

This pull request adds support for the DataStream PythonProcessOperator in thread mode.

Brief change log

  • Refactor the directory structure of the current Python DataStream Operators
  • Add support for EmbeddedPythonProcessOperator

Verifying this change

This change added tests and can be verified as follows:

  • EmbeddedDataStreamTests in test_data_stream.py

Does this pull request potentially affect one of the following parts:

  • Dependencies (does it add or upgrade a dependency): (no)
  • The public API, i.e., is any changed class annotated with @Public(Evolving): (no)
  • The serializers: (no)
  • The runtime per-record code paths (performance sensitive): (no)
  • Anything that affects deployment or recovery: JobManager (and its components), Checkpointing, Kubernetes/Yarn, ZooKeeper: (no)
  • The S3 file system connector: (no)

Documentation

  • Does this pull request introduce a new feature? (no)
  • If yes, how is the feature documented? (not applicable)

@flinkbot
Collaborator

flinkbot commented Jul 14, 2022

CI report:

Bot commands

The @flinkbot bot supports the following commands:
  • @flinkbot run azure: re-run the last Azure build

actual.sort()
self.assertEqual(expected, actual)

def test_basic_operations(self):
Contributor

Could we refactor the test cases a bit and move the ones that should run in both process mode and thread mode into a single class?
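One way to read this suggestion is a shared mixin that holds the common test cases, with one thin subclass per execution mode. The sketch below is only an illustration under stated assumptions: `DataStreamTestsMixin`, `ProcessDataStreamTests`, and the `execution_mode` attribute are hypothetical names, the plain dict stands in for the real environment configuration, and the pipeline is replaced by a list comprehension so the example runs without PyFlink. Only `EmbeddedDataStreamTests` and `test_basic_operations` are names taken from this pull request.

```python
import unittest


class DataStreamTestsMixin:
    """Test cases intended to run in both process mode and thread mode."""

    # Hypothetical attribute: each concrete subclass sets its own mode.
    execution_mode = None

    def setUp(self):
        # Stand-in for the real configuration (assumption). In PyFlink the
        # mode would be set on the execution environment instead.
        self.config = {"python.execution-mode": self.execution_mode}

    def test_basic_operations(self):
        # Placeholder for a real DataStream pipeline: double each element.
        actual = [x * 2 for x in [3, 1, 2]]
        expected = [2, 4, 6]
        actual.sort()
        self.assertEqual(expected, actual)


class ProcessDataStreamTests(DataStreamTestsMixin, unittest.TestCase):
    # Runs the shared cases with the process-mode setting.
    execution_mode = "process"


class EmbeddedDataStreamTests(DataStreamTestsMixin, unittest.TestCase):
    # Runs the same shared cases with the thread-mode setting.
    execution_mode = "thread"
```

Because the mixin does not inherit from `unittest.TestCase`, the test runner collects the shared cases only through the two concrete subclasses, so each case runs once per mode.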

  return ((SimpleOperatorFactory<?>) streamOperatorFactory).getOperator()
-         instanceof AbstractDataStreamPythonFunctionOperator;
+         instanceof AbstractExternalDataStreamPythonFunctionOperator
+         || ((SimpleOperatorFactory<?>) streamOperatorFactory).getOperator()
Contributor

instanceof DataStreamPythonFunctionOperator?

  if (PythonConfigUtil.isPythonDataStreamOperator(upTransform)) {
-     final AbstractDataStreamPythonFunctionOperator<?> upOperator =
-             (AbstractDataStreamPythonFunctionOperator<?>)
+     final AbstractExternalDataStreamPythonFunctionOperator<?> upOperator =
Contributor

DataStreamPythonFunctionOperator?

huangxiaofeng10047 pushed a commit to huangxiaofeng10047/flink that referenced this pull request Nov 3, 2022