LocalDagRunner can't run simple Pipeline with FileBasedExampleGen and custom parquet Executor #4561
Comments
Facing the same issue, any news regarding this?
This simple change fixed the issue: use BeamExecutorSpec rather than ExecutorClassSpec, which many examples wrongly use.
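For reference, the change described above might look roughly like the sketch below. The function names, the pipeline name, and the path arguments are illustrative assumptions and are not taken from the original notebook. The TFX imports are placed inside the functions so that TFX is only loaded when a component is actually built.

```python
def build_parquet_example_gen(input_base: str):
    """Build a FileBasedExampleGen that reads parquet files.

    `input_base` is a hypothetical directory containing the parquet file(s).
    """
    from tfx.components import FileBasedExampleGen
    from tfx.components.example_gen.custom_executors import parquet_executor
    from tfx.dsl.components.base import executor_spec

    return FileBasedExampleGen(
        input_base=input_base,
        # The fix: wrap the Beam-based parquet executor in BeamExecutorSpec
        # instead of ExecutorClassSpec.
        custom_executor_spec=executor_spec.BeamExecutorSpec(
            parquet_executor.Executor),
    )


def run_locally(input_base: str, pipeline_root: str, metadata_path: str):
    """Run a one-component pipeline with the LocalDagRunner.

    All three arguments are hypothetical local paths chosen by the caller.
    """
    from tfx.orchestration import metadata, pipeline
    from tfx.orchestration.local.local_dag_runner import LocalDagRunner

    example_gen = build_parquet_example_gen(input_base)
    tfx_pipeline = pipeline.Pipeline(
        pipeline_name="parquet_example_gen",  # illustrative name
        pipeline_root=pipeline_root,
        components=[example_gen],
        metadata_connection_config=(
            metadata.sqlite_metadata_connection_config(metadata_path)),
    )
    LocalDagRunner().run(tfx_pipeline)
```

Calling `run_locally("data/parquet", "pipeline_root", "metadata.db")` with paths of your own would be the equivalent of the notebook's single-component pipeline, assuming TFX is installed.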
We're really sorry for the late response. I have reproduced the same code without error; you have to use [...]. Please use [...]. If the issue still persists, please let us know and we'll look into it and try to resolve it as soon as possible. If your issue is resolved, please feel free to close this issue. Thank you!
Thanks for the feedback @purplechem and @gaikwadrahul8 - switching to the BeamExecutorSpec indeed fixes the issue in the simple example notebook I provided. @gaikwadrahul8, please note @purplechem's reference to "many wrong examples". I wrote my code by following such examples. These wrong examples need to be corrected in the TFX code docstrings, e.g. the "Example Usage" docstring at https://github.com/tensorflow/tfx/blob/master/tfx/components/example_gen/custom_executors/parquet_executor.py#L85, as well as in the documentation at https://www.tensorflow.org/tfx/guide/examplegen#file-based_examplegen_customization_experimental
Thank you for your confirmation and for pointing out the mistakes in our examples. We'll correct them very soon, both in the TFX code docstrings and in the documentation. I really appreciate your effort and valuable time. Could you please confirm whether this issue is resolved for you? Please feel free to close the issue if it is. Thank you!
Describe the current behavior
The LocalDagRunner can't run an extremely simple pipeline using FileBasedExampleGen and the custom parquet executor. It reports the following error:
BaseBeamExecutor found initialized with BaseExecutorSpec. Please use BeamExecutorSpec for Beam Components instead.
Describe the expected behavior
The pipeline should run and produce examples from a parquet file.
Standalone code to reproduce the issue
This simple notebook downloads some data, converts it to a parquet file, builds a pipeline containing only a FileBasedExampleGen component with the custom parquet executor and runs it with the LocalDagRunner.
https://colab.research.google.com/drive/1ByIodrGiOkaj_qta3tMYXgkAhfuD_wlJ?usp=sharing
Thoughts
I believe this error is raised because:
The error is specifically caused by this instance check (https://github.com/tensorflow/tfx/blob/master/tfx/dsl/components/base/base_beam_executor.py#L70) in the BaseBeamExecutor:
a BaseExecutorContext is not a BaseBeamExecutor.Context, so the check fails.
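To illustrate why the check fails, here is a simplified stand-in model of the pattern. These classes are hypothetical and are not the TFX source: they only mimic a Beam executor whose constructor accepts its own Context subclass and rejects a plain base context.

```python
class BaseExecutor:
    """Stand-in for a generic executor (not the real TFX class)."""

    class Context:
        def __init__(self, tmp_dir=None):
            self.tmp_dir = tmp_dir

    def __init__(self, context=None):
        self._context = context


class BaseBeamExecutor(BaseExecutor):
    """Stand-in for a Beam executor that needs a Beam-aware context."""

    class Context(BaseExecutor.Context):
        def __init__(self, beam_pipeline_args=None, tmp_dir=None):
            super().__init__(tmp_dir=tmp_dir)
            self.beam_pipeline_args = beam_pipeline_args or []

    def __init__(self, context=None):
        super().__init__(context)
        self._beam_pipeline_args = None
        if context is not None:
            # The instance check: a plain BaseExecutor.Context is not a
            # BaseBeamExecutor.Context, so it is rejected here.
            if not isinstance(context, BaseBeamExecutor.Context):
                raise ValueError(
                    "BaseBeamExecutor found initialized with BaseExecutorSpec. "
                    "Please use BeamExecutorSpec for Beam Components instead.")
            self._beam_pipeline_args = context.beam_pipeline_args


def try_construct(context):
    """Return 'ok' if the executor accepts the context, else the error text."""
    try:
        BaseBeamExecutor(context)
        return "ok"
    except ValueError as err:
        return str(err)


# A base context (what a non-Beam executor spec would supply) is rejected,
# while a Beam context (what a Beam executor spec would supply) is accepted.
base_result = try_construct(BaseExecutor.Context())
beam_result = try_construct(BaseBeamExecutor.Context(["--runner=DirectRunner"]))
```

In this model the subclass relationship only goes one way: every Beam context is a base context, but not the reverse, which is why the kind of executor spec used decides whether construction succeeds.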