-
Notifications
You must be signed in to change notification settings - Fork 58
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEATURE] Support stream/table joins #177
Comments
👋 @dberardo-com, I will try and answer your questions below.
Yes, you can join streams together in Bytewax. The caveat is that the native Kafka connector ( Today, a dataflow (like this example) can be written with the
Yes, persistent queries, if I understand what you mean, are a good use case for Bytewax and Stateful operators (
If a worker dies and you have recovery enabled, you will be able to restart the workflow and recover the state and it will start at the appropriate offset automatically. If you are using Bytewax on k8s or as a service via (waxctl)[https://www.bytewax.io/docs/deployment/waxctl] you will be able to restart automatically as well. |
Would be great if this could be done more cleanly, e.g. having two different inputs in a flow which can be transformed independently, and then later keyed and joined. Otherwise the whole paradigm is untyped in python and would require messy if blocks 🤔 |
This is now cleanly possible in the latest version of Bytewax https://github.com/bytewax/bytewax/releases/tag/v0.18.0 . It supports having multiple independent input sources and an explicit |
Is it possible to use bytewax for joining content of different kafka topics (similar to what ksqldb is doing) ?
doing this will be an example of integrating "persistent queries" (permanent background processes that never stops). is this a good use case for bytewax?
also comparing to ksqldb, what happens if the bytewax workers are killed? will those persistent queries restart automatically when workers come back up and will they use the latest/earliest committed offset on the kafka topics ? or is the restart manual?
cheers
The text was updated successfully, but these errors were encountered: