Replies: 1 comment
-
We should look at solving that too: #7012 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hey all,
Let's talk streaming
As you may have noticed, we started work on supporting streaming scenarios via:
The next step in our mind is extending sources so they can manage the usual objects required in an end-to-end streaming pipeline. But we don't to limit ourselves to
stream
objects, we want to think about all the external thingies that we had always wanted to manage via dbt. This new type of sources should be valuable outside of streaming scenarios.This discussion may be the most relevant, and I will be parsing it to absorb as much as I can from it:
Though I didn't want to continue the conversation there, as we went to focus here on "storage" objects. dbt is about ELT, not ETL. Our atomic unit is the model, a unit of transformation that results in rows. Compute only objects (tasks/jobs…) typically used in streaming (as in streaming ETLs) are not in scope. Same for UDFs.
Managed sources (?)
Here's a list of traits I can imagine for that new object:
Obviously there is a large common surface area with dbt-external-tables, we will see how we handle that once we have a firmer design in mind.
Not in scope?
Flink has a nice concept of hybrid sources, as in wrapping the definition of historical sources in addition to the live/streaming one, while managing the “lambda” view on top auto-magically. We won’t do that in sources as it involves transformation logic. It’s user land logic that needs to be composed with 2 sources and a model.
While we are looking at sources, we may have an opportunity to tackle other asks in the area. I'm pessimistic about it (some of it just looks like codegen) but we should still do our due diligence:
Next steps
I would like a first version of managed sources to ship with 1.6 (July). I will be doing a bit more research and come back here with a draft of what they could look like in the coming weeks. So if you have a strong opinion on the mater, now is the best time to voice it ;)
Beta Was this translation helpful? Give feedback.
All reactions