KLIP-20: remove TERMINATE #4126

big-andy-coates · 2019-12-12T16:06:21Z

Description

Initial drop of KLIP-20, which proposes to remove the need for the TERMINATE statement.

hjafarpour · 2019-12-16T19:30:07Z

I assume if we have INSERT INTO as discussed here (#4125 (comment)), we should also keep TERMINATE.

rodesai · 2019-12-16T19:45:10Z

design-proposals/klip-20_remove_terminate.md

+
+## What is in scope
+
+* Removing `TERMINATE` from our SQL syntax.


What's the benefit of removing TERMINATE? Can't we just drop the requirement that the query be terminated before the stream/table can be dropped?

hjafarpour · 2019-12-16

We cannot drop a stream/table if a query is writing into it. Otherwise we will have queries that write into zombie streams/tables!

agavra · 2019-12-16

@hjafarpour - I think @rodesai means to automatically discover and terminate the queries writing into it.

big-andy-coates · 2019-12-18

Why not drop TERMINATE? What use is it? A traditional RDBMS doesn't expose the persistent query updating a materialized view to users. It's an implementation detail that shouldn't be exposed IMHO.

mjsax · 2020-01-02

I agree that dropping an MV should imply that the corresponding query is stopped. However, I think it would still be useful to allow users to terminate a persistent query directly. I also commented on KLIP-17, as I think keeping INSERT INTO as-is is useful.

This KLIP seems to relate to the "ownership model" that ksqlDB is lacking (i.e., it's not well defined yet). If we assume that the output topic is not owned by ksqlDB, but a query writes into it, it makes total sense to allow users to just terminate the query.

big-andy-coates · 2020-02-28

> This KLIP seems to relate to the "ownership model" that ksqlDB is lacking

I don't see how it is...

> If we assume that the output topic is not owned by ksqlDB, but a query writes into it, it makes total sense to allow users to just terminate the query.

To be able to have a persistent query that writes to the topic there must be a source registered in KSQL that is backed by the topic. Currently this would either be by a CREATE STREAM or CREATE STREAM AS SELECT statement. Neither of these currently embody any sense of ownership.

Dropping the source created by the CS or CSAS doesn't currently delete the topic, and that wouldn't change with this KLIP.

What this KLIP proposes is that if the user wants to stop the query they would just drop the source. The query is an implementation detail.
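As a rough illustration of the behaviour described above, here is a toy Python model, not ksqlDB code. The `Metastore` class, its method names, and the query id format are all hypothetical; the sketch only assumes what the comment states: dropping a source stops the query that populates it, and the backing topic is left in place.

```python
from dataclasses import dataclass, field


@dataclass
class Metastore:
    """Toy stand-in for the server's catalog; every name here is hypothetical."""
    topics: set = field(default_factory=set)      # Kafka topics (never deleted by this model)
    sources: dict = field(default_factory=dict)   # source name -> backing topic
    queries: dict = field(default_factory=dict)   # query id -> name of the source it writes into

    def create_as_select(self, name: str, topic: str) -> str:
        """Model a C*AS: register the source and start the query that fills it."""
        self.topics.add(topic)
        self.sources[name] = topic
        query_id = f"CSAS_{name.upper()}_{len(self.queries)}"  # illustrative id format
        self.queries[query_id] = name
        return query_id

    def drop(self, name: str) -> list:
        """Model the proposed DROP: stop the writing queries, keep the topic."""
        stopped = [qid for qid, sink in self.queries.items() if sink == name]
        for qid in stopped:
            del self.queries[qid]
        del self.sources[name]
        return stopped


store = Metastore()
query_id = store.create_as_select("orders_enriched", "orders-enriched")
stopped = store.drop("orders_enriched")  # no separate TERMINATE step in this model
```

In this model the user never handles `query_id` directly; it only shows up as the return value of `drop`, which is the sense in which the query is treated as an implementation detail.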

design-proposals/klip-20_remove_terminate.md

purplefox

+1 To a better user experience.

Instead of asking the question "Should we drop terminate", I'd rather ask "If we were designing ksql again now from scratch would we introduce a terminate statement?". To me that's a pretty clear no.

As Andy points out queries are an implementation detail that shouldn't be exposed to users - it just causes unnecessary confusion and damages the user experience.

Even if a separate terminate is useful for some edge cases, as a general rule we should be very cautious of damaging the 99% user experience to make things easier for the 1%.

mjsax · 2020-01-27T19:36:01Z

> Instead of asking the question "Should we drop terminate", I'd rather ask "If we were designing ksql again now from scratch would we introduce a terminate statement?". To me that's a pretty clear no.

Without a clear "ownership model" you cannot really answer this question IMHO. Before we remove something that we might need to reintroduce later, it might be better to keep it until we have an answer.

big-andy-coates · 2020-02-19T18:32:15Z

Not sure how the ownership model fits here. Can you explain @mjsax ?

derekjn · 2020-02-19T21:10:40Z

@big-andy-coates I know this isn't currently possible, but theoretically speaking if we wanted to introduce the ability to terminate a transient query, how would we do that after removing TERMINATE?

mjsax · 2020-02-21T17:00:42Z

@big-andy-coates If the ownership model assumes that, for example, all MVs are owned by ksql, then it makes sense to delete an MV when a query is stopped, and thus we can reverse the logic and say that if the MV is dropped, the query is terminated (hence, no TERMINATE keyword needed). However, if we assume that a TABLE is owned by an external entity, and maybe there are even multiple queries pushing into the TABLE (note, for this case the result of the query would not be an MV; instead we have multiple "update streams" that are all applied/merged into the same result TABLE), then it makes total sense to have a TERMINATE statement that allows stopping queries without deleting the TABLE. However, there is no ownership model atm in ksql that defines what we actually want to support.

mjsax · 2020-02-21T17:03:02Z

@derekjn I am not sure if I can follow? If you have a transient query, it's either issued in the CLI, where one can just hit CTRL+C, or via the new client protocol, where the client can just close the connection, which terminates the query. There is no TERMINATE involved atm and thus removing TERMINATE does not affect transient queries.

derekjn · 2020-02-21T17:10:08Z

> @derekjn I am not sure if I can follow?

@mjsax the use case here would be a cluster admin terminating transient queries independently of the session that they’re running in.

mjsax · 2020-02-21T17:32:52Z

Thanks @derekjn -- that makes a lot of sense. One more reason to keep TERMINATE and to drop this KLIP.

big-andy-coates · 2020-02-28T14:14:46Z

@derekjn

> @big-andy-coates I know this isn't currently possible, but theoretically speaking if we wanted to introduce the ability to terminate a transient query, how would we do that after removing TERMINATE?

Sure, that may be a valid use of TERMINATE. However, we don't have that functionality at the moment and what this KLIP is basically proposing is that the persistent query(s) running to keep a source up-to-date are an implementation detail, and hence should not be exposed to users.

@mjsax

> Thanks @derekjn -- that makes a lot of sense. One more reason to keep TERMINATE and to drop this KLIP.

Ouch! ;). As per my comment to @derekjn above, this KLIP may be badly named: it's mostly about hiding persistent queries as they are an implementation detail.

big-andy-coates · 2020-02-28T14:22:12Z

> @big-andy-coates If the ownership model assumes that, for example, all MVs are owned by ksql, then it makes sense to delete an MV when a query is stopped, and thus we can reverse the logic and say that if the MV is dropped, the query is terminated (hence, no TERMINATE keyword needed). However, if we assume that a TABLE is owned by an external entity, and maybe there are even multiple queries pushing into the TABLE (note, for this case the result of the query would not be an MV; instead we have multiple "update streams" that are all applied/merged into the same result TABLE), then it makes total sense to have a TERMINATE statement that allows stopping queries without deleting the TABLE. However, there is no ownership model atm in ksql that defines what we actually want to support.

I think there's something here that I'm just not understanding. I really don't see how this is at all related to an ownership model.

> However, if we assume that a TABLE is owned by an external entity ... then it makes total sense to have a TERMINATE statement that allows stopping queries without deleting the TABLE

I think you're assuming that dropping the table deletes the topic. That is not the case! Hence, for your example, if the stream/table is owned by another system, but there is a C*AS running in ksql that is writing to it, then there is still no need for TERMINATE. The user can just drop the table/stream in KSQL.

Essentially, TERMINATE leaves a C*AS in a weird zombie state. There is no way to restart the query that was terminated.

To sum up: If we get rid of INSERT INTO then I see no reason to keep TERMINATE. If we choose to keep INSERT INTO then we still need TERMINATE.

My vote is strongly for replacing INSERT INTO with a combination of UNION ALL and ALTER TABLE, and then dropping TERMINATE.

mjsax · 2020-03-10T17:47:35Z

> I think you're assuming that dropping the table deletes the topic.

Yes, obviously (if KSQL owns the TABLE). Otherwise, users need to manually clean up/GC topics, which is a pain to do.

> Dropping the source created by the CS or CSAS doesn't currently delete the topic, and that wouldn't change with this KLIP.

Well, that boils down exactly to the ownership model. If KSQL owns the MV and the MV is deleted, the corresponding topic should be deleted, too. Same for a TABLE that ksql owns. However, if KSQL does not own the TABLE, it should not delete the topic.

Also, if I define a TABLE over a topic that KSQL doesn't own, I may still want to keep this TABLE so that other queries can read from it. Or I may actually want to start a new query that INSERTs INTO this TABLE (I am in favor of keeping INSERT INTO for that reason).

> Essentially, TERMINATE leaves a C*AS in a weird zombie state. There is no way to restart the query that was terminated.

Not sure what you mean by that?

> The query is an implementation detail.

IMHO, this only holds for MVs, but not for TABLEs -- a TABLE is an independent entity and not coupled to any query that might write into it.

guozhangwang · 2020-05-21T22:19:11Z

In CCloud, a topic may be shared among multiple users or be exclusively owned by a single user account. I'm wondering, for ksqlDB in CCloud, do we allow different user accounts to write persistent queries piping into the same shareable topics?

If not, then the ownership is pretty straightforward: the topics created in ksqlDB are always "exclusive" to the user who writes the persistent queries that produce them, and whenever that user wants to drop the topic, all corresponding queries can be dropped together, since they should be owned by that same user.

If yes, then it's a bit tricky: by writing my query to a shareable topic in CCloud, I'm basically creating a query that is not my own, but shared in CCloud (of course, it would still be under the same org account). This means that I basically allow anyone who has authority over the topic to also have authority to terminate that query. That is okay as long as we make it clear in our docs.

spena · 2020-08-05T21:09:26Z

@big-andy-coates Can we implement part of this KLIP, such as terminating the internal C*AS query when issuing DROP STREAM|TABLE, but only if there are no other queries writing to or reading from it? It would improve the current UX a lot by allowing that at least.

FYI @colinhicks @agavra
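The partial rule suggested above can be sketched in a few lines of Python. This is only a model of the rule as worded in the comment, not ksqlDB's implementation; the `Query` type, the `plan_drop` function, and the example query ids are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    query_id: str
    reads: frozenset   # names of the sources the query reads from
    writes: str        # name of the source the query writes into


def plan_drop(source: str, owner_query_id: str, running: list) -> list:
    """Return the query ids to terminate for a DROP, or raise if the drop must be refused.

    The drop is allowed only when the source's own C*AS query (owner_query_id)
    is the sole running query that reads from or writes to the source.
    """
    blockers = [
        q.query_id
        for q in running
        if q.query_id != owner_query_id and (q.writes == source or source in q.reads)
    ]
    if blockers:
        raise ValueError(f"cannot drop {source}: still used by {sorted(blockers)}")
    return [q.query_id for q in running if q.query_id == owner_query_id]
```

With only the owning query running, `plan_drop("A", "CSAS_A_0", [owner])` returns that query's id so it can be stopped as part of the DROP. If another query reads from `A`, or an INSERT INTO writes to it, the function raises instead, which corresponds to the case where the user still has to deal with those queries explicitly.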

chore: initial drop of KLIP-20: remove TERMINATE

0e522a3

big-andy-coates requested a review from a team as a code owner December 12, 2019 16:06

big-andy-coates added this to In progress in Language changes via automation Dec 16, 2019

rodesai reviewed Dec 16, 2019

agavra reviewed Dec 16, 2019

agavra requested a review from a team December 16, 2019 23:46

big-andy-coates changed the title ~~chore: initial drop of KLIP-20: remove TERMINATE~~ KLIP-20: remove TERMINATE Dec 20, 2019

apurvam added the design-proposal label Jan 2, 2020

purplefox approved these changes Jan 22, 2020

Merge branch 'master' into KLIP-20-Remove-Terminate

5c60192

chore: update links

8ac491b

big-andy-coates closed this Jun 30, 2020

big-andy-coates deleted the KLIP-20-Remove-Terminate branch June 30, 2020 16:14

Language changes automation moved this from In progress to Done Jun 30, 2020

big-andy-coates restored the KLIP-20-Remove-Terminate branch June 30, 2020 16:35

big-andy-coates reopened this Jun 30, 2020

Language changes automation moved this from Done to In progress Jun 30, 2020

spena mentioned this pull request Aug 12, 2020

Automatically terminate internal C*AS query on DROP STREAM|TABLE #5987

Closed

big-andy-coates mentioned this pull request Sep 30, 2020

feat: terminate persistent query on DROP command #6143

Merged

big-andy-coates closed this Nov 24, 2021

Language changes automation moved this from In progress to Done Nov 24, 2021

