
Can Redis keyspace notifications be pushed to a Redis stream instead of the pub/sub channel? #5766

Open
jharishabh7 opened this issue Jan 10, 2019 · 25 comments
Assignees
Labels
class:feature state:needs-design the solution is not obvious and some effort should be made to design it streams

Comments

@jharishabh7

We have a requirement to get notified of changes to a Redis data structure. Based on my research, I found that I can use Redis keyspace notifications for this. However, Redis keyspace notifications send events to a Redis pub/sub channel, which is fire-and-forget: once a client loses its connection, all events emitted until the connection is restored are lost.

Redis streams solve this problem. Also, I want to use the consumer group feature of Redis streams. So is there any way that Redis keyspace notifications can be pushed to a Redis stream instead of a Redis pub/sub channel?

I would also like to contribute if this feature does not exist already.
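For readers unfamiliar with the distinction the question rests on, here is a toy in-memory sketch (plain Python, no Redis involved; the class names are invented for illustration) of why a log the client can re-read survives a disconnect while fire-and-forget pub/sub does not:

```python
# Toy model: fire-and-forget pub/sub vs. an append-only log (stream).
# Plain Python stand-ins, not the Redis API.

class PubSub:
    """Delivers events only to currently connected subscribers."""
    def __init__(self):
        self.subscribers = []
    def publish(self, event):
        for inbox in self.subscribers:
            inbox.append(event)

class Stream:
    """Appends every event; consumers read from their own cursor."""
    def __init__(self):
        self.entries = []
    def add(self, event):
        self.entries.append(event)
    def read_from(self, cursor):
        return self.entries[cursor:]

pubsub, stream = PubSub(), Stream()
inbox = []
pubsub.subscribers.append(inbox)    # client connected

pubsub.publish("expired:foo"); stream.add("expired:foo")
pubsub.subscribers.remove(inbox)    # client disconnects...
pubsub.publish("expired:bar"); stream.add("expired:bar")   # ...and misses this

# After reconnecting, the pub/sub client has lost "expired:bar",
# but a stream consumer can resume from its last cursor (here, 1).
print(inbox)                 # ['expired:foo']
print(stream.read_from(1))   # ['expired:bar']
```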

@itamarhaber
Member

@jharishabh7
Author

Thank you @itamarhaber for the confirmation.

@gkorland
Contributor

@jharishabh7 can you describe the use case? Which keyspace notifications would you like to be pushed to streams?

@jharishabh7
Author

jharishabh7 commented Jan 10, 2019

@gkorland - The use case is that we want to get notified when the state of a data structure in Redis changes (in our case, when something has been pushed to a sorted set). So we thought we could use keyspace notifications for this purpose.

But keyspace notifications internally use Redis pub/sub channels, which are fire-and-forget, so if our client disconnects we will lose some notifications. We also want something like a consumer group, where only one of the clients gets each event, since we run multiple instances for scalability.

So we thought it would be nice if keyspace notifications pushed events to a Redis stream instead of a pub/sub channel, since we want the channel to be reliable and scalable.

That is why I was wondering whether this is possible in Redis, and @itamarhaber said it is not possible without building a module for it.

So we are now exploring blocking pops to solve our use case. But I think it would be a nice add-on to Redis functionality.
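To illustrate the consumer-group semantics being asked for (each event delivered to exactly one consumer in the group), here is a minimal in-memory sketch in plain Python. With real Redis this would be XGROUP CREATE plus XREADGROUP; the round-robin assignment below is a simplification, since Redis hands entries to whichever consumer asks first:

```python
from itertools import cycle

# Minimal in-memory sketch of consumer-group delivery: every event
# goes to exactly one consumer in the group (round-robin here).
def dispatch(events, consumers):
    received = {c: [] for c in consumers}
    for consumer, event in zip(cycle(consumers), events):
        received[consumer].append(event)
    return received

events = ["zadd:scores", "zadd:scores", "expired:session:1", "zadd:scores"]
out = dispatch(events, ["worker-a", "worker-b"])
# Each event is delivered exactly once across the whole group,
# unlike pub/sub, where every subscriber would see every event.
```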

@antirez
Contributor

antirez commented Jan 10, 2019

I think we'll end up having such support soon, because it is trivial and it solves a huge problem with lost messages in keyspace notifications...

@jharishabh7
Author

jharishabh7 commented Jan 10, 2019

Thank you so much @antirez . That will be awesome!

@antirez
Contributor

antirez commented Jan 10, 2019

Thanks for validating the usefulness of this. I've been thinking about it for quite some time. Another "internal" use case I have for streams is to publish (not by default, only if enabled) server metrics every second to some stream: memory usage, clients, operations per second, ... Of course the stream will be capped, but it's simple to retain a few days' worth without much memory usage.
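The capped metrics stream idea above can be sketched in plain Python. A `deque` with `maxlen` behaves like XADD with MAXLEN trimming; the field names below are invented for illustration:

```python
from collections import deque

# Sketch of a capped per-second metrics stream. A bounded deque drops
# the oldest samples automatically, like XADD ... MAXLEN ~ N trimming.
MAXLEN = 5   # tiny for the demo; in practice, days' worth of samples

metrics = deque(maxlen=MAXLEN)

# Simulate one sample per second for 8 seconds (field names invented).
for second in range(8):
    metrics.append({"ts": second, "used_memory": 1000 + second, "clients": 3})

# Only the most recent MAXLEN samples are retained.
print(len(metrics))        # 5
print(metrics[0]["ts"])    # 3  (oldest retained sample)
```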

@jharishabh7
Author

Yes, that would be awesome too. Let me know if I can contribute in any way and whether you are open to receiving PRs.

@wang2bo2

> Thanks for validating the usefulness of this. It's quite some time that I think about it. Another "internal" use case I've for streams is to publish (not by default, only if enabled) server metrics every second to some stream. Memory usage, clients, iops-per-sec, ... Of course the stream will be capped but it's simple to retain a few days without much memory usage.

I'm working on a serverless stream consumer dispatcher. It would be very helpful if we could listen to the keyspace in a reliable way. In my case, that would allow users to create arbitrary streams and get consumer groups attached to them as soon as possible.

@maguec

maguec commented Apr 29, 2019

My use case here would be to watch a stream for key expiration events, then use Gears to listen to that stream and remove the corresponding document from a RediSearch index and/or RedisGraph.

@antirez
Contributor

antirez commented May 6, 2019

Just to make you aware: right now the blocker on this is what to do with Cluster. Users cannot specify a single key name in the cluster, since the local instance may not serve the slot that key maps to. Also, do we want unified or non-unified events in that case (per single instance, or a single key on some instance)?

So far I'm thinking about two very different solutions:

  1. Specify a single key name for the cluster as well: nodes will route the events to the single master responsible for that key, which will do all the XADDs. So we have a single unified view.
  2. Allow Cluster to use database "1" (SELECT 1, basically) as a local, non-sharded database. Use this private database for such tasks, that is, to push local events in that case, and in the future for other stuff.

It's a big choice. There is this other possibility:

  3. Don't support this in Cluster for now and bring the feature ASAP to standalone Redis mode.

@itamarhaber
Member

WRT 1 - that would potentially create a very hot master shard - smells off.
WRT 2 - a shard-local DB sounds like a good idea for a lot of things, and I prefer that approach for collecting keyspace notifications (and perhaps other upcoming types of notifications). It also gives "atomicity", and as long as the shard is available, so are its event streams.

Another, perhaps half-baked, possibility:

3.5 Have the cluster connect to another, external, Redis database (single or cluster, proxy or not, HA or not, whatevs) for that purpose. In the case of a single instance, it can use 'local' and perhaps some configurable DB number.

@andrascz

andrascz commented Apr 28, 2020

Is this still on the table? We would like to get __keyevent@0__:expired notifications to gather statistics about certain expired keys (expired sessions). Our first thought was putting this into the related service. Unfortunately, relying on pub/sub does not play nice with multiple running instances of the service. A stream would mean we could use a consumer group, so only one instance would get each expiration event.

@itamarhaber
Member

itamarhaber commented Apr 28, 2020

Hello @andrascz

Is this still on the table?

I think it is still on the table, but no action has been taken in this direction so far. That said, it is possible to achieve this with modules, for example RedisGears. Here's a function that acts as an expiry trigger in a cluster (or standalone) and adds the key names to a stream:

EDIT: this is just a quick PoC snippet to show the principle; the real solution would need to be somewhat more elaborate. TBD
EDIT 2: A full(er) example is at https://oss.redislabs.com/redisgears/1.0/examples.html#keyspace-notification-processing

$ cat /tmp/expired-stream.py
streamkey = 'expired-stream'
gb = GearsBuilder()
# Route every event to the shard that owns the stream's key
gb.repartition(lambda x: streamkey)
# Append the expired key's name to the stream
gb.foreach(lambda x: execute('XADD', streamkey, '*', 'key', x['key']))
# Fire on 'expired' events for keys matching any name
gb.register(prefix='*', eventTypes=['expired'])

$ cat /tmp/expired-stream.py | redis-cli -p 30001 -x  RG.PYEXECUTE
OK
$ redis-cli -c -p 30001
127.0.0.1:30001> KEYS *
(empty array)
127.0.0.1:30001> SETEX foo 1 bar
-> Redirected to slot [12182] located at 127.0.0.1:30003
OK
127.0.0.1:30003> SETEX baz 2 qux
-> Redirected to slot [4813] located at 127.0.0.1:30001
OK
127.0.0.1:30001> keys *
1) "expired-stream"
127.0.0.1:30001> XRANGE expired-stream 0 +
1) 1) "1588081672407-0"
   2) 1) "key"
      2) "foo"
2) 1) "1588081696056-0"
   2) 1) "key"
      2) "baz"
127.0.0.1:30001>

@andrascz

Thank you for the pointer and the example. This is even more powerful than the notification: in our use case, we would push a new element to a Redis list for every expired-key notification whose key matches a prefix. I can accomplish this with RedisGears alone by adding some extra logic to the executed Python code, so there is no need to read a stream from our services.

@ligoo

ligoo commented May 23, 2020

@itamarhaber That's a great solution, thanks for the post. I tried it, but something is going on that I can't understand.
When I listen for messages with something like XREADGROUP GROUP mygroup myconsumer STREAMS expired-stream >, I can see new incoming messages when I simply XADD expired-stream * hello world. But when a key actually expires, with SET toto titi EX 5, the script runs fine, yet the XREADGROUP call does not see the event as a new incoming message until some other client reads the stream with something like XRANGE expired-stream 0 +.
What am I missing?

@itamarhaber
Member

@ligoo rest assured there's little if anything that you're missing :)

That's an effect of a known issue - #6936 - and to work around it (for now), you need to use RG's 'sync' mode. I don't want to turn this into a discussion about the above, but feel free to use the RG repo for that purpose.

@madolson madolson added this to To do in 7.0 via automation Feb 2, 2021
@madolson madolson added state:needs-design the solution is not obvious and some effort should be made to design it and removed state-design-effort-needed state-proposed-feature labels Feb 2, 2021
@madolson madolson removed this from To do in 7.0 Feb 2, 2021
@soloestoy
Collaborator

Here is a draft design:

  1. Add a meta database to Redis, accessed via a special db name, for example SELECT -1.
  2. When a keyspace notification event is triggered, add the event to a stream in the meta database, similar to a special XADD:
    key foo expired -> SELECT -1; XADD __keyevent@0__:expired * key foo
    Then we can use the stream commands to read the meta key.
  3. For cluster, allow SELECT -1 to access the meta database; accessing keys in the meta database doesn't require redirection, which means different nodes can hold the same meta keys, just like notification channels.

The above is easy to implement, but there are still some problems I haven't figured out (they concern replication and persistence; the key question is whether keys in the meta database count as real data or not, since pub/sub notification channels don't):

  1. Do the keys in the meta database need to be persisted (both RDB and AOF)?
    • If not, the notification history is lost after a restart that loads data from disk.
    • If so, the AOF file may grow too fast, since every notify XADD needs a SELECT -1 before it, and the next write command also needs a new SELECT.
    • And if so, the RDB file would contain the meta keys too, which affects full resync; see below.
  2. Do the keys in the meta database need to be replicated to replicas?
    • If not, the notification history is lost after a failover.
    • If so, some scenarios may lead to inconsistency:
      • Write commands replicated to a replica trigger notifications on the replica and add events to the replica's notify stream, so the replica's meta key grows larger than the master's (to fix this, we can disable notifications when the client is the master link).
  3. Do we allow the XREADGROUP command on the meta database of a replica, given that it is a write command? Or, instead of the X* commands, do we implement dedicated commands to access meta keys? Maybe too complex.
  4. Should FLUSHALL flush the meta database or not?

@oranagra
Member

To share one concern:
I feel it would be a bit hackish to use normal commands (SELECT and XREAD) on some hard-coded key name.
I.e., we have a major feature of Redis (keyspace notifications) implicitly creating "keys" and expecting the user to access them with normal commands and special key names (in contrast, for example, to the SLOWLOG, which has its own command).

One alternative is maybe to put this in the hands of the user in some way, i.e. the user explicitly calls some command like KEYSPACE TRACK <key name>; if we do that, the user is the one who redirects it to a specific key.
On the other hand, this poses some issues with cluster.

Anyway, just sharing one more concern; maybe someone will be able to come up with a winning idea.

@oranagra oranagra added this to Backlog in 7.0 via automation May 24, 2021
@oranagra oranagra moved this from Backlog to To Do in 7.0 Jul 5, 2021
@yossigo
Member

yossigo commented Oct 14, 2021

This has been the subject of long sessions involving @oranagra @YaacovHazan @MeirShpilraien @guybe7 @itamarhaber @inbaryuval and others. I'm posting a brief summary of these discussions here for reference.

The bottom line is that this gets more complex when we consider cluster environments and various scenarios, so very strong arguments about the value of this feature are needed to drive any further work.

Rationale - Why do it?

Redis already supports keyspace notifications to make it possible to track changes and operations on the server side.

There are many use cases that can benefit from this mechanism, including:

  • Write through cache scenarios
  • Change Data Capture (where Redis is the source)
  • Other application patterns that are based on event sourcing

The main limitation of the existing keyspace notifications mechanism is that it offers very few guarantees. Notifications are sent once to connected clients and are not stored anywhere, so a client that drops its connection may lose an arbitrary number of notifications.

Reliable KSN (RKSN)

The goal is to define a keyspace notifications mechanism which is reliable. The properties of this solution are:

  • Notifications are not lost, even if not immediately delivered
  • Clients use a client-side cursor mechanism to be able to track new notifications
  • Notifications are delivered in a total order that corresponds to the order of operations on the server

RKSN storage and access

The RKSN properties are identical to what Streams provide, so we considered the option of simply using stream keys, which can be accessed by clients using existing stream commands.

However, using regular stream keys creates a conflict with Redis Cluster as the key name must match the locally assigned hash slots. Redis could override that in the case of a notifications stream key, but this would also impact cluster-aware clients that would need to access the notifications stream key on all nodes regardless of hash slots topology.

Another alternative we considered was using a dedicated pseudo-database and SELECT to store those keys. However, this has been rejected as SELECT and multiple databases are not supported by Redis Cluster.

The final option was to store reliable notifications as special meta-data that lives outside the keyspace. The interface still resembles streams, but there is a dedicated (e.g., KREAD) command to access this data.

Replication and persistence

RKSN data needs to be persisted into RDB, for several reasons:

  • A master may restart and still need to serve past notifications to clients
  • A replica may perform full-sync, get promoted, and need to serve past notifications

The stream of notifications includes additional metadata, such as a timestamp or stream entry ID, which provides more information about the operation and supports a client-side cursor mechanism.

Because of this, notifications must also be propagated locally to AOF and explicitly replicated to replicas. On the replica side, RKSN should be inhibited when processing commands on the replication link.
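The client-side cursor mechanism described above can be sketched in plain Python. Entry IDs are modeled as integers for simplicity; real stream IDs are millisecond-sequence pairs:

```python
# Sketch of a client-side cursor over a reliable notification log.
# IDs are plain integers here; real stream entry IDs are ms-seq pairs.
log = [(1, "set foo"), (2, "expired bar"), (3, "set baz")]

def read_after(log, cursor):
    """Return entries with ID > cursor, plus the advanced cursor."""
    new = [(i, ev) for i, ev in log if i > cursor]
    return new, (new[-1][0] if new else cursor)

cursor = 0
batch, cursor = read_after(log, cursor)    # first read: everything so far

# ...client disconnects, reconnects, and resumes from its saved cursor...
log.append((4, "del foo"))
batch2, cursor = read_after(log, cursor)   # only the entry it hasn't seen
```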

Redis Cluster

Slot-level ordering

In a cluster environment there is no concept of total order between operations that take place on different nodes. Because hash slots may also migrate between nodes, RKSN can effectively guarantee total ordering of notifications only for a specific hash slot.

Because of this, notifications should include an explicit hash slot identifier so clients can easily distinguish between ordered and unordered events.

RKSN Aggregation

Cluster-aware clients need to read RKSN from all cluster nodes, since even for a single key there is no way to guarantee that the full RKSN history is available on a single node.

Clients will be able to aggregate, order, and if necessary de-duplicate notifications received from different nodes.
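The aggregation step can be sketched in plain Python: merge the batches from each node, de-duplicate by (slot, entry ID), and sort by (slot, entry ID), since total order is only guaranteed per slot. The notification shape used here is invented for illustration:

```python
# Sketch of client-side aggregation of RKSN read from multiple nodes.
# Each notification is modeled as (slot, entry_id, event); duplicates
# can appear when a slot's history lives on both source and target
# nodes during migration.
def aggregate(per_node_batches):
    seen, merged = set(), []
    for batch in per_node_batches:
        for slot, entry_id, event in batch:
            if (slot, entry_id) not in seen:     # de-duplicate
                seen.add((slot, entry_id))
                merged.append((slot, entry_id, event))
    # Ordering is only meaningful within a slot, so sort by (slot, id).
    merged.sort(key=lambda n: (n[0], n[1]))
    return merged

node_a = [(12182, 1, "expired foo"), (4813, 2, "set baz")]
node_b = [(4813, 2, "set baz"),       # duplicate seen on a second node
          (4813, 3, "del baz")]
result = aggregate([node_a, node_b])
```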

Migration

The cluster key migration mechanism needs to be enhanced to also support migration of RKSN entries. This operation does not need to be atomic as notifications may live on both source and target nodes during migration, but once a slot has been fully migrated we must guarantee the importing node has the full notification history.

Configuration

RKSN has a more significant impact on resources (memory in particular) and should have several configuration options to control that:

  • Limit on number of stored notifications / memory used / etc.
  • Notify-time filtering similar to the existing notify-keyspace-events configuration
  • Read-time filters in the KREAD command.

Open Issues

FLUSHALL

Should the FLUSHALL command also flush all RKSN, or do we need a dedicated command that explicitly flushes the RKSN history?

Writable Replicas

We currently assume that replicas propagate received notifications and never generate local notifications on commands received on the replication link.

Should writable replicas generate and propagate notifications for commands received from local users?

@oranagra oranagra removed this from To Do in 7.0 Oct 18, 2021
@oranagra oranagra added this to the Next major backlog milestone Oct 18, 2021
@hpatro
Collaborator

hpatro commented Apr 4, 2022

@yossigo Thanks for the detailed explanation. I would like to work on this. I see a few alternatives mentioned above. Is there a recommended approach already, or should I put up a proposal?

@YourTechBud

What's the current status of this? I would like to enable CDC-like use cases for my current Redis cluster.

@oranagra
Member

oranagra commented Feb 4, 2023

The current status is summed up in the last big comment.
There are no plans to support this in the near future.

@hpatro
Collaborator

hpatro commented Feb 9, 2023

@YourTechBud Could you explain your use case a bit more?

Currently, the notifications you would receive on a SET operation are the two below. They don't include the complete data (the value part) needed to build a CDC use case. How do you plan to tackle that?

1) "pmessage"
2) "*"
3) "__keyspace@0__:A"
4) "set"
1) "pmessage"
2) "*"
3) "__keyevent@0__:set"
4) "A"
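One common way around the missing value is the read-on-notify pattern: treat the notification as a hint and fetch the current value yourself. Here is a toy in-memory sketch (with real Redis the fetch would be a GET, and the value may have changed again or expired by the time you read it, which is the pattern's core weakness for CDC):

```python
# Read-on-notify sketch: the notification carries only the key name,
# so the consumer fetches the value itself. Plain Python stand-ins.
store = {}   # stands in for the Redis keyspace

def apply_set(key, value, changelog):
    store[key] = value
    changelog.append(("set", key))   # like __keyevent@0__:set -> key name only

def consume(changelog, sink):
    for event, key in changelog:
        # Fetch the current value; it may already differ from the value
        # that triggered the event (or be gone), which is exactly why
        # plain notifications can't drive CDC by themselves.
        sink.append((event, key, store.get(key)))

changelog, sink = [], []
apply_set("A", "v1", changelog)
apply_set("A", "v2", changelog)   # overwrites before the consumer reads
consume(changelog, sink)
# Both events resolve to the latest value "v2"; "v1" was never observed.
```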

@christiancarpinelli

Any news on this?
