API gateway: cache topic producer for the same gateway #660

nicoloboschi · 2023-10-27T13:32:38Z

Changes:

We use Guava cache to keep producer by gateway. The cache is LRU and every access refresh the ttl.
The max size is configurable (max 100 across all tenants). The cache can be disabled

...tream-api-gateway/src/main/java/ai/langstream/apigateway/gateways/LRUTopicProducerCache.java

eolivelli · 2023-10-27T16:03:08Z

...tream-api-gateway/src/main/java/ai/langstream/apigateway/gateways/LRUTopicProducerCache.java

+        try {
+            final SharedTopicProducer sharedTopicProducer =
+                    cache.get(key, () -> new SharedTopicProducer(topicProducerSupplier.get()));
+            sharedTopicProducer.acquire();


this is a common pitfall

the "acquire" should be performed in the constructor, because between this line and the previous line it is possible that the producer has been released if the count is zero (t is an edge case, but it usually happens).

The tricky thing is that you have to call acquire if there is already an object.

Also the producer must be started only once, so you have to start it when you have created it
and the cache should not return the producer until it is fully initialised (started) with success
and if the start operation fails it must not be added to the cache

the producer is started in during the load in the cache and every producer in the cache is already started.

the acquire must take into condiseration the edge case you mentioned, right. I did it in another manner. When you call acquire, if the producer has been actually closed (removed from cache for example), we retry and that will trigger the constructor to be called

eolivelli · 2023-10-27T16:05:26Z

langstream-api-gateway/src/main/java/ai/langstream/apigateway/gateways/ProduceGateway.java

                topicConnectionsRuntime.createProducer(
                        null, streamingCluster, Map.of("topic", topic));
-        producer.start();
+        topicProducer.start();


this is not to be executed here, see the comments above

eolivelli · 2023-10-27T16:06:12Z

langstream-api-gateway/src/main/java/ai/langstream/apigateway/gateways/TopicProducerCache.java

+public interface TopicProducerCache {
+    record Key(String tenant, String application, String gatewayId) {}
+
+    TopicProducer getOrCreate(Key key, Supplier<TopicProducer> topicProducerSupplier);


this operation may fail, it is better to add a "throws" clause. The risk is to forget possible failures (that will happen for instance if the broker is down)

eolivelli · 2023-10-27T16:07:06Z

...m-api-gateway/src/main/java/ai/langstream/apigateway/gateways/TopicProducerCacheFactory.java

+        if (topicProperties.isProducersCacheEnabled()) {
+            return new LRUTopicProducerCache(topicProperties.getProducersCacheSize());
+        } else {
+            return (key, topicProducerSupplier) -> topicProducerSupplier.get();


This is tricker then expected, see the comments above
the producer that is returned should be already ready to accept writes (started)

eolivelli · 2023-10-27T16:08:26Z

langstream-api-gateway/src/main/resources/application.properties

+
+application.topics.producers-cache-enabled=true
+application.topics.producers-cache-size=100


we should have metrics for this, otherwise it is hard to understand if the system is close to the threshold.
when you hit the threshold we stop caching the system will have very bad performances (latency)

nicoloboschi force-pushed the cache-producers branch from 5f38cd8 to 86f589b Compare October 27, 2023 15:51

nicoloboschi marked this pull request as ready for review October 27, 2023 15:56

nicoloboschi closed this Oct 27, 2023

eolivelli requested changes Oct 27, 2023

View reviewed changes

nicoloboschi reopened this Oct 27, 2023

eolivelli approved these changes Oct 30, 2023

View reviewed changes

nicoloboschi added 2 commits October 30, 2023 12:31

API gateway: cache topic producer for the same gateway

80fc750

fix

b131418

nicoloboschi force-pushed the cache-producers branch from a8e89fc to b131418 Compare October 30, 2023 11:35

nicoloboschi added 3 commits October 30, 2023 17:25

fixes

59feb58

add names

148fdc0

fix

310a0de

nicoloboschi merged commit c2208a2 into main Oct 30, 2023
9 of 10 checks passed

nicoloboschi deleted the cache-producers branch October 30, 2023 17:11

benfrank241 pushed a commit to vectorize-io/langstream that referenced this pull request May 2, 2024

API gateway: cache topic producer for the same gateway (LangStream#660)

0ee6c78

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API gateway: cache topic producer for the same gateway #660

API gateway: cache topic producer for the same gateway #660

nicoloboschi commented Oct 27, 2023 •

edited

Loading

eolivelli Oct 27, 2023

eolivelli Oct 27, 2023

nicoloboschi Oct 30, 2023

nicoloboschi Oct 30, 2023

eolivelli Oct 27, 2023

eolivelli Oct 27, 2023

eolivelli Oct 27, 2023

nicoloboschi Oct 30, 2023

eolivelli Oct 27, 2023


		application.topics.producers-cache-enabled=true
		application.topics.producers-cache-size=100

API gateway: cache topic producer for the same gateway #660

API gateway: cache topic producer for the same gateway #660

Conversation

nicoloboschi commented Oct 27, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nicoloboschi commented Oct 27, 2023 •

edited

Loading