
Add gRPC processor agent #406

Merged: 6 commits merged into main, Sep 16, 2023
Conversation

@cbornet (Member) commented Sep 13, 2023

For preliminary review.
Still need to:

  • Put the agent in its own artifact
  • Build a NAR
  • Resolve TODOs

// Ids assigned to the schemas we have already sent to the server
private final Map<Object, Integer> schemaIds = new ConcurrentHashMap<>();

// Schemas received from the server
private final Map<Integer, Object> serverSchemas = new ConcurrentHashMap<>();
cbornet (Author) commented on this snippet:
An untrusted remote server could OOM us by sending a large number of schemas.
I don't think this is an issue at the moment, since the remote application is owned by the user.
But in the future we may have to put a limit on it.
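If such a limit is ever needed, a minimal sketch could look like the following; the cap value, the helper method and its name are hypothetical and not part of this PR. It only relies on the serverSchemas field shown above.

// Hypothetical cap on the number of schemas cached from the server (not in this PR).
private static final int MAX_SERVER_SCHEMAS = 10_000;

private void cacheServerSchema(int schemaId, Object schema) {
    // Reject new schema ids once the cap is reached instead of growing without bound.
    if (!serverSchemas.containsKey(schemaId) && serverSchemas.size() >= MAX_SERVER_SCHEMAS) {
        throw new IllegalStateException(
                "Too many schemas received from the server (limit " + MAX_SERVER_SCHEMAS + ")");
    }
    serverSchemas.put(schemaId, schema);
}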

} else {
    // TODO: received unknown schema. error ?
    // Do we send an error result or do we fail completely so that the agent is
    // restarted ?
cbornet (Author) commented:

What should we do here?
Ignore it? Send an error for the source record? Fail the producer completely at the next process call?

A reviewer (Member) replied:

fail completely

cbornet (Author):

OK.
Also, I currently restart the request when the server errors, but maybe it would be better to fail completely there too, so that the pod gets fully restarted.

cbornet (Author):

done
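For reference, a rough sketch of the agreed behaviour; schemaId, context and handleRecord are placeholder names, only criticalFailure comes from the AgentContext change in this PR. An unknown schema id fails the whole agent so the runtime restarts it.

Object schema = serverSchemas.get(schemaId);
if (schema != null) {
    // Known schema: process the record normally (placeholder helper).
    handleRecord(schema);
} else {
    // Unknown schema id: fail the whole agent so the runtime restarts it,
    // instead of sending an error result for a single source record.
    context.criticalFailure(
            new IllegalStateException("Received unknown schema id: " + schemaId));
}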

private RecordSink sink;

// For each record sent, we increment the recordId
private final AtomicInteger recordId = new AtomicInteger(0);
A reviewer (Member) commented on this snippet:

We probably need an AtomicLong here; a production system processing data can realistically reach 2 billion messages.

cbornet (Author):

Of course! I use long everywhere for the record id (protocol, caches) except here... Good catch!

cbornet (Author):

Note that it wouldn't have been a real issue, since record ids are transient and forgotten once we get the server response. If they were not removed, I believe we would hit an OOM before ever reaching MAX_VALUE.

cbornet (Author):

done
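The resulting change is small; a sketch, assuming the rest of the protocol already carries the record id as a long:

// Requires java.util.concurrent.atomic.AtomicLong instead of AtomicInteger.
// A long-based counter so the record id cannot wrap around after ~2 billion records.
private final AtomicLong recordId = new AtomicLong(0);

// Each outgoing record gets a fresh id, e.g.:
// long id = recordId.getAndIncrement();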

@eolivelli (Member) left a review comment:

I have left some final feedback.
Overall LGTM

@@ -269,6 +269,13 @@
<scope>provided</scope>
</dependency>

<dependency>
<groupId>${project.groupId}</groupId>
<artifactId>langstream-agent-grpc</artifactId>
A reviewer (Member) commented:

this is not needed here, please remove it

@@ -43,4 +43,9 @@ public interface AgentContext {
    default BadRecordHandler getBadRecordHandler() {
        return DEFAULT_BAD_RECORD_HANDLER;
    }

    default void criticalFailure(Throwable error) {
        System.err.printf("Critical failure: %s. Shutting down the runtime...", error.getMessage());
A reviewer (Member) commented:

For the default implementation we can simply log; this way the tests won't crash the JVM.
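A minimal sketch of such a log-only default, assuming an SLF4J logger is available (the merged code may differ):

default void criticalFailure(Throwable error) {
    // Only log by default so that unit tests do not kill the JVM;
    // runtime implementations can override this to actually shut down.
    org.slf4j.LoggerFactory.getLogger(AgentContext.class)
            .error("Critical failure: {}. Shutting down the runtime...", error.getMessage(), error);
}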

"Critical failure: %s. Shutting down the runtime..."
.formatted(error.getMessage()),
error);
AgentContext.super.criticalFailure(error);
A reviewer (Member) commented:

We can make the behaviour configurable with a static "Consumer" variable, and in AbstractApplicationRunner (which is used in unit tests) we can intercept the call without crashing the JVM.
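A sketch of that idea; the holder class and field names are illustrative, not the merged implementation:

import java.util.function.Consumer;

public final class CriticalFailureHandler {
    private CriticalFailureHandler() {}

    // Default behaviour: print the error and halt the JVM. Tests can swap this consumer out.
    public static volatile Consumer<Throwable> onCriticalFailure =
            error -> {
                error.printStackTrace();
                Runtime.getRuntime().halt(1);
            };
}

// In AbstractApplicationRunner (used by unit tests), intercept instead of crashing:
// CriticalFailureHandler.onCriticalFailure = collectedFailures::add;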

@cbornet cbornet merged commit aa23a06 into main Sep 16, 2023
8 checks passed
@cbornet cbornet deleted the grpc branch September 16, 2023 14:11
benfrank241 pushed a commit to vectorize-io/langstream that referenced this pull request May 2, 2024
* Add gRPC processor agent

* Fail the JVM in case of error from the remote server

* Add a GrpcAgentsProvider in k8s runtime

* Rename to experimental-python-processor

* Remove grpc agent from runtime deps

* By default, don't crash in AgentContext's criticalFailure method