
Robust SSE parsing #336

Closed
wants to merge 3 commits

Conversation

@burke commented Oct 4, 2023

I took @atesgoral's work in #332 and ran with it a bit.

A few things fixed up or adjusted here:

  1. OpenAI always sends "data: [DONE]\n\n" at the end of responses; we can assert that was received to detect aborted connections.
  2. OpenAI never sends "error: ...\n\n" messages: errors are passed inside the JSON object attached to the "data: {...}\n\n".
  3. I'm not aware of OpenAI ever sending malformed JSON, so erroring on that (new behaviour) feels correct to me; see the sketch after this list.
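
To make the new behaviour concrete, here is a minimal sketch of those three rules applied to a complete response body (my illustration, not the PR's actual diff; the method name is hypothetical, and real streaming code would work chunk by chunk):

```ruby
require "json"

# Hypothetical sketch of the three rules above.
def parse_sse(body)
  events = []
  done = false
  body.split("\n\n").each do |event|
    next unless event.start_with?("data: ")
    payload = event.delete_prefix("data: ")
    if payload == "[DONE]"
      done = true # rule 1: [DONE] marks a complete, non-aborted stream
      next
    end
    parsed = JSON.parse(payload) # rule 3: malformed JSON raises JSON::ParserError
    # rule 2: errors arrive inside the data payload, not as "error:" events
    raise parsed["error"]["message"] if parsed["error"]
    events << parsed
  end
  raise "connection aborted before data: [DONE]" unless done
  events
end
```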

I also made a few other changes along the way:

  1. Mostly switched from require_relative to require
  2. Changed up-front nested class/module loading to autoloads; this gives a trivial runtime performance win for consumers and makes it possible to refer to OpenAI::Error in OpenAI::SSE without having to put the requires at the bottom of openai.rb (sketched below).
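
Roughly, the autoload change looks like this (an illustrative sketch of the idea, not the exact diff):

```ruby
# lib/openai.rb -- instead of eagerly requiring every nested file (which
# forces any require that depends on OpenAI::Error to sit at the bottom
# of the file), register autoloads so each constant is loaded lazily on
# first reference, independent of file ordering:
module OpenAI
  autoload :Error, "openai/error"
  autoload :SSE, "openai/sse"
end
```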

Let me know what you think; I'm happy to keep adjusting here.

All Submissions:

  • Have you followed the guidelines in our Contributing document?
  • Have you checked to ensure there aren't other open Pull Requests for the same update/change?
  • Have you added an explanation of what your changes do and why you'd like us to include them?

atesgoral and others added 3 commits September 26, 2023 22:49
@vizakenjack commented Oct 9, 2023

There is an issue that should be fixed (both in the original ruby-openai and in your pull request).

How to reproduce: send a prompt long enough that it won't fit into the model's context. Instead of SSE data, the buffer will contain a plain JSON error message:

buffer = "{\n \"error\": {\n \"message\": \"This model's maximum context length is 4097 tokens. However, you requested 4235 tokens (2235 in the messages, 2000 in the completion). Please reduce the length of the messages or completion.\",\n \"type\": \"invalid_request_error\",\n \"param\": \"messages\",\n \"code\": \"context_length_exceeded\"\n }\n}\n"

I had to fix it in my own fork.
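
For context: when the API rejects a request outright (as with context_length_exceeded above), the response body is a plain JSON error object with no "data: " prefix, so a parser that only scans for "data: " lines never surfaces it. One way to handle that, as a hypothetical sketch (not necessarily the fork's actual fix):

```ruby
require "json"

# Hypothetical sketch: if the body is a bare JSON object rather than a
# stream of SSE "data:" events, treat it as an API error response.
def check_error_body(buffer)
  return unless buffer.lstrip.start_with?("{")

  parsed = begin
    JSON.parse(buffer)
  rescue JSON::ParserError
    return # body not complete yet; keep buffering
  end
  raise "API error: #{parsed.dig("error", "message")}" if parsed["error"]
end
```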

@atesgoral (Contributor)

@vizakenjack That error sounds like just hitting token limits. What was your fix?

@fabioxgn left a comment

@alexrudall can we get this merged? I'm having an issue where the chunks are being split in half and the regex on to_json_stream does not match the partial chunk, so it's ignored.

This branch fixes that issue.

This is what I'm getting in to_json_stream:

chunk
=> "98,\"model\":\"gpt-4-0613\",\"choices\":[{\"index\":0,\"delta\":{\"role\":\"assistant\",\"content\":\"\"},\"finish_reason\":null}]}\n\ndata: {\"id\":\"chatcmpl-8FTgAGg922FFFVvUVdLp8raDbny4j\",\"object\":\"chat.completion.chunk\",\"crea"

This does not match chunk.scan(/(?:data|error): (\{.*\})/i), so the chunk is ignored.
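
For reference, the usual cure for events split across network chunks is to accumulate raw bytes in a buffer and only consume events terminated by a blank line. Something along these lines (my sketch, not the gem's actual v5.2 code):

```ruby
require "json"

# Accumulate raw chunks and only hand off complete SSE events (those
# terminated by "\n\n"); a partial event stays in the buffer until the
# next chunk arrives.
class EventBuffer
  def initialize
    @buffer = +""
  end

  def feed(chunk)
    @buffer << chunk
    while (boundary = @buffer.index("\n\n"))
      event = @buffer.slice!(0, boundary + 2)
      data = event[/^data: (.*)$/, 1]
      next if data.nil? || data == "[DONE]"
      yield JSON.parse(data)
    end
  end
end
```

Feeding split chunks like the one above through such a buffer emits each event only once it is complete, instead of silently dropping the partial pieces.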

@alexrudall (Owner) commented Oct 30, 2023 via email

Hey @fabioxgn, have you tried v5.2? That should fix your issue. (I still probably will merge this.)

@fabioxgn

Hi @alexrudall, yes, that fixed my issue. Thank you!

@alexrudall (Owner)

Big thanks for your work on this @burke - SSE parsing is now much improved/fixed with @atesgoral's PR, so I think this one is not needed. Definitely open to the autoload changes if you want to cherry-pick them into a new PR. Thanks!

@alexrudall closed this Nov 6, 2023