
[Feature]: Azure, OpenAI - Log Response Headers #1724

Closed
ishaan-jaff opened this issue Jan 31, 2024 · 10 comments
Labels
enhancement New feature or request

Comments

@ishaan-jaff
Contributor

The Feature

User request

Motivation, pitch

cc @Manouchehri


ishaan-jaff added the enhancement (New feature or request) label on Jan 31, 2024
@Manouchehri
Collaborator

Probably wouldn't hurt to do for other providers as well. Off the top of my head, I don't think any popular LLM API is returning private/sensitive content in their response headers.

@ishaan-jaff
Contributor Author

Starting with just OpenAI/Azure, because we're going to need to move all OpenAI Python calls from chat.completions.create to chat.completions.with_raw_response.create.
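
For context, a minimal sketch of the two call shapes, assuming openai-python v1.x (the model and message are placeholders):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Plain call: the SDK parses the body and the HTTP headers are not exposed.
completion = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "hello"}],
)

# Raw variant: same endpoint, but the wrapper keeps the underlying httpx
# response, so headers like x-ratelimit-remaining-requests become readable.
raw = client.chat.completions.with_raw_response.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "hello"}],
)
print(raw.headers.get("x-ratelimit-remaining-requests"))
completion = raw.parse()  # the usual ChatCompletion object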

@ishaan-jaff
Contributor Author

Tradeoffs of this move: openai/openai-python#416 (comment). It looks like with_raw_response currently returns an object with only sync methods.

@ishaan-jaff
Contributor Author

Another approach is to use httpx event hooks: https://www.python-httpx.org/advanced/#event-hooks
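
A minimal sketch of that approach (hooks are plain callables on the sync client and must be coroutines on the async one):

import httpx

def log_response_headers(response: httpx.Response) -> None:
    # Runs for every response before it is handed back to the caller.
    print(response.status_code, dict(response.headers))

client = httpx.Client(event_hooks={"response": [log_response_headers]})

async def alog_response_headers(response: httpx.Response) -> None:
    print(response.status_code, dict(response.headers))

aclient = httpx.AsyncClient(event_hooks={"response": [alog_response_headers]})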

@ishaan-jaff
Contributor Author

Useful details here: langchain-ai/langchain#9601

@Manouchehri
Collaborator

Feedback from Langfuse's Marc (when I asked if he thought it was a good idea to also feed upstream traces to a different project):

I'd suggest to add it to the metadata on the same generation in Langfuse though. Splitting the data into two projects makes this a kind of "exotic" setting that not many users will benefit from

I'm inclined to agree with him; it's not a good idea if I'm the only one asking for it.

For logging the request and response metadata, can we do it like this?

https://github.com/orgs/langfuse/discussions/1070#discussioncomment-8369918

{
  "request": {
    "headers": {
      "host": "example.com"
    },
    "url": "https://api.openai.com/v1/chat/completions"
  },
  "response": {
    "headers": {
      "age": "1337"
    },
    "status": 200
  }
}
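
If that shape works, a hypothetical helper along these lines could build it from an httpx.Response (build_trace_metadata is a made-up name, not an existing Langfuse or LiteLLM API):

import httpx

def build_trace_metadata(response: httpx.Response) -> dict:
    # Mirrors the JSON shape above. In a real implementation, strip
    # sensitive headers (e.g. authorization, api-key) before logging.
    request = response.request
    return {
        "request": {
            "headers": dict(request.headers),
            "url": str(request.url),
        },
        "response": {
            "headers": dict(response.headers),
            "status": response.status_code,
        },
    }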

@ishaan-jaff
Contributor Author

Attempt 1 PR: #1873
TL;DR: we can't use chat.completions.with_raw_response.create; it's not async and impacts throughput.

@Manouchehri
Collaborator

Hmm, I noticed that Sentry is able to log the response object/headers. Maybe we can steal/borrow their method of doing it?

@Manouchehri
Collaborator

Manouchehri commented May 16, 2024

This is possible to do with: https://til.simonwillison.net/httpx/openai-log-requests-responses
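
In the spirit of that TIL, one sketch is a custom httpx transport handed to the SDK via its http_client parameter (the LoggingTransport class here is illustrative, not the TIL's code verbatim):

import httpx
from openai import OpenAI

class LoggingTransport(httpx.HTTPTransport):
    # Intercepts every request/response pair at the transport layer,
    # so it sees headers even before the body is read.
    def handle_request(self, request: httpx.Request) -> httpx.Response:
        response = super().handle_request(request)
        print(request.method, request.url, response.status_code)
        print(dict(response.headers))
        return response

client = OpenAI(http_client=httpx.Client(transport=LoggingTransport()))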

@ishaan-jaff
Contributor Author

We do this now.
