Conversation

@Nihhaar0002
Contributor

Fixes #2003

This PR adds token usage exposure for streamed OpenAI responses:

  • Enables stream_options.include_usage when stream: true
  • Allows downstream handlers to surface live usage counts
  • Matches existing Claude streaming usage behavior

The Claude implementation is already complete; this PR finishes OpenAI support.
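
For context, here is a minimal sketch of what enabling stream_options.include_usage looks like against the OpenAI SDK; the client setup, model name, and parameters below are illustrative and not taken from this PR. When the option is set, the API emits one final chunk whose usage field carries the token totals for the whole completion.

import OpenAI from 'openai';

const client = new OpenAI();

const stream = await client.chat.completions.create({
    model: 'gpt-4o-mini',
    messages: [{ role: 'user', content: 'Hello' }],
    stream: true,
    stream_options: { include_usage: true },
});

for await (const chunk of stream) {
    const text = chunk.choices?.[0]?.delta?.content;
    if ( text ) process.stdout.write(text);
    // The usage-bearing chunk arrives last, with an empty choices array.
    if ( chunk.usage ) {
        console.log('usage:', chunk.usage);
        // => { prompt_tokens, completion_tokens, total_tokens }
    }
}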

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
1 out of 2 committers has signed the CLA.

✅ Nihhaar0002
❌ Nihhaar Saini

Nihhaar Saini does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
Already signed the CLA but the status is still pending? Let us recheck it.

Collaborator

@Salazareo left a comment


Need to run tests on this before merging, but it generally looks good to me.

Collaborator

@Salazareo left a comment


Some changes are required here.

I'm also wondering whether this should just be implemented as part of the puterai stream instead of here in the Claude implementation, but it might be too specific to the model.

@ProgrammerIn-wonderland, since you've been mostly working on the AI stuff, what do you think?

const init_chat_stream = async ({ chatStream }) => {
    const completion = await anthropic.messages.stream(sdk_params);
    const usageSum = {};
    const runningUsage = {
Collaborator


Suggested change
const runningUsage = {
const runningUsage = this.usageFormatterUtil({});

This should match the actual Claude usage shape to make it more visible.
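
For reference, this is roughly where usage figures surface in an Anthropic message stream; the request parameters below are illustrative, and the event shapes follow the Messages API (input_tokens on message_start, cumulative output_tokens on message_delta).

import Anthropic from '@anthropic-ai/sdk';

const anthropic = new Anthropic();

const stream = anthropic.messages.stream({
    model: 'claude-3-5-sonnet-latest',
    max_tokens: 1024,
    messages: [{ role: 'user', content: 'Hello' }],
});

stream.on('streamEvent', (event) => {
    if ( event.type === 'message_start' ) {
        // Input token count is known up front.
        console.log('input_tokens:', event.message.usage.input_tokens);
    } else if ( event.type === 'message_delta' ) {
        // Output token count is cumulative for the response so far.
        console.log('output_tokens:', event.usage.output_tokens);
    }
});

await stream.finalMessage();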


// Each emitted content block now carries an incremental usage object
// ({ input_tokens, output_tokens, total_tokens }) for live metering.
const getUsage = () => ({
Collaborator


Can just spread the data to copy it:

{ ...runningUsage }

const payload = {
    type: 'text',
    text,
    usage: getUsage(),
Collaborator


Suggested change
usage: getUsage(),
usage: { ...runningUsage },

input: JSON.parse(buffer),
...(block.contentBlock?.text ? {} : { text: '' }),
type: 'tool_use',
usage: getUsage(),
Collaborator


Suggested change
usage: getUsage(),
usage: { ...runningUsage },

...block.contentBlock,
input: JSON.parse(buffer),
...(block.contentBlock?.text ? {} : { text: '' }),
type: 'tool_use',
Collaborator


This needs to go at the top of the block, since the stream's block.contentBlock might override it, and to match the existing method.
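
For clarity, the ordering matters because later entries in an object literal override earlier ones, so a spread placed after an explicit field will clobber it. A tiny illustration with made-up values:

const block = { contentBlock: { type: 'text', text: 'partial' } };

// Spread first: the explicit field below it wins.
const a = { ...block.contentBlock, type: 'tool_use' };
// a.type === 'tool_use'

// Spread last: it overrides the field set above it.
const b = { type: 'tool_use', ...block.contentBlock };
// b.type === 'text'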

if ( ! usageSum[key] ) usageSum[key] = 0;
usageSum[key] += meteredData[key];
});
runningUsage.input_tokens += meteredData.input_tokens || 0;
Collaborator


Suggested change
runningUsage.input_tokens += meteredData.input_tokens || 0;
for ( const usageType in runningUsage ) {
    runningUsage[usageType] += meteredData[usageType] || 0;
}
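
For context, a rough sketch of what this loop accumulates, assuming meteredData events carry partial usage deltas; the event list and values below are made up:

const runningUsage = { input_tokens: 0, output_tokens: 0 };

// Hypothetical deltas: input tokens reported once, output tokens incrementally.
const events = [
    { input_tokens: 42 },
    { output_tokens: 7 },
    { output_tokens: 5 },
];

for ( const meteredData of events ) {
    for ( const usageType in runningUsage ) {
        runningUsage[usageType] += meteredData[usageType] || 0;
    }
}

// runningUsage => { input_tokens: 42, output_tokens: 12 }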



Development

Successfully merging this pull request may close these issues.

Expose usage counts when streaming ai responses
