fix(llma): extract pydantic-ai tool calls from output parts by shauryapednekar · Pull Request #59755 · PostHog/posthog

shauryapednekar · 2026-05-23T00:11:03Z

Problem

Pydantic AI's current OpenTelemetry instrumentation stores generation output in gen_ai.output.messages, which PostHog maps to $ai_output_choices. Tool calls in that output are represented as message parts with type: "tool_call" and a top-level name.

PostHog already extracts tool calls from several provider shapes, but not this Pydantic AI message-parts shape, so affected $ai_generation events do not get $ai_tools_called or $ai_tool_call_count populated.

Pydantic AI instrumentation docs: https://pydantic.dev/docs/ai/api/models/instrumented/

Changes

Teach AI tool-call extraction to read parts[] arrays on output choices and message wrappers.
Extract part.name when part.type === "tool_call".
Reuse existing sanitization, ordering, duplicate preservation, and per-event cap behavior.
Add tests for direct extraction, malformed parts, stringified JSON, and the full OTel ingestion path.

How did you test this code?

I tested this locally in Docker.

pnpm --filter=@posthog/nodejs format:check
pnpm --filter=@posthog/nodejs lint
pnpm --filter=@posthog/nodejs build

All passed.

pnpm exec jest --runInBand --forceExit \
  src/ingestion/ai/tools/extract-tool-calls.test.ts \
  src/ingestion/ai/otel/attribute-mapping.test.ts \
  src/ingestion/ai/process-ai-event.test.ts

Result: 3 suites passed, 288 tests passed.

git diff --check

Passed.

Publish to changelog?

Do not publish to changelog.

Docs update

No docs update. This is a backend ingestion compatibility fix for an already documented tools normalization field.

🤖 Agent context

I used Codex to help me with this PR.

Codex helped inspect the existing tool-call extractor patterns, implement the narrow Pydantic AI parts[].type === "tool_call" extraction path, and add focused tests.

The implementation stays local to the existing extractor and reuses the current sanitizer, ordering, duplicate preservation, and count behavior. The added tests cover wrapped and unwrapped output shapes, malformed parts, mixed text/tool parts, stringified JSON, and the full OTel ingestion path.

Teach tool-call extraction to read Pydantic AI OTel output message parts. Pydantic AI emits assistant tool calls as parts with type="tool_call" and a top-level name; normalize those into the existing $ai_tools_called and $ai_tool_call_count fields. Add extractor coverage for single, multiple, mixed, wrapped, malformed, and stringified JSON shapes, plus a processAiEvent OTel ingestion test.

greptile-apps · 2026-05-23T00:13:39Z

Prompt To Fix All With AI

Fix the following 1 code review issue. Work through them one at a time, proposing concise fixes.

---

### Issue 1 of 1
nodejs/src/ingestion/ai/tools/extract-tool-calls.test.ts:681-709
This is the only new test in `processAiToolCallExtraction` that isn't parameterised. Given that the team always prefers parameterised tests, this test (along with the existing non-parameterised neighbours) could be rolled into an `it.each` table in the same style used for `extractToolCallNames` above, making it easy to add further provider shapes later.

```suggestion
    it.each([
        [
            'Pydantic AI OTel tool_call parts from stringified JSON',
            JSON.stringify([
                {
                    role: 'assistant',
                    parts: [
                        { type: 'text', content: 'Let me check.' },
                        { type: 'tool_call', id: 'call_abc', name: 'get_weather', arguments: '{"city":"NYC"}' },
                        { type: 'tool_call', id: 'call_def', name: 'search_docs', arguments: '{"q":"weather"}' },
                    ],
                },
            ]),
            'get_weather,search_docs',
            2,
        ],
    ])('%s', (_description, outputChoices, expectedToolsCalled, expectedCount) => {
        const event = createEvent('$ai_generation', { $ai_output_choices: outputChoices })
        const result = processAiToolCallExtraction(event)
        expect(result.properties!['$ai_tools_called']).toBe(expectedToolsCalled)
        expect(result.properties!['$ai_tool_call_count']).toBe(expectedCount)
    })
```

_{Reviews (1): Last reviewed commit: "fix(llma): extract pydantic-ai tool call..." | Re-trigger Greptile}

Address review feedback by converting the Pydantic AI processAiToolCallExtraction case to an it.each table.

carlos-marchal-ph

Looks good to me, thanks for the contribution! I'm gonna go ahead with the test change I suggested and then merge it.

carlos-marchal-ph · 2026-05-27T08:22:57Z

+    it.each([
+        [
+            'Pydantic AI OTel tool_call parts from stringified JSON',
+            JSON.stringify([
+                {
+                    role: 'assistant',
+                    parts: [
+                        { type: 'text', content: 'Let me check.' },
+                        {
+                            type: 'tool_call',
+                            id: 'call_abc',
+                            name: 'get_weather',
+                            arguments: '{"city":"NYC"}',
+                        },
+                        {
+                            type: 'tool_call',
+                            id: 'call_def',
+                            name: 'search_docs',
+                            arguments: '{"q":"weather"}',
+                        },
+                    ],
+                },
+            ]),
+            'get_weather,search_docs',
+            2,
+        ],


No need to make this a tabular test since it has a single input

deployment-status-posthog · 2026-05-27T10:09:15Z

Deploy status

Environment	Status	Deployed At	Workflow
dev	✅ Deployed	2026-05-27 10:09 UTC	Run
prod-us	✅ Deployed	2026-05-27 10:20 UTC	Run
prod-eu	✅ Deployed	2026-05-27 10:24 UTC	Run

assign-reviewers-posthog Bot requested review from a team May 23, 2026 00:11

greptile-apps Bot reviewed May 23, 2026

View reviewed changes

Comment thread nodejs/src/ingestion/ai/tools/extract-tool-calls.test.ts

test(llma): parameterize pydantic-ai tool call test

a2c49b8

Address review feedback by converting the Pydantic AI processAiToolCallExtraction case to an it.each table.

Radu-Raicea requested a review from carlos-marchal-ph May 26, 2026 20:17

carlos-marchal-ph reviewed May 27, 2026

View reviewed changes

test: drop it.each for single-case pydantic ai tool call test

b2c5cce

carlos-marchal-ph approved these changes May 27, 2026

View reviewed changes

carlos-marchal-ph merged commit 42a7cf3 into PostHog:master May 27, 2026
143 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(llma): extract pydantic-ai tool calls from output parts#59755

fix(llma): extract pydantic-ai tool calls from output parts#59755
carlos-marchal-ph merged 3 commits into
PostHog:masterfrom
shauryapednekar:shauryapednekar/fix-llma-pydantic-ai-tool-calls

shauryapednekar commented May 23, 2026

Uh oh!

greptile-apps Bot commented May 23, 2026

Uh oh!

Uh oh!

carlos-marchal-ph left a comment

Uh oh!

carlos-marchal-ph May 27, 2026

Uh oh!

Uh oh!

deployment-status-posthog Bot commented May 27, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

shauryapednekar commented May 23, 2026

Problem

Changes

How did you test this code?

Publish to changelog?

Docs update

🤖 Agent context

Uh oh!

greptile-apps Bot commented May 23, 2026

Uh oh!

Uh oh!

carlos-marchal-ph left a comment

Choose a reason for hiding this comment

Uh oh!

carlos-marchal-ph May 27, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

deployment-status-posthog Bot commented May 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploy status

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

deployment-status-posthog Bot commented May 27, 2026 •

edited

Loading