Better thinking (and more generally, logging) support for the llm backend by yiyunliu · Pull Request #632 · BasisResearch/effectful

yiyunliu · 2026-04-09T15:47:41Z

This PR aims to address #605 by adding a general LoggingHandler that lets users more easily retrieve thinking traces and other diagnostic information returned by the completion API. Users implement a LoggingListener, which amounts to registering callback functions that are called upon entering or exiting tool calls, template calls, and completions.
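To make the listener idea concrete, here is a minimal plain-Python sketch. It does not use effectful's actual API: `LoggingListener` here is a stand-in with hypothetical `on_enter`/`on_exit` method names, and `call_with_listener` is an illustrative helper that plays the role of the handler.

```python
from typing import Any, Callable


class LoggingListener:
    """Hypothetical listener interface: hooks that observe a call, nothing more."""

    def on_enter(self, kind: str, name: str, args: dict[str, Any]) -> None:
        pass

    def on_exit(self, kind: str, name: str, result: Any) -> None:
        pass


class RecordingListener(LoggingListener):
    """Example listener that records every enter/exit event in order."""

    def __init__(self) -> None:
        self.events: list[tuple[str, str, str]] = []

    def on_enter(self, kind: str, name: str, args: dict[str, Any]) -> None:
        self.events.append(("enter", kind, name))

    def on_exit(self, kind: str, name: str, result: Any) -> None:
        self.events.append(("exit", kind, name))


def call_with_listener(
    listener: LoggingListener,
    kind: str,
    name: str,
    fn: Callable[..., Any],
    **kwargs: Any,
) -> Any:
    """Illustrative wrapper: it owns the control flow, the listener only observes."""
    listener.on_enter(kind, name, kwargs)
    result = fn(**kwargs)
    listener.on_exit(kind, name, result)
    return result


listener = RecordingListener()
total = call_with_listener(listener, "tool", "add", lambda a, b: a + b, a=1, b=2)
```

Because the wrapper always calls `fn` and returns its result, a listener cannot skip or replace the underlying call; it can only record what happened.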

An intuitive way of getting the thinking trace is to simply override the completion operator, but the overridden operator lacks the context in which it was called. The CallStackListener addresses that problem by keeping track of a stack of template and tool calls, so the logging function has the full context in which the completion call was made.
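The call-stack idea can be sketched in plain Python as follows. The `Frame` and `CallStack` names, the `entered` context manager, and `fake_completion` are all hypothetical and are not the PR's CallStackListener; the `extra` dict mirrors the extra field mentioned in the commit list.

```python
from contextlib import contextmanager
from dataclasses import dataclass, field
from typing import Any, Iterator


@dataclass
class Frame:
    kind: str  # "template" or "tool"
    name: str
    extra: dict[str, Any] = field(default_factory=dict)  # free-form metadata


class CallStack:
    """Tracks which template/tool calls are currently active."""

    def __init__(self) -> None:
        self.frames: list[Frame] = []

    @contextmanager
    def entered(self, kind: str, name: str) -> Iterator[Frame]:
        frame = Frame(kind, name)
        self.frames.append(frame)
        try:
            yield frame
        finally:
            # Pop even if the wrapped call raises, so the stack stays balanced.
            self.frames.pop()

    def path(self) -> str:
        return " > ".join(f"{f.kind}:{f.name}" for f in self.frames)


stack = CallStack()
seen: list[str] = []


def fake_completion() -> str:
    # A logging hook placed here can see where the completion was issued from.
    seen.append(stack.path())
    return "ok"


with stack.entered("template", "summarize"):
    with stack.entered("tool", "search"):
        fake_completion()
```

After the nested calls, `seen` holds the path that was active when the completion ran, and the stack is empty again.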

The thinking functionality is implemented outside the completions.py file, in obs-example. Both files in that directory implement the same functionality: one relies on Python's cooperative multiple inheritance to combine different logging functionalities, whereas the other does the same by composing multiple handlers.
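The difference between the two styles can be illustrated with a plain-Python stand-in. None of the class or function names below come from obs-example; they are hypothetical, and the "handlers" are ordinary wrapper functions rather than effectful handlers.

```python
class BaseCompletion:
    def completion(self, prompt: str, log: list[str]) -> str:
        return f"reply to {prompt}"


# Style 1: cooperative multiple inheritance. Each mixin calls super() so the
# next class in the method resolution order also gets a turn.
class LogPrompt(BaseCompletion):
    def completion(self, prompt: str, log: list[str]) -> str:
        log.append(f"prompt={prompt}")
        return super().completion(prompt, log)


class LogReply(BaseCompletion):
    def completion(self, prompt: str, log: list[str]) -> str:
        reply = super().completion(prompt, log)
        log.append(f"reply={reply}")
        return reply


class Combined(LogPrompt, LogReply):
    pass


# Style 2: composition. Each wrapper adds one behaviour around an inner callable.
def with_prompt_log(inner):
    def wrapped(prompt: str, log: list[str]) -> str:
        log.append(f"prompt={prompt}")
        return inner(prompt, log)
    return wrapped


def with_reply_log(inner):
    def wrapped(prompt: str, log: list[str]) -> str:
        reply = inner(prompt, log)
        log.append(f"reply={reply}")
        return reply
    return wrapped


inherited_log: list[str] = []
Combined().completion("hi", inherited_log)

composed_log: list[str] = []
composed = with_prompt_log(with_reply_log(BaseCompletion().completion))
composed("hi", composed_log)
```

Both variants produce the same log. In the inheritance version the ordering is determined by the class's method resolution order, while in the composed version it is visible in the nesting of the wrappers.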

The way the thinking trace is logged is very ad hoc, and I'm not sure it really belongs in the effectful library itself. I wonder if it would be enough to just provide the logging infrastructure and supplement it with more documentation.

I'm also not quite sure about the API design aspect of the problem. In particular, is separating out the listener class even necessary when one can just craft new logging functionality by overriding the operators directly? The listener API does help in the sense that it hardwires some logic, so the control flow can't be modified in unexpected ways when all you want is logging.

I'd like to have some discussion about what the ideal API should look like. Functionality-wise, I think the PR is quite complete. In the meantime, I'll stress test the thinking functionality by porting https://github.com/BasisResearch/MARA/tree/yz-pareto-code/MARA/domains/autumnbench/pareto/paretoviz to use effectful.

This issue isn't really caused but rather exposed by the PR. The completion operator shouldn't copy litellm.completion's type signature.

eb8680 · 2026-04-10T19:35:14Z

This is a good start, but I think a lot of the implementation is unnecessary, including maintaining a coarse approximation of the call stack. We don't want to be doing any of this stuff ourselves if we can avoid it.

The simplest thing to do is use one of the observability integrations shipped with litellm, or a related library like weave.

Another alternative would be to use litellm's custom callback API together with the logging module in the standard library. logging can inject stack trace metadata into its messages automatically, and can send them off to any number of destinations.
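A rough sketch of that second alternative, using only the standard library so it runs without litellm installed. The callback takes `(kwargs, completion_response, start_time, end_time)`, which is the shape litellm documents for custom callback functions; the logger name, the `reasoning_content` lookup on a plain dict, and the fake response are assumptions made for illustration, and a real litellm response object may need attribute access instead.

```python
import io
import logging

LOGGER_NAME = "llm.completions"  # hypothetical logger name


def make_logger(stream) -> logging.Logger:
    logger = logging.getLogger(LOGGER_NAME)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()
    handler = logging.StreamHandler(stream)
    # %(funcName)s is filled in by logging from the caller's frame.
    handler.setFormatter(logging.Formatter("%(levelname)s %(funcName)s: %(message)s"))
    logger.addHandler(handler)
    logger.propagate = False
    return logger


def log_completion(kwargs, completion_response, start_time, end_time):
    """Success callback: log the model name and any reasoning trace."""
    message = completion_response["choices"][0]["message"]
    reasoning = message.get("reasoning_content")  # assumed field; may be absent
    logging.getLogger(LOGGER_NAME).info(
        "model=%s reasoning=%r",
        kwargs.get("model"),
        reasoning,
        stack_info=True,  # logging appends the current Python stack to the record
    )


# With litellm installed, registration would look roughly like this
# (not executed here):
#     import litellm
#     litellm.success_callback = [log_completion]

buffer = io.StringIO()
make_logger(buffer)
fake_response = {
    "choices": [{"message": {"content": "4", "reasoning_content": "2 + 2 = 4"}}]
}
log_completion({"model": "fake-model"}, fake_response, 0.0, 1.0)
output = buffer.getvalue()
```

Swapping the `StreamHandler` for a `FileHandler` or another handler from `logging.handlers` changes the destination without touching the callback.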

More broadly, there are a few distinct use cases here that don't necessarily share the same backend solutions. We should pick one of those concrete use cases and work backward to an implementation.

yiyunliu force-pushed the obs branch from 24bd2e2 to 83b72cb Compare April 9, 2026 15:55

yiyunliu added 15 commits April 10, 2026 12:58

add obs handler

355d5f0

Add demo completion logger

d7fa4d8

fix bugs in the obs handler implementation

e919ce0

Add some test cases

9c9ab24

add terrible demo that uses multiple inheritance

c14f1b8

delete unused variable

304bff4

add everything to completions.py

59f746f

extend the call stack listener to include an extra dict field

9b9fc61

pulling out the listeners into the test file

b20ef70

remove claude.md from version control

a6d8604

create a temporary folder with examples

bcf1b03

remove redundant import

944cc48

linter pass

d59854d

rename from observability to logging

7d4e6eb

fix ci

e24d135

yiyunliu force-pushed the obs branch from 10bcfb0 to e24d135 Compare April 10, 2026 16:59

kiranandcode requested review from eb8680 and jfeser April 10, 2026 19:30

This was referenced Apr 13, 2026

lazy __signature__ property doesn't work well with typing.TYPE_CHECKING #636

Open

add weave and langfuse observability support #638

Open

yiyunliu wants to merge 15 commits into BasisResearch:master from yiyunliu:obs
