
feature(wren-ai-service): integrate Langfuse SDK to represent the evaluation result#395

Merged
paopa merged 8 commits into epic/ai-service/evaluation-framework-v1 from feature/integrate-langfuse-inot-pipes
Jun 12, 2024

Conversation

@paopa
Contributor

@paopa paopa commented Jun 11, 2024

This PR integrates the decorator-based Langfuse SDK to collect data for the evaluation process. Langfuse is disabled by default. To enable it, follow these steps:

# .env.dev file
# Langfuse configuration
LANGFUSE_ENABLE=True
LANGFUSE_SECRET_KEY=
LANGFUSE_PUBLIC_KEY=
LANGFUSE_HOST=

Fill in the environment variables. You can obtain the secret key, public key, and host by following the instructions in the Langfuse documentation.
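As a minimal sketch of how the enable flag could be checked (the `langfuse_enabled` helper is hypothetical; the actual service may parse the flag differently):

```python
import os

# Hypothetical helper: treat LANGFUSE_ENABLE as a boolean-ish string.
# The real wren-ai-service may read this flag through its own config layer.
def langfuse_enabled() -> bool:
    return os.getenv("LANGFUSE_ENABLE", "false").strip().lower() in ("true", "1")

os.environ["LANGFUSE_ENABLE"] = "True"
print(langfuse_enabled())  # True
```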

Currently, we don't set the session ID. When implementing the main process for the evaluation, refer to the following code to set up the session ID.

import asyncio
import uuid

from langfuse.decorators import langfuse_context, observe


@observe()
async def main(user_id: str):
    # Attach a user ID and a unique session ID to the current trace
    langfuse_context.update_current_trace(
        user_id=user_id,
        session_id=f"{user_id}_{uuid.uuid4()}"
    )
    return await story()  # story() is a placeholder for the traced workload


async def run():
    await asyncio.gather(main("foo"), main("bar"))
    # Make sure buffered events are sent before the process exits
    langfuse_context.flush()


asyncio.run(run())

Screenshots

(five screenshots of the resulting Langfuse traces; images not included)

@paopa paopa marked this pull request as ready for review June 11, 2024 11:07
@paopa paopa requested a review from cyyeh June 11, 2024 11:07
@cyyeh cyyeh changed the base branch from main to epic/ai-service/evaluation-framework-v1 June 11, 2024 15:05
@cyyeh cyyeh changed the base branch from epic/ai-service/evaluation-framework-v1 to main June 12, 2024 01:23
@cyyeh cyyeh changed the base branch from main to epic/ai-service/evaluation-framework-v1 June 12, 2024 01:23
Member

@cyyeh cyyeh left a comment

Should we call langfuse_context.flush() during the server shutdown event in the lifespan function in wren-ai-service/src/__main__.py?

Also, each pipeline calls langfuse_context.flush() at the end; I wonder what would happen there if Langfuse is disabled?
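A rough sketch of the shutdown-flush idea, with a stub standing in for langfuse_context.flush (the real lifespan hook in __main__.py would call the SDK directly):

```python
import asyncio
from contextlib import asynccontextmanager

# Sketch: flush once at server shutdown instead of in every pipeline.
# `flush_fn` stands in for langfuse_context.flush; `app` is unused here.
def make_lifespan(flush_fn):
    @asynccontextmanager
    async def lifespan(app):
        yield        # the server handles requests here
        flush_fn()   # shutdown: send any buffered traces
    return lifespan

calls = []
lifespan = make_lifespan(lambda: calls.append("flushed"))

async def demo():
    async with lifespan(app=None):
        pass

asyncio.run(demo())
print(calls)  # ['flushed']
```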


I suppose we also need to add the Langfuse-related env variables to .env.prod.example, wren-ai-service/docker/docker-compose.yml, docker/.env.example, docker/docker-compose-dev.yaml, and docker/docker-compose.yaml?


Maybe we can also set up this environment variable in the future: LANGFUSE_DEBUG?


This can be added in the initialization part: an authentication check.


We should manually add input/output data if needed: https://langfuse.com/docs/sdk/python/decorators#large-inputoutput-data
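For illustration, a hypothetical trimming helper that could be passed to langfuse_context.update_current_observation (the SDK call itself is shown commented out, since it needs a configured client):

```python
# Hypothetical helper for trimming large payloads before reporting them
# manually, as the linked Langfuse docs suggest.
def truncate(value: str, limit: int = 1000) -> str:
    return value if len(value) <= limit else value[:limit] + "...[truncated]"

# With the decorator SDK this might look like (not executed here):
# @observe(capture_output=False)
# def generate(prompt):
#     result = run_llm(prompt)
#     langfuse_context.update_current_observation(output=truncate(result))
#     return result

print(truncate("short"))  # short
```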

@cyyeh
Member

cyyeh commented Jun 12, 2024

Overall LGTM. I would like to discuss the capture_input part with you.

@paopa paopa requested a review from cyyeh June 12, 2024 09:08
@paopa
Contributor Author

paopa commented Jun 12, 2024

After a discussion with @cyyeh, we decided not to capture the span input for all steps. Other suggestions, such as the Langfuse debug mode and authentication check, are interesting features. However, we will not include them in this PR as it is focused on the evaluation framework.
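To illustrate the shape of that decision, here is a stub decorator mimicking the SDK's observe(capture_input=...) flag; the real SDK handles this internally and records to Langfuse rather than to a local list:

```python
from functools import wraps

# Stub mimicking observe(capture_input=...), only to show the shape of
# the decision above; it records locally instead of sending a trace.
recorded = []

def observe(capture_input=True):
    def deco(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            recorded.append(args if capture_input else None)
            return fn(*args, **kwargs)
        return wrapper
    return deco

@observe(capture_input=False)  # span input is not captured for this step
def embed(text: str) -> int:
    return len(text)

print(embed("hello"), recorded)  # 5 [None]
```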

Member

@cyyeh cyyeh left a comment


LGTM

@paopa paopa merged commit 3a34b2f into epic/ai-service/evaluation-framework-v1 Jun 12, 2024
@paopa paopa deleted the feature/integrate-langfuse-inot-pipes branch June 12, 2024 09:21
cyyeh pushed a commit that referenced this pull request Jun 12, 2024
…luation result (#395)

* fix: phony demo folder (#394)

* chore: upgrade the libs

* chore: environment variables for Langfuse

* feature: langfuse configuration initialize

* feature: integrate the langfuse into pipelines

* feature: don't capture the input and embedding output

* feature: decorator on the concept pipelines

* chore: add env variables into docker compose file
cyyeh pushed a commit that referenced this pull request Jun 12, 2024
paopa added a commit that referenced this pull request Jul 9, 2024
…luation result (#395) (#495)

* feature(wren-ai-service): integrate Langfuse SDK to represent the evaluation result (#395)

* chore: modify the import operation to the suitable place
onlyjackfrost pushed a commit that referenced this pull request Jul 11, 2024
…luation result (#395) (#495)

* feature(wren-ai-service): integrate Langfuse SDK to represent the evaluation result (#395)

* chore: modify the import operation to the suitable place