Chat Generatol Protocol POC #9037

anakin87 · 2025-03-14T10:36:17Z

As discussed in deepset-ai/haystack-experimental#218, I propose introducing a ChatGenerator Protocol to qualify ChatGenerator components from a static type-checking perspective.

The current PR outlines a POC demonstrating how this could work and the benefits it provides.

Why

Chat Generators are central components in Haystack. They are used directly and often incorporated into more complex components like LLMMetadataExtractor, Agent, and Summarizer. Even LLMEvaluator could fit, though it currently uses a Generator.
Using them inside other components has been handled in various ways. For example, see the LLMMetadataExtractor implementation. This often results in several lazy imports, support for limited set of Chat Generators, inconsistent user experience.
Introducing a shallow abstraction to qualify Chat Generators would improve flexibility, reduce code duplication and standardize the integration in other components.

Pros

Introducing a Protocol for Chat Generator would give us static type checking (mypy/pyright), also providing better guidance for users.
This could simplify existing components and introduce a consistent way of defining components based on Chat Generators.

No Runtime Checking

While the Protocol ensures static type safety, it does not provide strong runtime guarantees.

If we mark the protocol as @runtime_checkable, runtime checks would only verify method existence—not their signatures (see mypy docs).
For this reason, I propose avoiding @runtime_checkable entirely and handling runtime validation where needed.

Protocol vs Abstract Class

An abstract class would offer stronger guarantees both statically and dynamically, but at the cost of flexibility.
A Protocol, on the other hand, requires little to no changes to our codebase.
- Chat Generators wouldn't need to inherit from a base class.
- Most Chat Generators (including community ones) would already implement the protocol.

For more details on the implementation, look at comments in this PR.

anakin87 · 2025-03-14T12:49:11Z

haystack/__init__.py

@@ -2,57 +2,33 @@
 #
 # SPDX-License-Identifier: Apache-2.0

-import sys


slightly related: after intense debugging, I found that the current haystack/__init__.py, which mixes eager and lazy imports, somehow overshadows the static type of component. This means that we would not able to check Protocol adherence of run methods decorated with @component.output_types.

For this reason, I decided to go back to eager imports in this module.

Is this a bug when using all protocols based on components? If so it could be good to bring this change in a separate PR.

Yes. I will do this in a different PR.

BTW, this PR is just a POC and not meant to be merged.

anakin87 · 2025-03-14T12:52:21Z

haystack/components/extractors/llm_metadata_extractor.py

@@ -13,50 +13,15 @@

 from haystack import Document, component, default_from_dict, default_to_dict, logging
 from haystack.components.builders import PromptBuilder
-from haystack.components.generators.chat import AzureOpenAIChatGenerator, OpenAIChatGenerator


In this module, you can find how we could simplify the LLMMetadataExtractor.
This is just an example, and we should respect our breaking changes policy, first introducing chat_generator (in 2.x.0 release) and then removing generator_api and generator_api_params (in 2.x+1.0 release).

anakin87 · 2025-03-14T12:54:43Z

haystack/components/generators/chat/types.py

+        """
+        ...
+
+    def run(self, messages: List[ChatMessage]) -> Dict[str, Any]:


This is compatible with more rich run methods.
For example def run (self, messages: List[ChatMessage], param2="default", param3="another_default")

Creating a minimal Protocol ensures that our Chat Generators are already compatible.

anakin87 · 2025-03-14T12:56:07Z

haystack/core/component/component.py

@@ -442,7 +447,7 @@ def run(self, value: int):
            instance, {name: OutputSocket(name=name, type=type_) for name, type_ in types.items()}, OutputSocket
        )

-    def output_types(self, **types):
+    def output_types(self, **types: Any) -> Callable[[Callable[P, R]], Callable[P, R]]:


to be able to check the adherence of run method to the Protocol, we need to add type hints to the decorator and the decorator factory.

anakin87 · 2025-03-14T12:56:32Z

try_chatgenerator_protcol.py

@@ -0,0 +1,58 @@
+# SPDX-FileCopyrightText: 2022-present deepset GmbH <info@deepset.ai>


just a simple file showing the Protocol in action

sjrl · 2025-03-14T13:19:37Z

@anakin87 this looks really cool and looks good to me!

sjrl · 2025-03-14T13:30:54Z

Also after rereading the thread on the original issue I believe this protocol is helpful for the static type checking as well as visually as a Haystack user I see that the type hint is ChatGenerator rather than just Component.

Although I still do think that we should add runtime checking where needed (as you originally mention @anakin87).

coveralls · 2025-03-14T14:08:51Z

Pull Request Test Coverage Report for Build 13855199401

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

For more information on this, see Tracking coverage changes with pull request builds.
To avoid this issue with future PRs, see these Recommended CI Configurations.
For a quick fix, rebase this PR at GitHub. Your next report should be accurate.

Details

0 of 0 changed or added relevant lines in 0 files are covered.
41 unchanged lines in 3 files lost coverage.
Overall coverage increased (+0.08%) to 90.0%

Files with Coverage Reduction	New Missed Lines	%
core/component/component.py	1	99.38%
lazy_imports.py	3	78.57%
components/extractors/llm_metadata_extractor.py	37	68.38%

Totals
Change from base Build 13855177274:	0.08%
Covered Lines:	9702
Relevant Lines:	10780

💛 - Coveralls

julian-risch · 2025-03-17T13:42:49Z

PoC looks good to me! 👍 The examples in there are very helpful and I tried pipeline.dumps() with the updated LLMMetadataExtractor to see what it would look like in YAML.

experiment updates

fd8f48d

github-actions bot added topic:tests topic:core type:documentation Improvements on the docs labels Mar 14, 2025

anakin87 added the ignore-for-release-notes PRs with this flag won't be included in the release notes. label Mar 14, 2025

pin ruff

3eeeec3

github-actions bot added the topic:build/distribution label Mar 14, 2025

anakin87 added 5 commits March 14, 2025 11:46

readd docstring

635fc04

headers

d05afa2

paramspec typing_extensions

d0262c4

Merge branch 'main' into chatgenerator-protocol-experiments

621afb6

fixes

8830936

anakin87 commented Mar 14, 2025

View reviewed changes

anakin87 requested review from julian-risch and sjrl March 14, 2025 12:59

anakin87 mentioned this pull request Mar 14, 2025

fix: use eager imports in haystack/__init__.py #9042

Merged

This was referenced Mar 18, 2025

Chat Generator Protocol #9058

Closed

Check ChatGenerators compatibility with ChatGenerator Protocol #9060

Closed

feat: ChatGenerator protocol #9074

Merged

anakin87 closed this Mar 20, 2025

julian-risch mentioned this pull request May 9, 2025

feat : adding a new Protocol for TextEmbedder #9353

Merged

anakin87 deleted the chatgenerator-protocol-experiments branch May 14, 2025 10:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Chat Generatol Protocol POC #9037

Chat Generatol Protocol POC #9037

Uh oh!

anakin87 commented Mar 14, 2025 •

edited

Loading

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

sjrl Mar 14, 2025

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

anakin87 Mar 14, 2025

Uh oh!

sjrl commented Mar 14, 2025

Uh oh!

sjrl commented Mar 14, 2025

Uh oh!

coveralls commented Mar 14, 2025 •

edited

Loading

Uh oh!

julian-risch commented Mar 17, 2025

Uh oh!

Uh oh!

		@@ -0,0 +1,58 @@
		# SPDX-FileCopyrightText: 2022-present deepset GmbH <info@deepset.ai>

Chat Generatol Protocol POC #9037

Chat Generatol Protocol POC #9037

Uh oh!

Conversation

anakin87 commented Mar 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Why

Pros

No Runtime Checking

Protocol vs Abstract Class

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

sjrl Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

anakin87 Mar 14, 2025

Choose a reason for hiding this comment

Uh oh!

sjrl commented Mar 14, 2025

Uh oh!

sjrl commented Mar 14, 2025

Uh oh!

coveralls commented Mar 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Pull Request Test Coverage Report for Build 13855199401

Warning: This coverage report may be inaccurate.

Details

💛 - Coveralls

Uh oh!

julian-risch commented Mar 17, 2025

Uh oh!

Uh oh!

anakin87 commented Mar 14, 2025 •

edited

Loading

coveralls commented Mar 14, 2025 •

edited

Loading