
feat: Add Anthropic invocation layer #4818

Merged 26 commits from AnthropicClaude into main on May 11, 2023
Conversation

silvanocerza
Contributor

Proposed Changes:

Add new invocation layer to support Anthropic Claude.

How did you test it?

I wrote tests and ran them locally.

Notes for the reviewer

Supersedes #4512.

@silvanocerza silvanocerza self-assigned this May 5, 2023
@silvanocerza silvanocerza requested a review from a team as a code owner May 5, 2023 12:37
@silvanocerza silvanocerza requested review from julian-risch and removed request for a team May 5, 2023 12:37
@github-actions github-actions bot added the topic:tests and type:documentation (Improvements on the docs) labels May 5, 2023
@silvanocerza silvanocerza requested a review from vblagoje May 5, 2023 12:37
@coveralls
Collaborator

coveralls commented May 5, 2023

Coverage Status

Coverage: 37.524% (+1.0%) from 36.543% when pulling fd67dce on AnthropicClaude into 705a2c0 on main.

@julian-risch julian-risch removed their request for review May 8, 2023 07:36
# if a List[str] is used we must also set is_pretokenized to True.
# We split at spaces because if we pass the string directly the encoded prompts
# contain strange characters in place of spaces.
encoded_prompt: Encoding = self.tokenizer.encode(prompt.split(" "), is_pretokenized=True)
Member

The code comment and the code seem to be out of sync. Do we actually deal with a List[str] here? Most likely not, right?

Contributor Author
@silvanocerza silvanocerza May 10, 2023

No, the comment is correct.

At line 146 we fail when prompt is a List, so here it can only be a str.

The first two lines of the comment clarify why we're setting is_pretokenized, and the last clarifies why we split prompt.
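The pattern being discussed can be sketched with the Hugging Face tokenizers library. The toy WordLevel vocabulary below is a stand-in for illustration only, not the actual Claude tokenizer:

```python
from tokenizers import Tokenizer
from tokenizers.models import WordLevel

# Tiny illustrative vocabulary standing in for the real tokenizer file.
vocab = {"[UNK]": 0, "Hello": 1, "Claude": 2}
tokenizer = Tokenizer(WordLevel(vocab, unk_token="[UNK]"))

prompt = "Hello Claude"
# Mirroring the reviewed code: split the prompt on spaces and pass the
# word list with is_pretokenized=True instead of encoding the raw string.
encoded = tokenizer.encode(prompt.split(" "), is_pretokenized=True)
print(encoded.ids)  # -> [1, 2]
```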


kwargs_with_defaults = self.model_input_kwargs

if "stop_sequence" in kwargs:
Member

We have stop_words as the parameter at the PromptNode init level, which we then pass to the invocation layer, and the invocation layer translates stop_words to its provider-specific parameter name. As it stands, though I'm not 100% sure, it looks like stop_words from the PromptNode level will not get translated to the stop_sequence parameter. Please double-check.

Member

Update: it does get translated, but please double-check why text generation fails in the default case, when no stop_words are passed from the PromptNode level.

Contributor Author

The test_invoke_non_streamed integration test passes with no issues.
That one invokes the layer with just the prompt; that should be enough, shouldn't it?

Member

It should, but in this case it's a problem because stop_words from PromptNode gets passed as None, since we allow None. And Anthropic, unlike all other providers, doesn't accept None and raises an exception.

Contributor Author

Ah, makes sense. I'll handle that case.
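A minimal sketch of the fix under discussion, using hypothetical helper and constant names (this is not the actual Haystack code): when PromptNode passes stop_words=None explicitly, normalize it to a default instead of forwarding None to the Anthropic API.

```python
from typing import List, Optional

# Illustrative default stop sequence; Anthropic's human turn marker.
ANTHROPIC_HUMAN_PROMPT = "\n\nHuman:"

def resolve_stop_sequences(stop_words: Optional[List[str]]) -> List[str]:
    # PromptNode allows stop_words=None, but the Anthropic API rejects
    # None, so fall back to the default before building the payload.
    if stop_words is None:
        return [ANTHROPIC_HUMAN_PROMPT]
    return list(stop_words)

print(resolve_stop_sequences(None))     # -> ['\n\nHuman:']
print(resolve_stop_sequences(["END"]))  # -> ['END']
```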

stream = (
kwargs_with_defaults.get("stream", False) or kwargs_with_defaults.get("stream_handler", None) is not None
)
stop_words = kwargs_with_defaults.get("stop_words", [human_prompt])
Member

I believe we should ensure that user-specified stop_words are added to the default Anthropic uses, i.e. we should have a list that contains both the user-specified stop_words and the default ["\n\nHuman:"].

Contributor Author

I think Anthropic already adds it server side, but I handled it anyway.
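One way to sketch the merge the reviewer asks for (names are illustrative, not the merged implementation): keep whatever the user supplies and make sure the default "\n\nHuman:" marker is present exactly once.

```python
# Anthropic's default stop sequence (the human turn marker).
HUMAN_PROMPT = "\n\nHuman:"

def merge_stop_words(user_stop_words):
    # Preserve the user's stop words and append the default marker
    # only if it isn't already in the list.
    merged = list(user_stop_words or [])
    if HUMAN_PROMPT not in merged:
        merged.append(HUMAN_PROMPT)
    return merged

print(merge_stop_words(["END"]))  # -> ['END', '\n\nHuman:']
print(merge_stop_words(None))     # -> ['\n\nHuman:']
```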

Member
@vblagoje vblagoje left a comment

Left some minor comments regarding stop_words

@vblagoje vblagoje self-requested a review May 10, 2023 13:44
Member
@vblagoje vblagoje left a comment

LGTM

@silvanocerza silvanocerza merged commit 98947e4 into main May 11, 2023
@silvanocerza silvanocerza deleted the AnthropicClaude branch May 11, 2023 08:14