sync with upstream #12

Abirdcfly · 2024-04-10T06:08:26Z

Update to tmc@8b67ef3 (v0.1.8 plus 11 commits).

Highlights:

* change mrkl-prompt to python version tmc/langchaingo#653: updates the MRKL prompt to match the Python version. Our agents depend on it.
* httputil: Add httputil package to provide some common helpers tmc/langchaingo#702: adds HTTP debugging tools that can help us figure out why we get 400 errors when calling fastchat. (We have to rewrite this method to write to a logger instead of using fmt.Print.)
* tooling: Update minimum go version to 1.22, update golangci-lint tmc/langchaingo#722: updates to Go 1.22.
* googleai: add safety/harm threshold settings tmc/langchaingo#744: may reduce GitHub Actions errors.

Signed-off-by: Abirdcfly <fp544037857@gmail.com>

Adds the `WithFormat` option, which sets the data format (Ollama currently supports only JSON). This gives more flexibility and control over the format of data processed by the client. The test `TestWithFormat` asserts that the option works. Using the JSON format lets you simulate `Functions` via prompt injection, because it forces Ollama to respond in valid JSON.
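
To illustrate why a guaranteed-JSON reply is enough to simulate functions, here is a minimal sketch of the receiving side. It assumes the reply comes from a client configured with the `WithFormat` option and that the system prompt asked the model to answer with a `tool` name and a `tool_input` object; that schema, the tool names, and the `toolCall` and `dispatch` identifiers are all hypothetical.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// toolCall is a hypothetical schema that the injected prompt asks the model
// to follow. It is not defined by Ollama or by langchaingo.
type toolCall struct {
	Tool  string         `json:"tool"`
	Input map[string]any `json:"tool_input"`
}

// dispatch parses the model's JSON reply and routes it to a local function.
func dispatch(reply string) (string, error) {
	var call toolCall
	if err := json.Unmarshal([]byte(reply), &call); err != nil {
		return "", fmt.Errorf("reply is not valid JSON: %w", err)
	}
	switch call.Tool {
	case "getCurrentWeather":
		loc, _ := call.Input["location"].(string)
		return "weather requested for " + loc, nil
	case "finalResponse":
		resp, _ := call.Input["response"].(string)
		return resp, nil
	default:
		return "", fmt.Errorf("unknown tool %q", call.Tool)
	}
}

func main() {
	// A reply of the kind a JSON-constrained model might produce.
	reply := `{"tool": "getCurrentWeather", "tool_input": {"location": "Beijing"}}`
	out, err := dispatch(reply)
	fmt.Println(out, err)
}
```

Without the JSON constraint, the `json.Unmarshal` step would fail whenever the model wrapped its answer in prose, which is why the format option matters for this pattern.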

* main:
  chains: fix add ignore StreamingFunc (tmc#639)
  Update huggingface.mdx
  chore: Pinning chroma-go ahead of major new release
  Clean up sequential_chain_example to make it a bit more readable (tmc#635)
  Fix llm_math_chain example (tmc#634)
  Update comments, bump example dependencies and clarify chain example (tmc#633)
  chains: fix issue with overriding defaults in chainCallOption (tmc#632)
  chains: add test with GoogleAI (tmc#628)
  Revert "googleai: fix options need add default value" (tmc#627)
  googleai: fix options need add default value

1. Add pgvector to the tests.
2. Add OPENAI_API_KEY and GENAI_API_KEY to the tests.
3. Deprecate the pgvector table name Sanitize function.
4. Reset the pgvector Search SQL and make TestDeduplicater rerun.
5. Add TestWithAllOptions to test all options.
6. Update some tests, because StuffDocuments.joinDocuments ignores document metadata.

Signed-off-by: Abirdcfly <fp544037857@gmail.com>

* llms/anthropic: complete support for messages api
* llms/anthropic: fixed linting errors
* llms/anthropic: remove fatals
* llms/anthropic: fixed linting errors
* llms/anthropic: remove fatal from completions
* llms/anthropic: Default to use messages api, update example to use Opus

---------

Co-authored-by: Travis Cline <travis.cline@gmail.com>

…Content with images (tmc#713)

* llms/bedrock/internal/bedrockclient: Currently, antropicBinGenerationInputSource.Type is fixed to base64.

According to the Claude3 API documentation, the current image input format only accepts base64. https://docs.anthropic.com/claude/reference/messages_post
Therefore, the existing implementation generates the following error when making a request with an image:

```
operation error Bedrock Runtime: InvokeModel, https response error StatusCode: 400, RequestID: 00000000-0000-0000-0000-0000000000000000, ValidationException: messages.0.content.0.image.source: Input tag 'image' found using 'type' does not match any of the expected tags: 'base64'
exit status 1
```

This commit corrects the above error and allows Claude3 to be called via Bedrock with image input.

* llms/bedrock/internal/bedrockclient: Consider MultiPart MessageContent.

The current implementation of llms/bedrock/internal/bedrockclient/provider_anthropic.processInputMessagesAnthropic does not seem to account for a MessageContent with multiple parts. Passing a MessageContent like the following results in an error:

```
[]llms.MessageContent{
    {
        Role: schema.ChatMessageTypeHuman,
        Parts: []llms.ContentPart{
            llms.BinaryPart("image/png", image),
            llms.TextPart("Please text what is written on this image."),
        },
    },
}
```

```
operation error Bedrock Runtime: InvokeModel, https response error StatusCode: 400, RequestID: 00000000-0000-0000-0000-0000000000000000, ValidationException: messages: roles must alternate between "user" and "assistant", but found multiple "user" roles in a row
```

This happens because []llms.MessageContent is converted to []bedrockclient.Message. This commit fixes the error by modifying processInputMessagesAnthropic: the []bedrockclient.Message argument is chunked into groups of messages with the same Role, and each chunk is then converted to an anthropicTextGenerationInputMessage.

* llms/bedrock/internal/bedrockclient: fix lint for Consider pre-allocating `currentChunk` (prealloc)

golang-ci lint error message:

```
Error: Consider pre-allocating `currentChunk` (prealloc)
```

fix this

* llms/bedrock/internal/bedrockclient: fix lint goconst

fix golang lint

```
string `text` has 3 occurrences, but such constant `AnthropicMessageTypeText` already exists (goconst)
```
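
The role-chunking step described in the MultiPart commit can be sketched as below. This is an illustration, not the code from tmc#713: `message` is a hypothetical stand-in for bedrockclient.Message, `chunkByRole` is an invented name, and grouping only consecutive messages is an assumption drawn from the "roles must alternate" error.

```go
package main

import "fmt"

// message is a hypothetical stand-in for bedrockclient.Message.
type message struct {
	Role    string
	Type    string
	Content string
}

// chunkByRole groups consecutive messages that share a Role, so that each
// group can later be turned into one multi-part input message.
func chunkByRole(msgs []message) [][]message {
	var chunks [][]message
	for _, m := range msgs {
		last := len(chunks) - 1
		if last >= 0 && chunks[last][0].Role == m.Role {
			// Same role as the previous message: extend the current chunk.
			chunks[last] = append(chunks[last], m)
			continue
		}
		// Role changed (or first message): start a new chunk.
		chunks = append(chunks, []message{m})
	}
	return chunks
}

func main() {
	msgs := []message{
		{Role: "user", Type: "image", Content: "<base64 data>"},
		{Role: "user", Type: "text", Content: "What is written on this image?"},
		{Role: "assistant", Type: "text", Content: "It says hello."},
	}
	for _, c := range chunkByRole(msgs) {
		fmt.Printf("%s: %d part(s)\n", c[0].Role, len(c))
	}
}
```

With this grouping, an image part and a text part from the same user turn end up in one chunk, so the request no longer contains two "user" messages in a row.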

…#722)

* go: Update to go 1.22, update golangci-lint config
* lint: Address various lint issues
* chains: fix lint complaint in TestApplyWithCanceledContext
* lint: Address additional lint issues
* lint: Address additional lint issues
* tools: update golangci-lint to 1.57

…pport (tmc#709)

* openai: Take steps to make tool calls over the older function calling API
* openai: Additional steps to evolve towards newer tool calling interface
* openai: Connect tool calling for openai backend
* openai: Fix up lint issue
* examples: pull httputil use
* tools: iterate on tools support
* openai: Fix up tool call response mapping
* llms: Cover additional message type in additional backends
* examples: temporarily point to branch
* openai: change type switch for ToolCallResponse
* examples: Clean up and refactor openai function calling example
* mistral: respect ChatMessageTypeTool

devinyf and others added 30 commits February 18, 2024 16:54

fix: agent_stream_callback sometimes failed to detect keyword

b534738

trim potential spaces and colon in the beginning

6d008d1

Set PrintOutput to be true only once

5ba4c9e

add test case

491ba5f

table driven test

65fc5f7

inline the test cases

d28c808

chore: Pinning chroma-go ahead of major new release

c4ddf0b

Update huggingface.mdx

0804075

chains: fix add ignore StreamingFunc (tmc#639)

17002cc

Signed-off-by: Abirdcfly <fp544037857@gmail.com>

Merge pull request tmc#640 from amikos-tech/chore/chroma-go-pin

0dedf90

chore: Pinning chroma-go ahead of major new release

Merge pull request tmc#641 from devalexandre/patch-1

66bf2dd

Add huggingface documentation

example: add new Ollama functions example

997607a

Add an example showing how the Ollama `JSON` format in combination with prompt injection can be used to simulate Functions.

feat: run integration tests for vector databases using testcontainers-go modules

3320490

chore: skip qdrant tests if the OpenAI api key is not set

1d71027

Merge pull request tmc#614 from devinyf/fix_agent_stream_callback

b74cceb

Fix agent stream callback

Merge pull request tmc#647 from corani/corani/ollamajson

81643a8

feat: add JSON format option to ollama

Merge pull request tmc#648 from mdelapenya/more-its

8cbd678

feat: run integration tests for vector databases using testcontainers-go modules

embeddings: Add Amazon Bedrock embeddings (tmc#643)

a51062f

* Add bedrock embedding provider
* Add bedrock tests

---------

Co-authored-by: Travis Cline <travis.cline@gmail.com>

vectorstores: Add support for OpenAI Organization ID header in Chroma

f67b196

* Update `chroma-go` to the latest version
* Add error handling to NewOpenAIEmbeddingFunction
* Add a new property to the store (`openaiOrganization`) and pass it to `chroma-go`

chore: Refactor Chroma to change functions name using OpenAi to use `OpenAI`

7947f6d

Merge pull request tmc#646 from AshDevFr/chroma-openai-org-id

3b7175c

vectorstores: Add support for OpenAI Organization ID header in Chroma

change prompt: defaultMrklPrefix to python version

25c6665

Merge pull request tmc#653 from devinyf/fix_mrkl_prompt

ed99f8c

change mrkl-prompt to python version

examples: Point to v0.1.5

6cc64f8

Merge pull request tmc#656 from tmc/update-examples

01bf6a4

examples: Point to v0.1.5

googleai: return err not log.Fatal when stream get error (tmc#663)

24cb833

Signed-off-by: Abirdcfly <fp544037857@gmail.com>

docs: fixup a bunch of links (tmc#659)

255f6a9

It seems like a bunch of links were broken and docs.langchain.com now redirects to the python docs.

joeychilson and others added 27 commits March 21, 2024 20:58

readme: Include contributors (tmc#714)

83bf27c

llms/cloudflare: Implement Cloudflare Workers AI LLM (tmc#679)

7fb9a13

* Add Cloudflare Workers AI LLM
* lint fixes
* text generation: support streaming response
* add tests
* review fixes
* minor http client fix

add new example for OCR using Claude3's Vision feature with Bedrock (tmc#715)

d5f11f0

* examples: add new example for Added sample code for OCR using Claude3's Vision feature with Bedrock

llms: Add mistral hosted inference llm implementation (tmc#717)

8d90359

* llms: added mistral

vectorstores/weaviate: Update testcontainer image (tmc#719)

3932b31

openai: Render single text content parts directly (tmc#734)

1261877

Fixes tmc#728

all: set explicit 1.22 version in go.mod (tmc#727)

319b863

googleai: increase default max tokens setting (tmc#726)

d822839

examples: Fix and tidy examples, add nvidia example (tmc#735)

b15223e

* examples: Add nvidia completion example
* examples: Tidy up examples
* examples: Point nvidia example to main

openai: WithEmbeddingModel option is incorrectly designating the desired model to use for embedding (tmc#731)

05ab264

* chore: issue tmc#729
* chore: issue tmc#729

llms: Add Seed option to all supporting backends (tmc#732)

0b63daa

* feat: add Seed in mistral
* feat: add Seed in mistral, openai issue tmc#723

examples: Update examples to v0.1.8 (tmc#736)

a47ef50

googleai: vertex - upgrade dep version and increase default max tokens (tmc#742)

97e3644

googleai: combine options for googleai and vertex (tmc#743)

14a2806

* googleai: combine options for googleai and vertex
* lint

googleai: add safety/harm threshold settings (tmc#744)

ce2a479

* googleai: add safety/harm settings
* tests: make configuration options testable

GH actions: update lint workflow to newer version of Go (tmc#745)

73710c5

vectorstores/milvus: Update testcontainer image (tmc#741)

930e0fb

feat: update image

tools/sqldataase: update postgres image (tmc#740)

7bbb2d8

feat: update postgres image

chains: Update mysql testcontainer image (tmc#739)

1f45c81

feat: update mysql image

vectorstores/qdrant: Update testcontainer image (tmc#737)

33b8795

* feat: update qdrant image
* feat: add opts in test, because default model embedding does not work
* feat: add opts in test, because default model embedding does not work
* chore: removed 5 lines for lint

doc: fix typo (tmc#758)

d161462

examples: clarify openai-function-call-example (tmc#751)

8b67ef3

examples: clarify openai-function-call-example

clean up the flow and don't use globals

Merge remote-tracking branch 'upstream/main' into dev-update

ce2f2a3

Abirdcfly changed the title ~~Dev update~~ sync with upstream Apr 10, 2024

bjwswang marked this pull request as ready for review April 16, 2024 09:17

bjwswang marked this pull request as draft April 16, 2024 09:21
