feat: support ollama ai model #1001

Claire-w · 2024-05-25T04:06:02Z

Ⅰ. Describe what this PR did

提供了支持Ollama接口的wasm插件。支持在局域网内访问Ollama服务器。API文档：https://github.com/ollama/ollama/blob/main/docs/openai.md

Ⅱ. Does this pull request fix one issue?

fixes #956

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

envoy.yaml

# File generated by hgctl. Modify as required.

admin:
  address:
    socket_address:
      protocol: TCP
      address: 0.0.0.0
      port_value: 9901
static_resources:
  listeners:
    - name: listener_0
      address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 10000
      filter_chains:
        - filters:
            - name: envoy.filters.network.http_connection_manager
              typed_config:
                "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
                scheme_header_transformation:
                  scheme_to_overwrite: https
                stat_prefix: ingress_http
                # Output envoy logs to stdout
                access_log:
                  - name: envoy.access_loggers.stdout
                    typed_config:
                      "@type": type.googleapis.com/envoy.extensions.access_loggers.stream.v3.StdoutAccessLog
                # Modify as required
                route_config:
                  name: local_route
                  virtual_hosts:
                    - name: local_service
                      domains: [ "*" ]
                      routes:
                        - match:
                            prefix: "/"
                          route:
                            cluster: ollama
                            timeout: 300s
                http_filters:
                  - name: wasmtest
                    typed_config:
                      "@type": type.googleapis.com/udpa.type.v1.TypedStruct
                      type_url: type.googleapis.com/envoy.extensions.filters.http.wasm.v3.Wasm
                      value:
                        config:
                          name: wasmtest
                          vm_config:
                            runtime: envoy.wasm.runtime.v8
                            code:
                              local:
                                filename: /etc/envoy/plugin.wasm
                          configuration:
                            "@type": "type.googleapis.com/google.protobuf.StringValue"
                            value: |
                              {
                                "provider": {
                                  "type": "ollama",
                                  "ollamaServerHost": "192.168.10.8",
                                  "ollamaServerPort": 11434,
                                  "apiTokens": [
                                    "****",
                                    "****"
                                  ],
                                  "timeout": 1200000,
                                  "modelMapping": {
                                    "gpt-4-turbo": "llama2",
                                    "*": "llama2"
                                  }
                                }
                              }
                  - name: envoy.filters.http.router
  clusters:
    - name: httpbin
      connect_timeout: 30s
      type: LOGICAL_DNS
      # Comment out the following line to test on v6 networks
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: httpbin
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: httpbin
                      port_value: 80
    - name: ollama
      connect_timeout: 30s
      type: LOGICAL_DNS
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: ollama
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: 192.168.10.8
                      port_value: 11434

Ⅴ. Special notes for reviews

CLAassistant · 2024-05-25T04:06:08Z

All committers have signed the CLA.

plugins/wasm-go/extensions/ai-proxy/README.md

plugins/wasm-go/Makefile

plugins/wasm-go/extensions/ai-proxy/provider/ollama.go

…a.go

plugins/wasm-go/extensions/ai-proxy/README.md

plugins/wasm-go/extensions/ai-proxy/envoy.yaml

plugins/wasm-go/extensions/ai-proxy/provider/ollama.go

… envoy.yaml

CH3CHO

LGTM. Thanks.

Claire-w and others added 14 commits May 22, 2024 23:53

feat: Add Ollama AI support (alibaba#956)

3e316c5

Merge remote-tracking branch 'upstream/main'

fc24aa6

Merge remote-tracking branch 'upstream/main'

3c06380

test function

70db0d9

test function

1177767

test function

4a9b853

test function

7d6498e

test functions

8e65dfa

test function

ad6f226

test function

8aa44b8

feat: support ollama ai model

aecb0e5

Merge branch 'main' into main

f3b1142

feat: support ollama ai model

777d23f

Merge branch 'main' of https://github.com/Claire-w/higress

6f78dc7

Claire-w requested review from johnlanni, WeixinX and CH3CHO as code owners May 25, 2024 04:06

CH3CHO reviewed May 25, 2024

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/README.md Outdated Show resolved Hide resolved

CH3CHO reviewed May 25, 2024

View reviewed changes

plugins/wasm-go/Makefile Outdated Show resolved Hide resolved

plugins/wasm-go/extensions/ai-proxy/provider/ollama.go Outdated Show resolved Hide resolved

plugins/wasm-go/extensions/ai-proxy/provider/ollama.go Show resolved Hide resolved

Claire-w added 2 commits May 25, 2024 01:21

Change ollamaServerPort type to number; remove getcontenterr in ollam…

c02e421

…a.go

Modified ollama.go onRequestBody

3e8443c

CH3CHO reviewed May 27, 2024

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/README.md Outdated Show resolved Hide resolved

plugins/wasm-go/extensions/ai-proxy/envoy.yaml Outdated Show resolved Hide resolved

plugins/wasm-go/extensions/ai-proxy/provider/ollama.go Outdated Show resolved Hide resolved

Changed ollamaServerIP to ollamaServerHost; added moonshot cluster to…

3d9f47f

… envoy.yaml

Claire-w changed the title ~~Support for Ollama API~~ feat: support ollama ai model May 27, 2024

Change envoy.yaml to the original one

e66ffa1

CH3CHO approved these changes May 28, 2024

View reviewed changes

Merge branch 'main' into main

6173b17

CH3CHO merged commit 50f79c9 into alibaba:main May 28, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: support ollama ai model #1001

feat: support ollama ai model #1001

Claire-w commented May 25, 2024 •

edited

Loading

CLAassistant commented May 25, 2024 •

edited

Loading

CH3CHO left a comment

feat: support ollama ai model #1001

feat: support ollama ai model #1001

Conversation

Claire-w commented May 25, 2024 • edited Loading

Ⅰ. Describe what this PR did

Ⅱ. Does this pull request fix one issue?

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

Ⅴ. Special notes for reviews

CLAassistant commented May 25, 2024 • edited Loading

CH3CHO left a comment

Choose a reason for hiding this comment

Claire-w commented May 25, 2024 •

edited

Loading

CLAassistant commented May 25, 2024 •

edited

Loading