
help request: How to proxy an LLM through APISIX #13121

@liuxiaobo007

Description


  1. Call the LLM directly:

    curl https://api-inference.modelscope.cn/v1/chat/completions \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer xxxxxxxxxxxxxxxx" \
      -d '{
        "model": "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
        "messages": [
          {"role": "user", "content": "hello"}
        ],
        "stream": false
      }'
  2. Add a route in APISIX:

    curl -X PUT http://127.0.0.1:9180/apisix/admin/routes/1 \
      -d '{
        "uri": "/v1/chat/completions",
        "upstream": {
          "nodes": {
            "api-inference.modelscope.cn:443": 1
          },
          "type": "roundrobin",
          "scheme": "https"
        }
      }'
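Two details commonly trip up a route like this when the upstream is a hosted HTTPS API. First, the Admin API call normally needs the admin key header (`X-API-KEY`), or APISIX rejects it as unauthorized. Second, by default APISIX forwards the client's `Host` header (here `127.0.0.1:9080`), which a TLS-fronted hosted API will typically reject; setting `pass_host` to `node` makes APISIX use the upstream node's own host for the `Host` header and SNI instead. A hedged sketch of the same route with both changes (the `${ADMIN_API_KEY}` value is a placeholder for whatever admin key your deployment has configured):

```shell
# Sketch: same route as above, but authenticated against the Admin API and
# sending the upstream's own Host/SNI ("pass_host": "node").
# ${ADMIN_API_KEY} is a placeholder — use the key from your conf/config.yaml.
curl -X PUT http://127.0.0.1:9180/apisix/admin/routes/1 \
  -H "X-API-KEY: ${ADMIN_API_KEY}" \
  -d '{
    "uri": "/v1/chat/completions",
    "upstream": {
      "nodes": { "api-inference.modelscope.cn:443": 1 },
      "type": "roundrobin",
      "scheme": "https",
      "pass_host": "node"
    }
  }'
```

This is a configuration sketch against a running gateway, not something to run as-is; whether `pass_host` is the actual culprit depends on what error the upstream returns.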

  3. Call the LLM through APISIX:

    curl -X POST "http://127.0.0.1:9080/v1/chat/completions" \
      -H "Content-Type: application/json" \
      -H "Authorization: Bearer xxxxxxxxxxx" \
      -d '{
        "model": "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
        "messages": [
          {"role": "user", "content": "who are you"}
        ],
        "stream": false
      }'


I checked it again and again but found nothing wrong. Why can't I successfully proxy the LLM?
I deployed APISIX directly with the curl -sL https://run.api7.ai/apisix/quickstart | sh command.
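To narrow down where the request fails, it helps to look at the actual status code APISIX returns and at its error log. A minimal debugging sketch (the log path assumes a default source install; under the quickstart's Docker setup you would inspect the container's logs instead):

```shell
# Show the HTTP status line and headers APISIX returns for the proxied call
curl -i -X POST "http://127.0.0.1:9080/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer xxxxxxxxxxx" \
  -d '{"model": "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
       "messages": [{"role": "user", "content": "hello"}],
       "stream": false}'

# Inspect recent APISIX errors (default install path; an assumption here)
tail -n 50 /usr/local/apisix/logs/error.log
```

A 502/503 from APISIX with a TLS or host-related message in the error log would point at the upstream `scheme`/`Host` configuration rather than the route match itself.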

Environment

  • APISIX version (run apisix version): 3.15
  • Operating system (run uname -a): Rocky Linux 8.10
  • OpenResty / Nginx version (run openresty -V or nginx -V):
  • etcd version, if relevant (run curl http://127.0.0.1:9090/v1/server_info):
  • APISIX Dashboard version, if relevant:
  • Plugin runner version, for issues related to plugin runners:
  • LuaRocks version, for installation issues (run luarocks --version):

Metadata

Labels: question (label for questions asked by users)
Status: Done