2 changes: 1 addition & 1 deletion ChatQnA/docker_compose/amd/gpu/rocm/README.md
@@ -290,7 +290,7 @@ docker compose up -d
Try the command below to check whether the TGI service is ready.

```bash
-docker logs ${CONTAINER_ID} | grep Connected
+docker logs chatqna-tgi-server | grep Connected
```

If the service is ready, you will get the response like below.
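Rather than re-running the log check by hand, the step above can be wrapped in a small polling helper so scripts do not race the container start. This is a sketch, not part of the repository; the container name `chatqna-tgi-server` comes from the compose file above, and the 2-second interval and 60-second default timeout are arbitrary choices:

```shell
# wait_for_log CMD PATTERN [TIMEOUT_SECONDS]: re-run CMD until its output
# contains PATTERN, or give up after TIMEOUT_SECONDS (default 60).
wait_for_log() {
  cmd=$1; pattern=$2; timeout=${3:-60}; elapsed=0
  until sh -c "$cmd" 2>&1 | grep -q "$pattern"; do
    [ "$elapsed" -ge "$timeout" ] && return 1
    sleep 2
    elapsed=$((elapsed + 2))
  done
}

# Usage against the TGI container started by the compose file:
# wait_for_log "docker logs chatqna-tgi-server" Connected && echo "TGI is ready"
```

The helper returns non-zero on timeout, so it composes cleanly with `&&`/`||` in provisioning scripts.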
6 changes: 3 additions & 3 deletions ChatQnA/docker_compose/intel/hpu/gaudi/README.md
@@ -314,7 +314,7 @@ For validation details, please refer to [how-to-validate_service](./how_to_valid
Try the command below to check whether the LLM serving is ready.

```bash
-docker logs tgi-service | grep Connected
+docker logs tgi-gaudi-server | grep Connected
```

If the service is ready, you will get the response like below.
@@ -327,15 +327,15 @@ For validation details, please refer to [how-to-validate_service](./how_to_valid

```bash
# TGI service
-curl http://${host_ip}:9009/v1/chat/completions \
+curl http://${host_ip}:8005/v1/chat/completions \
-X POST \
-d '{"model": ${LLM_MODEL_ID}, "messages": [{"role": "user", "content": "What is Deep Learning?"}], "max_tokens":17}' \
-H 'Content-Type: application/json'
```

```bash
# vLLM Service
-curl http://${host_ip}:9009/v1/chat/completions \
+curl http://${host_ip}:8007/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{"model": ${LLM_MODEL_ID}, "messages": [{"role": "user", "content": "What is Deep Learning?"}]}'
```
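One caveat with the two commands above: `${LLM_MODEL_ID}` sits inside single quotes, so the shell never expands it, and even when expanded it would need its own JSON double quotes to be valid. A minimal sketch that builds the body safely with `printf` (the fallback model id below is only an example, not something the compose files mandate):

```shell
# Build the request body with printf so the model id is expanded by the
# shell and emitted as a properly quoted JSON string.
LLM_MODEL_ID=${LLM_MODEL_ID:-Intel/neural-chat-7b-v3-3}
PAYLOAD=$(printf '{"model": "%s", "messages": [{"role": "user", "content": "What is Deep Learning?"}], "max_tokens": 17}' "$LLM_MODEL_ID")
echo "$PAYLOAD"

# Send it to whichever backend is running (TGI on 8005, vLLM on 8007):
# curl "http://${host_ip}:8007/v1/chat/completions" \
#   -H 'Content-Type: application/json' -d "$PAYLOAD"
```

Keeping the payload in a variable also makes it easy to reuse the same body against both backends when comparing them.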
4 changes: 2 additions & 2 deletions ChatQnA/docker_compose/nvidia/gpu/README.md
@@ -273,7 +273,7 @@ docker compose up -d
Try the command below to check whether the TGI service is ready.

```bash
-docker logs ${CONTAINER_ID} | grep Connected
+docker logs tgi-server | grep Connected
```

If the service is ready, you will get the response like below.
@@ -285,7 +285,7 @@ docker compose up -d
Then try the `cURL` command below to validate TGI.

```bash
-curl http://${host_ip}:9009/v1/chat/completions \
+curl http://${host_ip}:8008/v1/chat/completions \
-X POST \
-d '{"model": "Intel/neural-chat-7b-v3-3", "messages": [{"role": "user", "content": "What is Deep Learning?"}], "max_tokens":17}' \
-H 'Content-Type: application/json'
```